NVIDIA Unveils Vera Rubin AI Factory Platform at GTC 2026 — Dedicated Inference Chips, 35x Performance Boost, Open Model Alliance

At GTC 2026 (March 16-19, San Jose), NVIDIA CEO Jensen Huang unveiled a comprehensive expansion of the Vera Rubin AI factory platform, marking a strategic pivot from training-focused to inference-focused AI infrastructure — directly targeting the agentic AI workload explosion.

Key Announcements:

Groq 3 LPX — Dedicated Inference Hardware: For the first time, NVIDIA is adding dedicated inference chips to its platform. The new Groq 3 LPX (Logical Processing Unit) claims a 35x inference performance boost. This is a significant architectural shift — NVIDIA previously relied on GPUs for both training and inference, but the scale of agentic AI inference demands purpose-built silicon.
Vera Rubin Platform Expansion:

7-chip AI factory platform architecture
Custom CPU racks optimized for inference workloads
New storage architecture for agent state management
Inference operating system for orchestrating agent workloads

DGX Station with GB300 Superchips: First DGX Station systems powered by GB300 superchips have shipped to pioneering developers, enabling deskside development with frontier-scale models.
Open Model Alliance: NVIDIA launched a multi-lab open-source model coalition, partnering with external research labs to create shared model infrastructure.
Broader Ecosystem:

Cisco expanding Secure AI Factory strategy across data center, telco edge, and enterprise
Hitachi Vantara expanding Hitachi iQ for responsible agentic AI
Multiple partners showcasing agentic AI infrastructure at GTC booths

Jensen Huang's Five Arguments for Continued AI Build-Out: Huang positioned AI infrastructure spending as essential, noting that prospective employees in Silicon Valley are already asking how many tokens come with a job offer — compute access as a talent signal.

The conference agenda has decisively shifted from model training to practical applications: inference, autonomous AI agents, and infrastructure capable of serving them in real time.

Analysts estimate the AI chip market is approaching $1 trillion, with GTC 2026 resetting expectations for infrastructure investment.

NVIDIA Unveils Vera Rubin AI Factory Platform at GTC 2026 — Dedicated Inference Chips, 35x Performance Boost, Open Model Alliance

Sources

Share this article

🧠 Stay Updated on AI Agents

Deploy Your AI Agent Today

More from AI Infrastructure