NVIDIA Introduces Rubin CPX GPU for Ultra-Long Context AI Inference

AI Infrastructure & Compute

Nvidia unveiled the Rubin CPX, a new GPU optimized for processing context windows exceeding 1 million tokens, at the AI Infrastructure Summit. The processor is part of the upcoming Rubin series and is designed for disaggregated inference, enabling stronger performance on complex, long-context workloads such as video generation and software development.

The announcement highlights Nvidia’s continued push to expand its dominance in AI hardware, following record-breaking momentum in its data center business, which generated $41.1 billion last quarter alone. By extending context capacity at scale, the Rubin CPX is positioned to support the next wave of advanced AI applications. The GPU is expected to become commercially available by late 2026.

AI, business, NVIDIA, product release, Rubin CPX GPU

James Dargan

James Dargan is a writer and researcher at The AI Insider. His focus is on the AI startup ecosystem and he writes articles on the space that have a tone accessible to the average reader.

Share this article:

All tags

AI, business, NVIDIA, product release, Rubin CPX GPU

You May Also Be Interested In

10 Robotics Highlights From Nvidia GTC 2026

Greg Bock March 21, 2026

Manifold Announces $8M Seed Funding Round to Secure Autonomous Endpoint AI Agents at Runtime

James Dargan March 21, 2026

Respan Announces $5M in Funding from Gradient, Y Combinator and others

James Dargan March 21, 2026

WordPress.com Introduces AI Agents to Automate Website Creation and Management

James Dargan March 21, 2026

White House Proposes Centralized Federal AI Framework to Override State Regulations

James Dargan March 21, 2026

AI Insider News

Respan Announces $5M in Funding from Gradient, Y Combinator and others

James Dargan March 21, 2026

WordPress.com Introduces AI Agents to Automate Website Creation and Management

James Dargan March 21, 2026

White House Proposes Centralized Federal AI Framework to Override State Regulations

James Dargan March 21, 2026

AI Insider

Discover the future of AI technology with "AI Insider" - your go-to platform for industry data, market insights, and groundbreaking AI news

Related Articles

10 Robotics Highlights From Nvidia GTC 2026

March 21, 2026

Manifold Announces $8M Seed Funding Round to Secure Autonomous Endpoint AI Agents at Runtime

March 21, 2026

Respan Announces $5M in Funding from Gradient, Y Combinator and others

March 21, 2026

WordPress.com Introduces AI Agents to Automate Website Creation and Management

March 21, 2026