NVIDIA Introduces Rubin CPX GPU for Ultra-Long Context AI Inference

Nvidia unveiled the Rubin CPX, a new GPU optimized for processing context windows exceeding 1 million tokens, at the AI Infrastructure Summit. The processor is part of the upcoming Rubin series and is designed for disaggregated inference, enabling stronger performance on complex, long-context workloads such as video generation and software development.

The announcement highlights Nvidia’s continued push to expand its dominance in AI hardware, following record-breaking momentum in its data center business, which generated $41.1 billion last quarter alone. By extending context capacity at scale, the Rubin CPX is positioned to support the next wave of advanced AI applications. The GPU is expected to become commercially available by late 2026.

James Dargan

James Dargan is a writer and researcher at The AI Insider. His focus is on the AI startup ecosystem and he writes articles on the space that have a tone accessible to the average reader.

Share this article:

AI Insider

Discover the future of AI technology with "AI Insider" - your go-to platform for industry data, market insights, and groundbreaking AI news

Subscribe today for the latest news about the AI landscape