NVIDIA announced Vera Rubin, a new full-stack computing platform comprising seven chips, five rack-scale systems, and one supercomputer specifically engineered for agentic AI workloads. The platform represents a fundamental shift in how companies approach AI infrastructure, moving beyond individual components to vertically integrated systems optimized as single units. This approach, called extreme codesign, pairs software and silicon design in tandem to maximize efficiency and reduce computational costs.

What Is Extreme Codesign and Why Does It Matter for AI?

Extreme codesign is the practice of designing hardware and software simultaneously rather than sequentially. NVIDIA founder and CEO Jensen Huang highlighted this approach as the foundation of the company's competitive advantage, noting that NVIDIA has achieved "the best token cost in the world" through this methodology. Token cost refers to the expense of processing individual units of data through an AI model; it is a critical metric for determining the overall economics of running large language models (LLMs) and other AI systems at scale.

The Vera Rubin platform includes several new components designed to work together seamlessly. The new NVIDIA Vera CPU anchors the system, paired with the BlueField-4 STX storage architecture. These components are not standalone products but pieces of a larger ecosystem optimized for agentic AI, meaning AI systems that can autonomously plan and execute tasks with minimal human intervention.

What Comes After Vera Rubin?

Looking beyond the current generation, NVIDIA is already planning its next major architecture, called Feynman. This future platform will introduce the NVIDIA Rosa CPU, named after Rosalind Franklin, the scientist whose X-ray crystallography work revealed the structure of DNA.
According to Huang, "As Franklin exposed the hidden architecture of life, Rosa is built to move data, tools and tokens efficiently across the full stack of agentic AI infrastructure." The Feynman generation will pair the Rosa CPU with the LP40, NVIDIA's next-generation LPU (Learning Processing Unit), along with BlueField-5 and CX10 components. These will be connected through NVIDIA Kyber for both copper and co-packaged optics scale-up, and NVIDIA Spectrum-class optical scale-out. Together, these advances target every pillar of what NVIDIA calls the "AI factory": compute, memory, storage, networking, and security.

How to Evaluate AI Infrastructure Platforms for Your Organization

- Vertical Integration: Assess whether your infrastructure provider designs hardware and software together as a unified system rather than combining off-the-shelf components, which typically results in lower efficiency and higher operational costs.
- Token Economics: Compare the cost per token processed across different platforms, as this metric directly impacts the long-term expense of running AI models in production environments.
- Agentic AI Readiness: Evaluate whether the platform is specifically optimized for autonomous AI agents that can execute complex tasks independently, not just traditional language model inference.
- Scalability Architecture: Review the networking and storage components to ensure the platform can scale from single systems to data center-wide deployments without performance degradation.

NVIDIA also announced the Vera Rubin DSX AI Factory reference design and the NVIDIA Omniverse DSX Blueprint, tools that allow companies to simulate AI factories in software before building them physically. DSX Air, part of the broader DSX platform, enables organizations to model their infrastructure investments and optimize configurations before deployment, reducing risk and capital expenditure.

The company is also extending its reach beyond Earth.
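The token-economics point above can be made concrete with a back-of-the-envelope calculation: amortize a platform's hourly operating cost over its sustained token throughput. The sketch below uses hypothetical platform names and figures purely for illustration; they are not NVIDIA or vendor pricing.

```python
# Back-of-the-envelope cost-per-token comparison.
# All platform names and numbers are hypothetical placeholders.

def cost_per_million_tokens(hourly_cost_usd: float, tokens_per_second: float) -> float:
    """Amortize a system's hourly operating cost over its token throughput."""
    tokens_per_hour = tokens_per_second * 3600
    return hourly_cost_usd / tokens_per_hour * 1_000_000

# Hypothetical platforms: (hourly operating cost in USD, sustained tokens/second)
platforms = {
    "integrated_rack": (98.0, 250_000),   # co-designed hardware + software
    "commodity_cluster": (60.0, 90_000),  # off-the-shelf components
}

for name, (hourly, tps) in platforms.items():
    print(f"{name}: ${cost_per_million_tokens(hourly, tps):.3f} per 1M tokens")
```

Note how the arithmetic captures the article's argument: the integrated system costs more per hour, but its higher throughput yields a lower cost per million tokens, which is the figure that compounds in production.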
NVIDIA announced plans to bring AI data centers into orbit through systems like NVIDIA Space-1 Vera Rubin, extending accelerated computing from terrestrial facilities to space-based infrastructure. This represents a significant expansion of where AI computation can occur and opens new possibilities for latency-sensitive applications and distributed AI systems.

During the keynote at GTC 2026, Huang emphasized the scale of current AI demand. He noted that computing demand for NVIDIA GPUs has increased by approximately one million times over recent years, and he projects at least one trillion dollars in revenue from AI infrastructure between 2025 and 2027. This explosive growth reflects the rapid expansion of AI adoption across industries and the corresponding need for more efficient, purpose-built infrastructure.

The shift toward extreme codesign and vertically integrated platforms represents a maturation of the AI infrastructure market. Rather than treating AI as a software problem that can run on generic hardware, leading companies now recognize that optimal performance requires hardware and software to be engineered as cohesive systems. This approach mirrors historical patterns in computing, where specialized architectures consistently outperformed general-purpose alternatives for specific workloads. For organizations evaluating AI infrastructure investments, understanding these architectural principles is essential for making decisions that will remain relevant as AI workloads continue to evolve and scale.
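Huang's "one million times" figure can be put in perspective with simple compounding arithmetic: a millionfold increase is roughly twenty doublings, and the implied annual growth multiplier depends on the window assumed. The ten-year window below is an assumption chosen for illustration, not a figure from the keynote.

```python
import math

# A millionfold increase expressed as doublings: 2**n = 1_000_000
doublings = math.log2(1_000_000)          # about 20 doublings

# Implied compound annual multiplier if the increase took 10 years (assumed window)
years = 10
annual_growth = 1_000_000 ** (1 / years)  # roughly 4x per year

print(f"doublings: {doublings:.2f}")
print(f"implied annual multiplier over {years} years: {annual_growth:.2f}x")
```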