NVIDIA Vera Rubin NVL72-ingrasys

The Vera Rubin Era of AI is Shaping

AI progress is no longer defined by how fast models can be trained, but by how efficiently intelligence can be produced, reasoned, and deployed at massive scale. As AI factories are taking shape, reasoning and agentic workflows demand computing platforms that can sustain multi-step inference, with extremely long context windows, low latency, and energy efficient token generation across GPUs, racks and data centers.

The NVIDIA Vera Rubin platform is built specifically to meet this challenge. Architected as a rack-scale, fully-liquid cooled AI supercomputer, it brings together six chips — NVIDIA Vera CPU, NVIDIA Rubin GPU, NVIDIA NVLink™ 6 Switch, NVIDIA ConnectX®-9 SuperNIC™, NVIDIA BlueField®-4 DPU and NVIDIA Spectrum™-6 Ethernet Switch — through extreme co-design to operate as a unified system, enabling massive bandwidth for both scale-up and scale-out workloads. By optimizing how tokens, model state, and context flow across the AI factory efficiently, NVIDIA Vera Rubin NVL72 delivers astonishing improvements over the Blackwell generation in training efficiency and cost-per-token, setting the foundation for the next AI frontier.

Highlights

36 NVIDIA Vera CPUs
72 NVIDIA Rubin GPUs
Support 6th generation NVIDIA NVLink™ for Scale-up
Support NVIDIA Quantum-X800 InfiniBand and Spectrum-X™ Ethernet for Scale-out and Scale-across
Cableless Compute Tray Design
Built for the Age of Agentic Reasoning

Rack-scale Architecture

Design Details

Design Details

2 x TOR Switches

2 x Management Switches

2 x 3RU Power Shelves

Provides a maximum power output of 110kW per power shelf

10 x Compute Trays

Integrate 2 NVIDIA Vera Rubin Superchips per tray, combining a total of 2 NVIDIA Vera CPUs and 4 NVIDIA Rubin GPUs

9 x NVLink™ Switch Trays

Connect up to 36 NVIDIA Vera Rubin Superchips in one giant NVLink domain

8 x Compute Trays

Integrate 2 NVIDIA Vera Rubin Superchips per tray, combining a total of 2 NVIDIA Vera CPUs and 4 NVIDIA Rubin GPUs

2 x 3RU Power Shelves

Provides a maximum power output of 110kW per power shelf

Key Components of NVIDIA Vera Rubin NVL72

NVIDIA Vera Rubin Superchip

The NVIDIA Vera Rubin Superchip, combining one Vera CPU and two Rubin GPUs with HBM4 memory, forms the core engine for large-scale intelligence production. Delivering a significant boost in communication and memory movement, the superchip enables next-generation inference in the era of agentic AI.

NVIDIA Vera Rubin NVL72 Compute Tray

The Rubin NVL72 compute tray is a liquid-cooled, high-density module integrating two NVIDIA Vera CPUs, four NVIDIA Rubin GPUs, high-bandwidth memory, and seamless sixth-generation NVIDIA NVLink™ connectivity. Designed with a focus on rapid deployment, it features a cable-free modular design that greatly reduces assembly time while delivering outstanding performance to accelerate the AI industrial revolution.