Nvidia's Rubin CPX: Revolutionizing AI Model Analysis by End of 2026

Nvidia's Rubin CPX, a GPU built for compute-intensive AI inference, is due by the end of 2026. Nvidia is pitching it on its specifications and on the benchmark records already set by the current Blackwell Ultra generation.

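Nvidia has announced Rubin CPX, a GPU designed for compute-intensive AI workloads that must analyze very large inputs. It is due by the end of 2026 and serves as dedicated hardware for disaggregated inference, in which the compute-bound context (prefill) phase of a request runs on different hardware than the memory-bandwidth-bound generation (decode) phase.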
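To make the idea of disaggregated inference concrete, here is a minimal Python sketch of how a single request might be split across two hardware pools. The pool names, the `Request` type, and the `plan` function are all hypothetical and exist only for illustration; this is not Nvidia's scheduler or any real API.

```python
from dataclasses import dataclass


@dataclass
class Request:
    prompt_tokens: int    # length of the input context
    max_new_tokens: int   # number of tokens to generate


# Hypothetical pool names standing in for two groups of accelerators.
CONTEXT_POOL = "context-pool"        # compute-optimized devices (assumed)
GENERATION_POOL = "generation-pool"  # memory-bandwidth-optimized devices (assumed)


def plan(request: Request) -> list[tuple[str, str, int]]:
    """Return the (phase, pool, tokens) steps for one request.

    The context phase processes the whole prompt once and produces a
    KV cache; the cache is handed to the generation pool, which then
    produces the output tokens.
    """
    return [
        ("context", CONTEXT_POOL, request.prompt_tokens),
        ("kv-transfer", GENERATION_POOL, request.prompt_tokens),
        ("generation", GENERATION_POOL, request.max_new_tokens),
    ]


steps = plan(Request(prompt_tokens=1_000_000, max_new_tokens=512))
for phase, pool, tokens in steps:
    print(f"{phase:12s} -> {pool:16s} ({tokens} tokens)")
```

The point of the split is that each pool can be sized and equipped for its own bottleneck, rather than one type of GPU handling both phases.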

Rubin CPX targets tasks that process large amounts of data at once, such as analyzing entire software codebases or generating video. It uses a monolithic die and delivers 30 petaFLOPS of NVFP4 compute with 128 GB of GDDR7 memory. Nvidia also claims that it processes attention layers three times faster than the Blackwell architecture.

Nvidia's Blackwell Ultra architecture has already set new records in the MLPerf Inference v5.1 benchmark, results that Nvidia presents as support for the disaggregated inference approach. In the Llama 3.1 405B test, the Dynamo framework increased throughput per GPU by almost 1.5x compared with traditional serving methods. Rubin CPX is meant to build on that result with hardware dedicated to the approach.
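As a rough illustration of what a 1.5x gain in throughput per GPU means for capacity planning, the short Python sketch below compares how many GPUs a fixed workload would need. The baseline rate and the target load are made-up numbers chosen only to show the ratio, and "almost 1.5x" is rounded up to 1.5.

```python
import math


def gpus_needed(target_tokens_per_s: float, per_gpu_tokens_per_s: float) -> int:
    """Smallest whole number of GPUs that meets the target throughput."""
    return math.ceil(target_tokens_per_s / per_gpu_tokens_per_s)


# Hypothetical figures, not measurements.
baseline_per_gpu = 100.0   # tokens/s per GPU with traditional serving (assumed)
speedup = 1.5              # "almost 1.5x" from the article, rounded up
target = 30_000.0          # tokens/s the deployment must sustain (assumed)

baseline_gpus = gpus_needed(target, baseline_per_gpu)
disaggregated_gpus = gpus_needed(target, baseline_per_gpu * speedup)
saving = 1 - disaggregated_gpus / baseline_gpus

print(baseline_gpus, disaggregated_gpus, f"{saving:.0%}")
```

Under these assumptions, the same load needs about a third fewer GPUs, which is the practical meaning of the per-GPU figure.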

Several AI companies, including Cursor, Runway, and Magic, are evaluating Rubin CPX for their workloads. The GPU is expected to ship either as an add-in card or as a standalone data center system.

Rubin CPX is due by the end of 2026. Its case rests on two things: hardware specialized for disaggregated inference, and the benchmark gains that approach has already shown on Blackwell Ultra.
