NVIDIA launches Vera CPU for agentic AI with 50% faster results and 2x efficiency claims
Original: NVIDIA Launches Vera CPU, Purpose-Built for Agentic AI View original →
NVIDIA unveiled Vera CPU on March 23, 2026. The company describes it as the first processor purpose-built for the age of agentic AI and reinforcement learning, claiming 50% faster results and twice the efficiency of traditional rack-scale CPUs. The pitch is aimed at workloads where infrastructure around the model matters more, including task planning, tool execution, code running, and result validation.
The technical profile is aggressive. Vera uses 88 custom Olympus cores, up to 1.2 TB/s of LPDDR5X memory bandwidth, and NVIDIA's second-generation scalable coherency fabric. NVIDIA also introduced a Vera rack with 256 liquid-cooled CPUs, which it says can sustain more than 22,500 concurrent CPU environments. In the Vera Rubin NVL72 platform, Vera pairs with NVIDIA GPUs through NVLink-C2C with 1.8 TB/s of coherent bandwidth, which NVIDIA says is 7x PCIe Gen 6.
- 88 custom NVIDIA-designed Olympus cores
- Up to 1.2 TB/s LPDDR5X memory bandwidth
- More than 22,500 concurrent CPU environments in a Vera rack configuration, according to NVIDIA
- Named partners include Alibaba Cloud, CoreWeave, Meta, Oracle Cloud Infrastructure, Dell, HPE, Lenovo, and Supermicro
The broader signal is that NVIDIA is no longer talking about the CPU as a passive companion to the GPU. Jensen Huang framed the CPU as a central part of how agentic systems scale, which fits the industry's shift from standalone model performance toward full rack-level optimization across memory, data movement, orchestration, networking, and security.
Performance and efficiency figures remain NVIDIA's own claims, so the market test will come when cloud providers and enterprises deploy Vera against existing x86 and Arm server fleets and compare total cost, migration complexity, and software compatibility. NVIDIA says Vera is already in full production and will be available from partners in the second half of 2026.
Related Articles
NVIDIA says Vera is the first processor built specifically for agentic AI and reinforcement learning. On Hacker News, the announcement reached 165 points and 98 comments as readers focused on CPU-GPU coupling, rack density, and the practical value of NVIDIA's efficiency claims.
NVIDIA says Vera is now in full production and can complete agentic workloads 1.8x faster than x86 CPUs. OpenAI, Anthropic, SpaceXAI, ByteDance, CoreWeave, and OCI are among the names tied to adoption or evaluation.
A new Goldman Sachs Alternatives report warns that agentic AI systems require 60x to 130x more energy than standard chat models, pointing to a projected 45 GW U.S. power shortfall by 2028 and a 600,000-worker skilled-trades labor gap as the real bottlenecks to AI scaling.
Comments (0)
No comments yet. Be the first to comment!