NVIDIA's new Nemotron 3 Super pairs a 120B total / 12B active hybrid Mamba-Transformer MoE with a native 1M-token context window and open weights, datasets, and recipes. LocalLLaMA discussion centered on whether those openness and efficiency claims translate into realistic home-lab deployments.
#nvidia
NVIDIA AI Developer introduced Nemotron 3 Super on March 11, 2026 as an open 120B-parameter hybrid MoE model with 12B active parameters and a native 1M-token context window. NVIDIA says the model targets agentic workloads with up to 5x higher throughput than the previous Nemotron Super model.
NVIDIAAI says it is partnering with Thinking Machines to deploy at least 1 gigawatt of NVIDIA Vera Rubin systems for frontier AI training. Thinking Machines frames the partnership as infrastructure for both frontier model training and platforms for customizable AI.
ABB Robotics and NVIDIA said they are integrating Omniverse libraries into RobotStudio and plan to ship RobotStudio HyperReality in the second half of 2026. They claim 99% sim-to-real correlation and say the platform can cut engineering time, reduce deployment cost, and speed factory rollout.
OpenAI announced $110B in new investment on February 27, 2026, alongside Amazon and NVIDIA partnerships aimed at compute scale. The company tied the move to 900M weekly ChatGPT users, 9M paying business users, and rising Codex demand.
NVIDIA said major operators and telecom suppliers have agreed to work on 6G using open and secure AI-native platforms. The coalition turns 6G planning into a broader contest over programmable AI infrastructure, not only radios and spectrum.
NVIDIA says its latest healthcare and life sciences AI survey shows the market moving beyond experimentation and toward measurable ROI. The company reports that 70% of surveyed organizations are already using AI and 69% are using generative AI and large language models.
A LocalLLaMA thread spotlights FlashAttention-4, which reports up to 1605 TFLOPs/s on B200 BF16 and introduces pipeline and memory-layout changes tuned for Blackwell constraints.
At Mobile World Congress on February 28, 2026, NVIDIA and major global telecom and infrastructure partners announced a joint commitment to open and secure AI-native 6G platforms. The initiative ties operator adoption, ecosystem standards, and AI-RAN execution into a single coalition roadmap.
On March 2, 2026, NVIDIA and Coherent announced a multiyear strategic agreement focused on advanced optics for AI data centers. The deal includes a $2 billion NVIDIA investment in Coherent plus a multibillion-dollar purchase commitment.
A popular r/pcgaming thread spotlights PCWorld’s report citing Jon Peddie Research data: NVIDIA reportedly holds over 90% of the discrete PC graphics card market, while AMD falls below 10%.
NVIDIA announced a multiyear strategic agreement with Lumentum focused on advanced optics for next-generation AI infrastructure. The nonexclusive deal includes a multibillion-dollar purchase commitment and capacity access rights for laser components. NVIDIA also said it will invest $2 billion in Lumentum for R&D, future capacity, and U.S.-based manufacturing expansion.