NVIDIA Releases Nemotron 3 Nano Omni: Open 30B Multimodal Model With 9x Higher Throughput

Read in other languages: 한국어日本語
AI May 5, 2026 By Insights AI 1 min read 1 views Source

Open Multimodal AI for Agents

NVIDIA launched Nemotron 3 Nano Omni on April 28, 2026, available immediately via Hugging Face, OpenRouter, build.nvidia.com, and over 25 partner platforms.

Technical Specifications

  • Architecture: 30B-A3B hybrid MoE with Conv3D and EVS
  • Context: 256K tokens
  • Modalities: Video, audio, image, and text in a single model
  • Throughput: 9x higher than comparable open omni models

Designed for Multimodal Agents

Traditional multimodal pipelines require separate systems for vision, speech, and language — introducing latency and complexity. Nemotron 3 Nano Omni integrates these into one model, suited for agents that need to process multiple input types simultaneously without switching between systems.

Early Adoption

Early adopters include Aible, Applied Scientific Intelligence, Eka Care, Foxconn, H Company, Palantir, and Pyler. Dell Technologies, Docusign, Infosys, Oracle, and Zefr are evaluating the model.

Source: NVIDIA Blog

Share: Long

Related Articles

Comments (0)

No comments yet. Be the first to comment!

Leave a Comment