This is less about one more cloud partnership and more about the infrastructure shape of the next agent wave. NVIDIA and Google Cloud say A5X Rubin systems can scale to 80,000 GPUs per site and 960,000 across multisite clusters, while cutting inference cost per token and boosting token throughput per megawatt by up to 10x versus the prior generation.
NVIDIA released Nemotron-Personas-Korea on Hugging Face with 7 million synthetic personas grounded in Korean public statistics. The dataset matters because agent localization is no longer only translation; it also requires regional context, honorifics, occupations, and public-service knowledge.
Why it matters: post-training agents increasingly depend on reinforcement learning throughput, not only inference speed. NVIDIA says NeMo RL’s FP8 path delivers a 1.48x speedup on RL workloads with Qwen3-8B-Base while tracking BF16 accuracy.
Why it matters: NVIDIA is aiming generative video research at simulation-ready 3D environments rather than short clips. The tweet says Lyra 2.0 maintains per-frame 3D geometry and uses self-augmented training, while the project page shows outputs as Gaussian splats and meshes that can be exported to Isaac Sim.
Coding agents are being tested on GPU performance work, not just app scaffolding. Cursor says its NVIDIA collaboration produced a 38% geomean speedup across 235 CUDA kernel problems in three weeks.
Why it matters: NVIDIA is turning quantum calibration and error correction into an open model-and-tooling stack instead of a lab-only workflow. The April 14 tweet framed Ising as an open suite, and NVIDIA’s technical post says Ising Calibration 1 scored 14.5% above GPT-5.4 and 3.27% above Gemini 3.1 Pro on QCalEval.
NVIDIA is turning quantum chip calibration and error correction into an open AI stack, with one model family that beats GPT-5.4 on QCalEval and another that speeds decoding by 2.25x. If those gains travel outside NVIDIA's own workflow, one of quantum computing's nastiest software bottlenecks moves closer to something teams can actually deploy.
Space data centers are still mostly future tense, but space inference is starting to look like a real business. Kepler’s in-orbit cluster already links 40 NVIDIA Orin processors across 10 satellites and counts 18 customers, which is enough to move the idea out of pitch-deck territory.
A high-signal r/Games post amplified GamesRadar+ coverage of Jensen Huang defending DLSS 5 as an optional artist tool after the Resident Evil Requiem demo drew "AI slop" criticism.
NVIDIA AI PC said on April 2, 2026 that the new Gemma 4 models are optimized for RTX GPUs and DGX Spark, with the 26B and 31B variants aimed at local agentic AI. NVIDIA's official blog says the collaboration spans RTX PCs, workstations, DGX Spark, Jetson Orin Nano, and data center deployments, with native tool use, multimodal inputs, and local runtime support through Ollama and llama.cpp.
Tom's Hardware says NVIDIA's RTX Neural Texture Compression can cut texture memory by around 85% in its sample scene, but the lowest-VRAM mode adds a measurable performance cost and looks best paired with anti-aliasing such as DLSS.
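To put that ~85% figure in concrete terms, here is a minimal sketch of the arithmetic. The baseline size is a made-up illustrative number, not from the article; only the reduction ratio comes from the reported result.

```python
# Illustrative arithmetic only: estimate a texture pool's footprint after
# the ~85% reduction Tom's Hardware reports for NVIDIA's RTX Neural
# Texture Compression sample scene. The baseline figure is hypothetical.

def compressed_size_mb(baseline_mb: float, reduction: float = 0.85) -> float:
    """Return the estimated texture footprint after the given reduction."""
    return baseline_mb * (1.0 - reduction)

baseline = 272.0  # hypothetical uncompressed texture pool, in MB
print(f"{baseline:.0f} MB -> {compressed_size_mb(baseline):.1f} MB")
```

The point of the sketch is just scale: an 85% cut leaves roughly one-seventh of the original footprint, which is why the trade-off against the mode's performance cost is worth debating at all.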
On April 2, 2026 NVIDIA said it has optimized Google’s latest Gemma 4 models for RTX PCs, DGX Spark, and Jetson edge modules. The move is aimed at turning compact multimodal models into practical local agent stacks rather than leaving them mainly in the cloud.