SpatialClaw beats a prior spatial agent by 11.2 points on 20 tests

Spatial reasoning agents may need a better action interface more than a longer list of tools. NVIDIA AI wrote on X that “Code is the right action interface” for these agents, pointing to SpatialClaw, a training-free system that lets a VLM-backed agent write Python inside a persistent kernel. Instead of dispatching only fixed tool calls, the agent can compose perception modules, inspect intermediate outputs, and revise its strategy step by step.

The linked project page gives the strongest evidence. SpatialClaw reports an 11.2-point margin over a recent prior spatial agent across 20 benchmarks, with no benchmark-specific or model-specific tuning. It improves on 19 of 20 benchmarks on the same backbone and shows consistent gains across six VLM backbones. The page also reports an average +6.5 point gain over a no-tool baseline, with larger single-benchmark jumps such as DSI-Bench +17.6 points, MindCube +15.3 points, and MMSI +13.4 points.

NVIDIA AI’s account typically posts research, developer tooling, and infrastructure updates, and this item is more architectural than promotional. The claim is not that a new model alone solved spatial reasoning, but that executable code lets the agent turn perception outputs into reusable variables and computations. What to watch next is whether this pattern survives outside curated benchmarks: sandboxing, tool-state reproducibility, latency, and error recovery will decide whether code-as-action becomes a common interface for visual agents. The source tweet is available on X.

AI 6d ago 2 min read

NVIDIA turns open AI security into a coalition with Microsoft and Cloudflare

NVIDIA launched the Open Secure AI Alliance with Microsoft, Cloudflare, Hugging Face, Palantir, and other partners. The bet is that AI-agent defense needs open models, harnesses, logs, and evaluation tools that defenders can inspect and run themselves.

#nvidia #cybersecurity #open-source

AI Jul 8, 2026 2 min read

NVIDIA Vera targets agent loops with 1.8x sustained per-core x86 performance

NVIDIA detailed Vera, a CPU designed for agentic AI workloads where tool calls, code execution, retrieval, and verification sit between model calls. The company claims 50% higher IPC than Grace and 1.8x sustained per-core performance versus x86 on agentic execution workloads.

#nvidia #vera #ai-infrastructure

AI 6d ago 2 min read

SSI gets 10x more compute as NVIDIA puts Vera Rubin behind Sutskever’s lab

Safe Superintelligence will expand its compute by an order of magnitude through NVIDIA investment and access to Vera Rubin systems. The deal turns a productless frontier lab into a strategic customer for upcoming AI infrastructure.

#nvidia #ssi #compute

Related Articles

NVIDIA turns open AI security into a coalition with Microsoft and Cloudflare

NVIDIA Vera targets agent loops with 1.8x sustained per-core x86 performance

SSI gets 10x more compute as NVIDIA puts Vera Rubin behind Sutskever’s lab