Claude Opus 4.7 finishes a robodog coding task about 20x faster

Agent benchmarks are beginning to leave the IDE and touch hardware. Anthropic’s Project Fetch phase 2 tested whether Claude Opus 4.7 could program a quadruped robot dog, and the company says the model working on its own was about 20 times faster than last year’s best human team aided by Opus 4.1.

In the source tweet, Anthropic wrote that Opus 4.7 was "~20x faster" than the prior human-plus-Claude team. The same post included the grounding caveat: the robodog still failed to fetch the beach ball. That makes the result more useful, not less, because it separates rapid code generation and hardware orchestration from full real-world task success.

Project Fetch comes from Anthropic’s Frontier Red Team work. The setup asks people without robotics expertise, and now a model acting more autonomously, to make unfamiliar robot hardware complete a simple physical objective. This is exactly the kind of test that exposes the gap between software fluency and embodied reliability: documentation has to become control code, control code has to survive noisy sensors, and the final behavior has to work in a room rather than a prompt.

The next metric to watch is not only whether Claude can eventually make the robot fetch the ball. More telling measures include the number of intervention points, the repeatability across robot platforms, the safety envelope around autonomous hardware control, and whether speed gains come with brittle assumptions. If agentic models keep improving here, robotics teams may use them first as fast integration engineers before trusting them as full autonomy systems.

Humanoid Robots Hacker News 4d ago 2 min read

Gemini Robotics 2 shifts the debate from robot hands to whole-body control

The thread’s energy centered on whether one model family can coordinate walking, dexterity, planning, and teamwork without running into control and latency limits.

#google #gemini #robotics

Humanoid Robots Jul 9, 2026 2 min read

Mistral’s Robostral Navigate hits 76.6% unseen success with one RGB camera

Robostral Navigate is an 8B embodied navigation model that uses a single RGB camera instead of LiDAR, depth sensors, or multiple cameras. Mistral says it reaches 76.6% success on unseen R2R-CE environments and beats multi-sensor systems by 4.5 points.

#mistral #robostral #robotics

Humanoid Robots Jul 8, 2026 1 min read

GR00T 1.7 and Isaac Teleop give LeRobot an open humanoid pipeline

NVIDIA and Hugging Face are bringing Isaac GR00T 1.7, Isaac Teleop, datasets, and robotics workflows into LeRobot. The move connects NVIDIA’s 3 million robotics developers with Hugging Face’s 16 million AI builders through a shared open robotics stack.

#nvidia #hugging-face #lerobot

Related Articles

Gemini Robotics 2 shifts the debate from robot hands to whole-body control

Mistral’s Robostral Navigate hits 76.6% unseen success with one RGB camera

GR00T 1.7 and Isaac Teleop give LeRobot an open humanoid pipeline