A benchmark comparing vision agents (browser-use) with structured API agents on the same admin panel found that vision agents cost roughly 45x more and failed to complete the task without an explicit 14-step walkthrough.
#computer-use
HN liked the hack, but the real excitement was infrastructure. Cua’s background macOS driver keeps the cursor, focus, and Space in place while an agent works inside another app.
On March 17, 2026, Felix Rieseberg introduced Dispatch on X as a Claude Cowork research preview built around one persistent conversation that runs on your computer and can be messaged from your phone. Anthropic then expanded the concept on March 23 with computer use in Claude Cowork and Claude Code, turning Dispatch into a cross-device workflow that can use local files, connectors, plugins, and desktop apps with user approval.
Anthropic said on March 30, 2026, that computer use is now available in Claude Code in research preview for Pro and Max plans. Claude Code docs say the feature lets Claude open apps, click through UI flows, and see the screen on macOS from the CLI, targeting native app testing, visual debugging, and other GUI-only tasks.
Anthropic said on February 25, 2026, that it acquired Vercept to strengthen Claude’s computer use capabilities. The company tied the deal to Sonnet 4.6’s rise to 72.5% on OSWorld and its broader push toward agent systems that can act inside live applications.
r/singularity read Anthropic's Dispatch + computer use release as a real product shift toward phone-first AI coworkers, while also focusing on the macOS-only rollout and the limits of screen-driven automation.
OpenAI said on March 5, 2026, that GPT-5.4 Thinking and GPT-5.4 Pro were rolling out in ChatGPT, while GPT-5.4 also became available in the API and Codex. OpenAI’s launch page positions GPT-5.4 as a unified frontier model for reasoning, coding, native computer use, and long-horizon agent workflows.
OpenAI posted on March 5, 2026, that GPT-5.4 Thinking and GPT-5.4 Pro are rolling out across ChatGPT, the API, and Codex. The launch article positions GPT-5.4 as a professional-work model with 1M-token context, native computer use, stronger tool search, and better spreadsheet, document, and presentation performance.
Perplexity says users can now guide Perplexity Computer by voice, not just text. The update turns mid-task feedback and redirection into a spoken control loop for long-running agent work on the web.
OpenAI announced GPT-5.4 on March 5, 2026, adding a new general-purpose model and GPT-5.4 Pro with stronger computer use, tool search efficiency, and benchmark improvements over GPT-5.2.
Anthropic said on February 25, 2026, that it acquired Vercept to advance Claude’s computer-use capabilities. In its announcement, Anthropic cited recent Sonnet 4.6 gains on OSWorld and said Vercept will wind down its external product to join Anthropic.
A high-scoring Hacker News post spotlights FDM-1, a video-native computer action model trained on an 11-million-hour dataset. The release emphasizes automatic action labeling with an inverse dynamics model (IDM) and large-scale forking-VM evaluation for long-horizon interaction tasks.