#computer-use

LLM Hacker News Jun 26, 2026 1 min read

Gemini 3.5 Flash gets computer use, and HN focuses on trust boundaries

The 241-point HN thread treated Google’s release less as a feature checklist and more as a test of what users can safely delegate.

#gemini #computer-use #agents

LLM X/Twitter Jun 26, 2026 2 min read

Gemini 3.5 Flash adds native computer use for cross-interface agents

Agent competition is moving from answer quality to controlled action on screens. Google DeepMind says Gemini 3.5 Flash now has a built-in computer-use tool for browser, mobile, and desktop interfaces.

#google-deepmind #gemini #computer-use

AI Hacker News May 5, 2026 1 min read

Computer Use Is 45x More Expensive Than Structured APIs

A benchmark comparing vision agents (browser-use) with structured API agents on the same admin panel found that vision agents cost roughly 45x more and failed to complete the task without an explicit 14-step walkthrough.

#computer-use #ai-agents #api

AI Hacker News Apr 30, 2026 2 min read

HN sees Cua as the missing layer for background computer-use on macOS

HN liked the hack, but the real excitement was infrastructure. Cua’s background macOS driver keeps the cursor, focus, and Space in place while an agent works inside another app.

#computer-use #macos #automation

LLM X/Twitter Apr 2, 2026 2 min read

Dispatch pushes Claude toward a persistent cross-device agent for desktop work

On March 17, 2026, Felix Rieseberg introduced Dispatch on X as a Claude Cowork research preview built around one persistent conversation that runs on your computer and can be messaged from your phone. Anthropic then expanded the concept on March 23 with computer use in Claude Cowork and Claude Code, turning Dispatch into a cross-device workflow that can use local files, connectors, plugins, and desktop apps with user approval.

#anthropic #claude #dispatch

LLM X/Twitter Mar 31, 2026 2 min read

Anthropic adds computer use to Claude Code for GUI testing and app control on macOS

Anthropic said on March 30, 2026 that computer use is now available in Claude Code in research preview for Pro and Max plans. Claude Code docs say the feature lets Claude open apps, click through UI flows, and see the screen on macOS from the CLI, targeting native app testing, visual debugging, and other GUI-only tasks.

#claude-code #computer-use #anthropic

LLM Mar 26, 2026 2 min read

Anthropic Acquires Vercept to Push Claude Deeper Into Computer Use

Anthropic said on February 25, 2026 that it acquired Vercept to strengthen Claude’s computer use capabilities. The company tied the deal to Sonnet 4.6’s rise to 72.5% on OSWorld and its broader push toward agent systems that can act inside live applications.

#anthropic #claude #computer-use

LLM Reddit Mar 24, 2026 2 min read

r/singularity treats Anthropic Dispatch as the next step toward phone-first AI coworkers

r/singularity read Anthropic's Dispatch + computer use release as a real product shift toward phone-first AI coworkers, while also focusing on the macOS-only rollout and the limits of screen-driven automation.

#claude #computer-use #mobile

LLM X/Twitter Mar 21, 2026 2 min read

OpenAI rolls GPT-5.4 Thinking and GPT-5.4 Pro across ChatGPT, API, and Codex

OpenAI said on March 5, 2026 that GPT-5.4 Thinking and GPT-5.4 Pro were rolling out in ChatGPT, while GPT-5.4 also became available in the API and Codex. OpenAI’s launch page positions GPT-5.4 as a unified frontier model for reasoning, coding, native computer use, and long-horizon agent workflows.

#openai #gpt-5.4 #codex

LLM X/Twitter Mar 14, 2026 2 min read

OpenAI Rolls Out GPT-5.4 Across ChatGPT, the API, and Codex with 1M Context and Native Computer Use

OpenAI posted on March 5, 2026 that GPT-5.4 Thinking and GPT-5.4 Pro are rolling out across ChatGPT, the API, and Codex. The launch article positions GPT-5.4 as a professional-work model with 1M-token context, native computer use, stronger tool search, and better spreadsheet, document, and presentation performance.

#openai #gpt-5.4 #agents

AI X/Twitter Mar 9, 2026 1 min read

Perplexity adds Voice Mode to Perplexity Computer for spoken agent steering

Perplexity says users can now guide Perplexity Computer by voice, not just text. The update turns mid-task feedback and redirection into a spoken control loop for long-running agent work on the web.

#perplexity #voice-mode #computer-use

LLM Hacker News Mar 6, 2026 2 min read

OpenAI Releases GPT-5.4 Across ChatGPT, API, and Codex With Major Tool-Use Gains

OpenAI announced GPT-5.4 on March 5, 2026, adding a new general-purpose model and GPT-5.4 Pro with stronger computer use, tool search efficiency, and benchmark improvements over GPT-5.2.

#openai #gpt-5-4 #tool-use