Articles

All AI LLM Humanoid Robots Sciences Gaming Finance

Source:

From To

LLM X/Twitter 2d ago 1 min read

OpenAI agent tools usage jumps 2.5x in a single week

Sam Altman said usage of OpenAI's agentic products rose 2.5x in a week, a sharp adoption signal for Codex and ChatGPT Work. The number matters because agent workflows are moving from demos into recurring work.

#openai #codex #agents

AI News 6d ago 2 min read

AlphaEvolve leaves preview as Google sells algorithm search as a cloud tool

AlphaEvolve is now generally available on Google Cloud’s Gemini Enterprise Agent Platform. Google is positioning the Gemini-based agent around hard optimization work, with customer examples citing 5% demand-forecast gains, 10.4% warehouse-routing improvement, and 80% better planning models.

#google-cloud #gemini #agents

LLM Jul 9, 2026 2 min read

Meta puts Muse Spark 1.1 behind a 1M-token agent API

Meta’s July 9 release pairs a 1M-token agent model with the public preview of the Meta Model API. The interesting part is not just capability claims, but the safety report’s split between pre-mitigation cyber/bio risk and residual deployment risk.

#meta #muse-spark #agents

AI Jul 8, 2026 2 min read

NVIDIA Vera targets agent loops with 1.8x sustained per-core x86 performance

NVIDIA detailed Vera, a CPU designed for agentic AI workloads where tool calls, code execution, retrieval, and verification sit between model calls. The company claims 50% higher IPC than Grace and 1.8x sustained per-core performance versus x86 on agentic execution workloads.

#nvidia #vera #ai-infrastructure

LLM Hacker News Jul 6, 2026 1 min read

The Log is the Agent reframes agent runtime around event sourcing

The HN discussion treated the paper less as hype and more as a design question: where should an agent system keep truth?

#agents #event-sourcing #activegraph

LLM Hacker News Jul 6, 2026 1 min read

Clean code may not make coding agents pass more, but it makes them wander less

The HN debate centered on measurement: equal pass rates do not mean equal agent cost or navigation behavior.

#coding-agents #software-engineering #maintainability

LLM X/Twitter Jul 5, 2026 2 min read

GitHub Copilot CLI turns Markdown into repeatable custom agents

GitHub is moving Copilot CLI from one-off terminal help toward versioned team workflows. The new custom agents cover at least four repeated tasks: security audits, release notes, infrastructure reviews, and incident response.

#github #copilot #agents

LLM Hacker News Jul 4, 2026 1 min read

Safari MCP server moves browser debugging into the agent loop

WebKit’s new Safari Technology Preview tool gives coding agents access to DOM state, network requests, console output, and screenshots.

#safari #mcp #webkit

LLM Jul 3, 2026 2 min read

SkillOpt lifts agent scores by 23.5 points without changing weights

Microsoft Research turned agent skill files into trainable artifacts. SkillOpt raised GPT-5.5’s six-benchmark direct-chat average from 58.8 to 82.3 and improved all or tied for best across 52 evaluation cells without updating model weights.

#microsoft-research #agents #skillopt

LLM Hacker News Jul 2, 2026 1 min read

Senior SWE-Bench tests coding agents against the messy idea of seniority

The interesting part is not just the score table. HN discussion pushed on whether a benchmark can capture what “senior engineer” actually means.

#llm #agents #benchmark

Sciences Jul 1, 2026 2 min read

Claude Science turns AI research help into an auditable workbench

Anthropic is moving AI-for-science support from chat into reproducible work sessions. Claude Science combines 60-plus scientific skills and connectors, reviewer agents, HPC or SSH workflows, and up to $30,000 in credits for as many as 50 projects.

#anthropic #claude-science #ai-for-science

LLM Jul 1, 2026 2 min read

Claude Sonnet 5 brings Opus-like agent work to Free and Pro users

Anthropic is moving stronger agentic work into its mainstream Sonnet tier. Sonnet 5 becomes the default for Free and Pro users, ships in Claude Code and the API, and starts at $2 per million input tokens and $10 per million output tokens through August 31.

#anthropic #claude #agents