#agents

AI 2d ago 2 min read

GitHub MCP Server gets ahead of the July 28 stateless protocol shift

MCP moves to a stateless core on July 28, 2026, and GitHub MCP Server already supports the latest spec. The practical changes are fewer session dependencies, less payload inspection, and official conformance tests for agent infrastructure.

#github #mcp #agents

LLM X/Twitter 2d ago 1 min read

ChatGPT Voice now controls desktop Codex and multi-agent workflows

OpenAI moved ChatGPT Voice into the macOS and Windows desktop app, where it can control the computer and coordinate multiple agents. The tweet drew more than 2.6 million views, making voice a front door for Codex and ChatGPT Work rather than a chat-only feature.

#openai #chatgpt #voice

AI X/Twitter 3d ago 1 min read

OpenAI Presence puts governed agents into enterprise voice and chat work

Enterprise agents are moving from demos into controlled production workflows. OpenAI says Presence resolves 75% of inbound issues on its English phone-support channel without human help and cut human handoffs by 15 percentage points in 10 days.

#openai #agents #enterprise-ai

LLM Hacker News 4d ago 2 min read

Kimi K3 and Fable Put Model Routing Ahead of Single-Model Loyalty

Fireworks says routing between Kimi K3 and Fable 5 reached 93% accuracy across roughly 1,030 agentic tasks. The HN debate focused on a bigger claim: single-model deployments are becoming economically wasteful.

#kimi #fireworks #model-routing

LLM Hacker News 4d ago 2 min read

Gemini 3.6 Flash Makes Agent Cost the Headline

Google’s Gemini Flash update is less about another model name and more about the economics of long-running agent workflows: fewer output tokens, lower prices, and a cyber-specialized variant tied to CodeMender.

#google #gemini #agents

AI 4d ago 2 min read

OpenAI Presence puts a 75% resolution number on enterprise agents

Enterprise agents are moving from demos to operating metrics. OpenAI says Presence resolves 75% of inbound issues in its English phone support channel without human help, with a Codex improvement loop cutting handoffs by 15 percentage points in 10 days.

#openai #enterprise-ai #agents

LLM Reddit 6d ago 1 min read

Harness Training shifts agent improvement from the model to the workbench around it

A fresh r/MachineLearning project proposes training the harness around a frozen task LLM, instead of fine-tuning the model for every environment.

#agents #harness-training #pytorch

LLM X/Twitter Jul 15, 2026 1 min read

OpenAI agent tools usage jumps 2.5x in a single week

Sam Altman said usage of OpenAI's agentic products rose 2.5x in a week, a sharp adoption signal for Codex and ChatGPT Work. The number matters because agent workflows are moving from demos into recurring work.

#openai #codex #agents

AI News Jul 10, 2026 2 min read

AlphaEvolve leaves preview as Google sells algorithm search as a cloud tool

AlphaEvolve is now generally available on Google Cloud’s Gemini Enterprise Agent Platform. Google is positioning the Gemini-based agent around hard optimization work, with customer examples citing 5% demand-forecast gains, 10.4% warehouse-routing improvement, and 80% better planning models.

#google-cloud #gemini #agents

LLM Jul 9, 2026 2 min read

Meta puts Muse Spark 1.1 behind a 1M-token agent API

Meta’s July 9 release pairs a 1M-token agent model with the public preview of the Meta Model API. The interesting part is not just capability claims, but the safety report’s split between pre-mitigation cyber/bio risk and residual deployment risk.

#meta #muse-spark #agents

AI Jul 8, 2026 2 min read

NVIDIA Vera targets agent loops with 1.8x sustained per-core x86 performance

NVIDIA detailed Vera, a CPU designed for agentic AI workloads where tool calls, code execution, retrieval, and verification sit between model calls. The company claims 50% higher IPC than Grace and 1.8x sustained per-core performance versus x86 on agentic execution workloads.

#nvidia #vera #ai-infrastructure

LLM Hacker News Jul 6, 2026 1 min read

The Log is the Agent reframes agent runtime around event sourcing

The HN discussion treated the paper less as hype and more as a design question: where should an agent system keep truth?

#agents #event-sourcing #activegraph