#multi-agent

LLM 2d ago 2 min read

Sakana Fugu Opens Beta With 54.2 SWE-Pro and OpenAI-Style API

Sakana AI is trying to sell orchestration itself as a model product, not just a prompt hack around other APIs. In its beta table, fugu-ultra posts 54.2 on SWEPro and 95.1 on GPQAD while shipping behind an OpenAI-compatible API.

#sakana-ai #multi-agent #benchmarks

AI Reddit Apr 8, 2026 2 min read

r/artificial treats the Claude Code leak as a field manual for production AI agents

A recent r/artificial post argues that the Claude Code leak mattered less as drama than as a rare look at the engineering layer around a production AI coding agent. The real takeaway was not model internals but the exposed patterns for memory, permissions, tool orchestration, and multi-agent coordination.

#anthropic #claude-code #ai-agents

LLM sources.twitter Apr 6, 2026 2 min read

GitHub highlights Squad, a repository-native path to multi-agent coding workflows

GitHub’s April 5 X post pointed developers to Squad, an open-source project built on GitHub Copilot that initializes a preconfigured AI team inside a repository. GitHub says the model works by routing work through a thin coordinator, storing shared decisions in versioned repo files, and letting specialist agents operate in parallel with separate context windows.

#github #multi-agent #copilot

AI Hacker News Apr 3, 2026 2 min read

Hacker News Pushes Cursor 3 as a Unified Workspace for Coding Agents

Cursor 3 reframes AI coding as multi-agent orchestration, combining local and cloud agents, multi-repo context, and PR-oriented workflows in a single interface.

#cursor #coding-agents #developer-tools

LLM sources.twitter Mar 27, 2026 2 min read

Together Research says divide-and-conquer long-context pipelines can beat GPT-4o single-shot

Together Research said on March 27, 2026 that a smaller model using divide-and-conquer can match or outperform GPT-4o on long-context tasks, with the work accepted at ICLR 2026. Together's blog and the arXiv paper say the method uses a planner-worker-manager pipeline and explains long-context failures in terms of task, model, and aggregator noise.

#together-ai #long-context #multi-agent

LLM sources.twitter Mar 25, 2026 2 min read

Anthropic details a multi-agent harness for frontend design and long-running software engineering

Anthropic said on March 24, 2026 that a new Engineering Blog post explains how it used a multi-agent harness to improve Claude on frontend design and long-running autonomous software engineering. The write-up separates planning, generation, and evaluation, and reports clear gains over simpler solo-agent runs.

#anthropic #claude #multi-agent

LLM Mar 21, 2026 2 min read

GitHub expands repository-native multi-agent development with Squad

On 2026-03-19, GitHub outlined Squad, an open-source GitHub Copilot project that initializes a preconfigured AI team inside a repository. The design matters because it packages routing, shared memory, and review separation into a repo-native workflow instead of relying on a separate orchestration stack.

#github #copilot #agents

LLM sources.twitter Mar 10, 2026 2 min read

GitHub says reliable multi-agent systems need schemas, actions, and MCP

GitHub used X on March 9, 2026 to resurface its guide to building reliable multi-agent systems. The company argues that most failures come from missing structure, and recommends typed schemas, action schemas, and Model Context Protocol as the core engineering controls.

#github #agentic-ai #mcp

AI Reddit Mar 3, 2026 1 min read

Scientists Made AI Agents Ruder — And They Performed Better at Complex Reasoning Tasks

A counterintuitive study found that programming AI agents with more assertive, 'rude' conversational behaviors — including interrupting and strategic silence — significantly improved their performance on complex reasoning tasks.

#ai-agents #reasoning #research

AI sources.twitter Feb 22, 2026 1 min read

xAI Launches Grok 4.2 Public Beta with Multi-Agent System and Rapid Weekly Learning

Elon Musk has released the Grok 4.2 public beta, featuring four specialized AI agents (Grok, Harper, Benjamin, Lucas) working in parallel. The rapid learning architecture improves the model weekly and reduces hallucinations by 65%.

#grok #xai #multi-agent

AI sources.twitter Feb 22, 2026 1 min read

OpenClaw Founder Peter Steinberger Joins OpenAI to Lead Next-Gen Personal Agents

OpenAI CEO Sam Altman announced that Peter Steinberger, creator of viral AI personal agent app OpenClaw, is joining the company to drive the next generation of personal agents across a multi-agent future.

#openai #personal-agents #multi-agent