#api

LLM Apr 24, 2026 2 min read

Sakana Fugu Opens Beta With 54.2 SWE-Pro and OpenAI-Style API

Sakana AI is trying to sell orchestration itself as a model product, not just a prompt hack around other APIs. In its beta table, fugu-ultra posts 54.2 on SWEPro and 95.1 on GPQAD while shipping behind an OpenAI-compatible API.

#sakana-ai #multi-agent #benchmarks

AI X/Twitter Apr 18, 2026 2 min read

Grok STT API targets voice apps with 25+ languages at $0.10/hour

Why it matters: xAI has turned the Grok Voice stack into standalone STT/TTS APIs with batch transcription at $0.10/hour and streaming at $0.20/hour. The post puts 25+ languages, diarization, and word-level timestamps in direct competition with enterprise transcription tools.

#xai #grok #speech-to-text

LLM X/Twitter Apr 12, 2026 2 min read

Anthropic launches Claude Managed Agents to move production agents onto hosted infrastructure

Claude said on April 8, 2026 that Managed Agents lets teams define tasks, tools, and guardrails while Anthropic runs the agent infrastructure. Anthropic's official materials describe a composable API suite for cloud-hosted, versioned agents, with advanced capabilities like outcomes, memory, and multi-agent orchestration in limited research preview.

#anthropic #claude #managed-agents

LLM Apr 12, 2026 2 min read

Meta Launches Muse Spark, the First Model From Meta Superintelligence Labs

Meta introduced Muse Spark on April 8, 2026 as the first model from Meta Superintelligence Labs. It already powers the Meta AI app and website and will expand to WhatsApp, Instagram, Facebook, Messenger, and AI glasses, with a private-preview API for partners.

#meta #muse-spark #llm

LLM X/Twitter Apr 11, 2026 2 min read

Claude turns the advisor pattern into a native tool on Claude Platform

Claude said on April 9, 2026 that the advisor strategy is now in beta on Claude Platform. The new tool lets Sonnet or Haiku call Opus for planning help inside a single Messages API request, which Anthropic says raised SWE-bench Multilingual by 2.7 points while cutting cost per task by 11.9% versus Sonnet alone.

#anthropic #claude #agents

AI Reddit Apr 6, 2026 2 min read

r/artificial Maps the Agent-Native Stack From Email and Phones to Wallets and Browsers

A Reddit discussion on r/artificial argues that the agent ecosystem is rapidly turning once-human capabilities like email, phone numbers, browsers, memory, payments, and SaaS access into composable APIs.

#ai-agents #infrastructure #automation

LLM Mar 19, 2026 1 min read

OpenAI brings GPT-5.4 mini to ChatGPT, Codex, and the API

OpenAI said on March 17, 2026 that GPT-5.4 mini is now available in ChatGPT, Codex, and the API. The company positioned it as a faster model for coding, computer use, multimodal understanding, and subagents.

#openai #gpt-5.4 #api

LLM Mar 19, 2026 2 min read

Google adds context circulation, tool combos, and Maps grounding to the Gemini API

Google on Mar 17, 2026 introduced new Gemini API features for agentic workflows, including combined built-in and custom tools, context circulation across tool calls, and Maps grounding for Gemini 3. The update is designed to reduce orchestration work for complex multi-step applications.

#google #gemini #api

LLM Hacker News Mar 18, 2026 2 min read

Hacker News Tracks GPT-5.4 Mini and Nano as OpenAI Pushes Small Models Into Codex and Agent Work

A March 17, 2026 Hacker News post about GPT-5.4 mini and nano reached 236 points and 143 comments. OpenAI is positioning mini as a fast coding and tool-use model for Codex, the API, and ChatGPT, while nano targets cheaper classification, extraction, and subagent workloads.

#openai #gpt-5.4 #small-models

LLM Mar 17, 2026 2 min read

Google adds project spend caps and faster tier upgrades for the Gemini API

Google introduced Project Spend Caps, revamped Usage Tiers, and new billing dashboards for Gemini API developers in AI Studio. The update is aimed at making cost control and scaling behavior more predictable for teams moving into paid usage.

#google #gemini #api

118

LLM Mar 16, 2026 2 min read

Perplexity launches Agent API as a managed runtime for search and tool-using workflows

Perplexity said on March 11, 2026 that its new Agent API combines search, tool execution, and multi-model orchestration behind one managed runtime. The launch positions Perplexity less as a single-answer interface and more as infrastructure for production agent workflows.

#perplexity #agents #api

118

LLM X/Twitter Mar 14, 2026 2 min read

OpenAI Rolls Out GPT-5.4 Across ChatGPT, the API, and Codex with 1M Context and Native Computer Use

OpenAI posted on March 5, 2026 that GPT-5.4 Thinking and GPT-5.4 Pro are rolling out across ChatGPT, the API, and Codex. The launch article positions GPT-5.4 as a professional-work model with 1M-token context, native computer use, stronger tool search, and better spreadsheet, document, and presentation performance.

#openai #gpt-5.4 #agents