On February 27, 2026, Amazon and OpenAI announced a multi-year strategic partnership built around a Stateful Runtime Environment on Amazon Bedrock, Frontier distribution on AWS, and long-term Trainium capacity. Amazon also said it will invest $50 billion in OpenAI.
Anthropic said on February 25, 2026 that it acquired Vercept to strengthen Claude’s computer use capabilities. The company tied the deal to Sonnet 4.6’s rise to 72.5% on OSWorld and its broader push toward agent systems that can act inside live applications.
Anthropic launched the Claude Partner Network on March 12, 2026 with an initial $100 million commitment. The program is designed to help service partners move enterprise Claude deployments from pilot projects into production.
A llama.cpp comparison on r/LocalLLaMA reached 55 upvotes and 81 comments. By testing RTX 5090, DGX Spark, AMD AI395, and single or dual R9700 setups under the same parameters, the post offers a practical view of local inference trade-offs that vendor slides usually hide.
A LocalLLaMA thread about Intel’s Arc Pro B70 and B65 reached 213 upvotes and 133 comments. Intel says the B70 is available from March 25, 2026 with a suggested starting price of $949, while the B65 follows in mid-April.
Google Research introduced TurboQuant on March 24, 2026 as a compression approach for KV cache and vector search bottlenecks. Hacker News pushed the post to 491 points and 129 comments, reflecting how central memory efficiency has become for long-context inference.
AWS and Cerebras said on March 13, 2026 that they are building a high-speed inference offering for Amazon Bedrock. The design splits prefill work to AWS Trainium and decode work to Cerebras CS-3 systems.
NVIDIA said on March 25, 2026 that Nemotron Nano 12B v2 VL delivers on-prem video understanding and, in NVIDIA's telling, performs near 30B-class alternatives on the MediaPerf benchmark at less than half the footprint. NVIDIA's model card describes it as a commercially usable multimodal model for multi-image reasoning, video understanding, visual Q&A, and summarization.
Hacker News took notice of Ente's Ensu announcement because it treats local LLM software as a privacy and ownership product: offline chat across major platforms, open-source core logic, and planned encrypted sync.
Microsoft Research has open-sourced AgentRx, a framework for pinpointing the first critical failure in long AI-agent trajectories. It ships with a 115-trajectory benchmark and reports gains in both failure localization and root-cause attribution.
Anthropic said on March 24, 2026 that a new Engineering Blog post explains how it used a multi-agent harness to improve Claude on frontend design and long-running autonomous software engineering. The write-up separates planning, generation, and evaluation, and reports clear gains over simpler solo-agent runs.
r/artificial focused on ATLAS because it shows how planning, verification, and repair infrastructure can push a frozen 14B local model far closer to frontier coding performance.