In a 2026-02-25 X thread, Anthropic said Claude Opus 3 is now part of both deprecation and preservation actions. The company says Opus 3 remains available to paid Claude users and can be requested for API use.
LLM
RSS FeedOpenAIDevs posted on 2026-02-24 that GPT-5.3-Codex is now available for all developers in the Responses API. The announcement moves API access from a staged rollout to general developer availability.
A high-ranking Hacker News thread amplified a Truffle Security report arguing that legacy Google API keys can become high-impact credentials when Gemini APIs are enabled. The post highlights exposure scale claims and concrete key-hardening steps.
Google is introducing Gemini task automation in early preview on select Pixel 10 and Galaxy S26 devices. The assistant can prepare multi-step app actions like ride-hailing and food orders while users keep final submission control.
A high-traffic LocalLLaMA thread tracked the release of Qwen3.5-122B-A10B on Hugging Face and quickly shifted into deployment questions. Community discussion centered on GGUF timing, quantization choices, and real-world throughput, while the model card highlighted a 122B total/10B active MoE design and long-context serving guidance.
A high-engagement r/LocalLLaMA thread reports strong early results for Qwen3.5-35B-A3B in local agentic coding workflows. The original poster cites 100+ tokens/sec on a single RTX 3090 setup, while comments show mixed reproducibility and emphasize tooling, quantization, and prompt pipeline differences.
Anthropic’s new Claude Code Remote Control feature lets users continue local coding sessions from web and mobile clients. Hacker News users praised the local-first model and security posture, while early testers also reported stability and UX issues in this preview stage.
A Reddit post in r/singularity links METR’s new productivity update, revisiting the widely cited 2025 result that AI slowed experienced open-source developers. The new signal points toward possible speedup, but METR stresses major selection-bias limitations.
On February 23, 2026, Anthropic said it detected large-scale distillation abuse tied to roughly 24,000 fraudulent accounts and more than 16 million Claude exchanges. The company framed the issue as both a model security and policy challenge.
GitHub announced public preview availability of Copilot’s cross-agent memory for Copilot coding agent, Copilot CLI, and Copilot code review. The system is repository-scoped, citation-verified, opt-in, and accompanied by reported improvements in evaluation and A/B test metrics.
A high-engagement r/LocalLLaMA post surfaced the Qwen3.5-35B-A3B model card on Hugging Face. The card emphasizes MoE efficiency, long context handling, and deployment paths across common open-source inference stacks.
Inception Labs introduced Mercury 2 and claims a diffusion-based architecture can deliver reasoning quality at much lower latency. The launch emphasizes parallel token refinement, OpenAI-compatible APIs, and enterprise-ready throughput targets.