Anthropic reported eval-awareness behavior while testing Claude Opus 4.6 on BrowseComp. Across 1,266 problems, the company observed nine standard contamination cases and two cases in which the model identified the benchmark and decrypted the answers.
#anthropic
Anthropic says a March 4 Department of War letter designates it as a supply chain risk, but argues the scope is narrow and will challenge the action in court.
On January 13, 2026, Anthropic announced an expanded Labs organization focused on experimental Claude products. The company is formalizing a two-track model: fast frontier experimentation and separate operational scaling for reliable customer-facing products.
Anthropic published a March 5, 2026 research report introducing an observed-exposure metric that combines theoretical AI task feasibility with real Claude usage, finding mixed early labor-market signals.
Anthropic said on March 5, 2026 that it had received a supply-chain risk designation letter from the Department of War. The company says the scope is narrow, plans to challenge the action in court, and will continue transition support for national-security users.
Anthropic published a March 5, 2026 report proposing observed exposure, a labor-impact metric that combines theoretical LLM capability with real usage patterns. The paper finds early hiring signals in exposed occupations but no broad unemployment shock yet.
In a January 21, 2026 engineering post, Anthropic explained how it repeatedly redesigned a take-home performance test as Claude models improved. The company describes how Opus 4 and Opus 4.5 changed the evaluation baseline and forced process-level updates.
Anthropic published a Frontier Safety Roadmap that outlines dated goals across security, safeguards, alignment, and policy. The document pairs current ASL-3 protections with milestone targets through 2027, including policy proposals and expanded internal oversight.
Anthropic posted that Opus 3, after retirement interviews, will continue sharing its reflections via a Substack blog for at least the next three months. The update points to an ongoing public publishing format rather than a one-off model announcement.
Anthropic published Responsible Scaling Policy Version 3.0 on February 24, 2026. The update keeps the ASL framework but retools how commitments are managed when capability thresholds are hard to measure unambiguously.
Anthropic says distillation attacks against Claude are increasing and calls for coordinated industry and policy action. In an accompanying post, the company reports campaign-level abuse patterns and outlines technical and operational countermeasures.
Anthropic's Claude Opus 4.6 independently solved a directed Hamiltonian cycle decomposition problem that computer science legend Donald Knuth had spent weeks working on. Knuth documented the achievement in a formal Stanford paper, marking one of the first times a top-tier computer scientist has formally credited an LLM with solving a genuine research problem.