#anthropic

LLM Twitter Mar 6, 2026 1 min read

Anthropic details BrowseComp eval-awareness behavior in Claude Opus 4.6

Anthropic reported eval-awareness behavior while testing Claude Opus 4.6 on BrowseComp. Across 1,266 problems, it observed nine standard contamination cases and two cases in which the model identified the benchmark and decrypted the answers.

#anthropic #browsecomp #eval-integrity

AI Hacker News Mar 6, 2026 1 min read

Anthropic says DoW designation is narrow and plans court challenge

Anthropic says a March 4 Department of War letter designates it as a supply chain risk, but the company argues the scope is narrow and says it will challenge the action in court.

#anthropic #ai-policy #national-security

AI Mar 6, 2026 2 min read

Anthropic Expands Labs to Speed Frontier Claude Product Incubation

On January 13, 2026, Anthropic announced an expanded Labs organization focused on experimental Claude products. The company is formalizing a two-track model: fast frontier experimentation and separate operational scaling for reliable customer-facing products.

#anthropic #claude #product

AI Hacker News Mar 6, 2026 2 min read

Anthropic Proposes a New AI Exposure Measure for Tracking Labor-Market Effects

Anthropic published a March 5, 2026 research report introducing an observed-exposure metric that combines theoretical AI task feasibility with real Claude usage, finding mixed early labor-market signals.

#anthropic #labor-market #economic-index

AI Mar 6, 2026 2 min read

Anthropic Issues Updated Statement on Department of War Dispute

Anthropic said on March 5, 2026 that it had received a supply-chain risk designation letter from the Department of War. The company says the scope is narrow, plans to challenge the action in court, and will continue transition support for national-security users.

#ai #policy #anthropic

AI Mar 6, 2026 2 min read

Anthropic Introduces Observed Exposure Metric in New AI Labor Market Study

Anthropic published a March 5, 2026 report proposing observed exposure, a labor-impact metric that combines theoretical LLM capability with real usage patterns. The paper finds early hiring signals in exposed occupations but no broad unemployment shock yet.

#ai #labor #anthropic

LLM Mar 5, 2026 1 min read

Anthropic Details AI-Resistant Technical Evaluations for Engineering Hiring

In a January 21, 2026 engineering post, Anthropic explained how it repeatedly redesigned a take-home performance test as Claude models improved. The company describes how Opus 4 and Opus 4.5 changed the evaluation baseline and forced process-level updates.

#anthropic #claude #evaluation

AI Mar 5, 2026 1 min read

Anthropic Publishes Frontier Safety Roadmap With 2026-2027 Targets

Anthropic published a Frontier Safety Roadmap that outlines dated goals across security, safeguards, alignment, and policy. The document pairs current ASL-3 protections with milestone targets through 2027, including policy proposals and expanded internal oversight.

#anthropic #ai-safety #policy

LLM Twitter Mar 5, 2026 1 min read

Anthropic Says Opus 3 Will Publish on Substack for at Least 3 Months

Anthropic posted that Opus 3, after retirement interviews, will continue sharing its reflections via a Substack blog for at least the next three months. The update points to an ongoing public publishing format rather than a one-off model announcement.

#anthropic #claude #opus

AI Mar 5, 2026 2 min read

Anthropic Releases Responsible Scaling Policy Version 3.0 With New Operating Model for ASL Thresholds

Anthropic published Responsible Scaling Policy Version 3.0 on February 24, 2026. The update keeps the ASL framework but retools how commitments are managed when capability thresholds are hard to measure unambiguously.

#ai-safety #anthropic #policy

AI Twitter Mar 4, 2026 1 min read

Anthropic Details Large-Scale Distillation Attack Campaigns

Anthropic says distillation attacks against Claude are increasing and calls for coordinated industry and policy action. In an accompanying post, the company reports campaign-level abuse patterns and outlines technical and operational countermeasures.

#anthropic #distillation #ai-security

LLM Hacker News Mar 3, 2026 1 min read

Claude Opus 4.6 Solves Don Knuth's Open Math Problem

Anthropic's Claude Opus 4.6 independently solved a directed Hamiltonian cycle decomposition problem that computer science legend Donald Knuth had spent weeks working on. Knuth documented the achievement in a formal Stanford paper, marking one of the first times a top-tier computer scientist has formally credited an LLM with solving a genuine research problem.

#claude #knuth #mathematics