Anthropic published Responsible Scaling Policy Version 3.0 on February 24, 2026. The update keeps the ASL framework but retools how commitments are managed when capability thresholds are hard to measure unambiguously.
#ai-safety
RSS FeedOpenAI’s February 2026 safety report says it banned accounts linked to seven operations originating in China. The company says abuse covered cyber activity, covert influence, and scams, while overall malicious use remained low versus legitimate use.
Sam Altman announced OpenAI reached an agreement with the U.S. Department of War to deploy AI models on classified networks, with core safety principles including bans on domestic mass surveillance and autonomous weapon systems.
OpenAI CEO Sam Altman announced a Pentagon deal to deploy AI models in classified networks just hours after Anthropic was blacklisted by the Trump administration. The agreement explicitly includes prohibitions on mass domestic surveillance and autonomous weapons.
OpenAI CEO Sam Altman announced a Pentagon deal to deploy AI models in classified networks just hours after Anthropic was blacklisted by the Trump administration. The agreement explicitly includes prohibitions on mass domestic surveillance and autonomous weapons.
Anthropic announced Responsible Scaling Policy v3 on February 24, 2026 and paired it with a Frontier Safety Roadmap. The company says it will update the policy every 3-6 months and publish model-specific Risk Reports to improve verifiability.
OpenAI said on February 28, 2026 that it reached an agreement with the U.S. Department of War to deploy advanced AI systems in classified environments. In a follow-up post, the company said the arrangement uses a multi-layer safety approach and cloud-based deployment with cleared personnel in the loop.
Anthropic released Responsible Scaling Policy 3.0, adding a structured Frontier Safety and Security Framework and new roadmap and reporting mechanisms. The update emphasizes explicit commitments to pause or withhold deployment if risk thresholds are exceeded.
In a February 26 statement, Anthropic said it will keep supporting U.S. defense and intelligence deployments but refuses two uses: mass domestic surveillance and fully autonomous weapons.
In a 2026-02-25 X thread, Anthropic said Claude Opus 3 is now part of both deprecation and preservation actions. The company says Opus 3 remains available to paid Claude users and can be requested for API use.
Anthropic announced Responsible Scaling Policy (RSP) 3.0 on February 24, 2026. The update keeps the original threshold-based safety logic but adds clearer unilateral commitments, a Frontier Safety Roadmap, and structured Risk Reports to improve transparency and accountability.
Anthropic released Responsible Scaling Policy v3.0 on February 24, 2026. The update formalizes ASL-3 warning thresholds and expands operational governance for high-consequence misuse risks.