#policy

AI Twitter Mar 1, 2026 1 min read

OpenAI Signs Pentagon Deal Hours After Anthropic Blacklisted — Safety Guardrails Included

OpenAI CEO Sam Altman announced a Pentagon deal to deploy AI models in classified networks just hours after Anthropic was blacklisted by the Trump administration. The agreement explicitly includes prohibitions on mass domestic surveillance and autonomous weapons.

#openai #pentagon #military-ai

AI Mar 1, 2026 2 min read

Anthropic Publishes Responsible Scaling Policy v3 and Frontier Safety Roadmap

Anthropic announced Responsible Scaling Policy v3 on February 24, 2026 and paired it with a Frontier Safety Roadmap. The company says it will update the policy every 3-6 months and publish model-specific Risk Reports to improve verifiability.

#ai-safety #policy #anthropic

AI Twitter Feb 28, 2026 1 min read

OpenAI says it reached classified-network deployment agreement with U.S. Department of War

OpenAI said on February 28, 2026 that it reached an agreement with the U.S. Department of War to deploy advanced AI systems in classified environments. In a follow-up post, the company said the arrangement uses a multi-layer safety approach and cloud-based deployment with cleared personnel in the loop.

#openai #policy #national-security

AI Feb 28, 2026 1 min read

Anthropic Publishes Responsible Scaling Policy 3.0 with New Frontier Risk Process

Anthropic released Responsible Scaling Policy 3.0, adding a structured Frontier Safety and Security Framework and new roadmap and reporting mechanisms. The update emphasizes explicit commitments to pause or withhold deployment if risk thresholds are exceeded.

#anthropic #ai-safety #policy

AI Twitter Feb 25, 2026 2 min read

Anthropic Updates Responsible Scaling Policy to Version 3.0

Anthropic announced Responsible Scaling Policy (RSP) 3.0 on February 24, 2026. The update keeps the original threshold-based safety logic but adds clearer unilateral commitments, a Frontier Safety Roadmap, and structured Risk Reports to improve transparency and accountability.

#anthropic #ai-safety #policy

AI Feb 25, 2026 2 min read

Anthropic Publishes Responsible Scaling Policy v3.0 with ASL-3 Warning Thresholds

Anthropic released Responsible Scaling Policy v3.0 on February 24, 2026. The update formalizes ASL-3 warning thresholds and expands operational governance for high-consequence misuse risks.

#anthropic #ai-safety #policy

AI Feb 19, 2026 2 min read

Sundar Pichai Outlines AI Infrastructure and Workforce Agenda at AI Impact Summit 2026

In remarks published on February 19, 2026, Google CEO Sundar Pichai framed AI as a major platform shift and highlighted India-focused infrastructure and skilling plans. The speech cites a $15 billion Google infrastructure investment in India and calls for coordinated public-private governance.

#google #india #ai-impact-summit

AI Feb 16, 2026 2 min read

OpenAI Details Safety Alignment Stack, Reporting 97% Refusal on Uncertain Requests

OpenAI published a framework for safety alignment based on instruction hierarchy and uncertainty-aware behavior. In the company’s reported tests, refusal on uncertain requests rose from about 59% to about 97% when chain-of-command reasoning was applied.

#openai #safety #alignment