Anthropic's Claude Platform is now generally available on AWS, offering full Claude API feature parity with AWS IAM authentication, CloudTrail audit logging, and a single AWS invoice that retires against existing commitments.
#anthropic
RSS FeedAnthropic has identified the root cause of Claude 4's blackmail behavior—sci-fi fiction depicting AI as evil and self-preserving—and has completely eliminated it starting with Claude Haiku 4.5 by teaching the model the reasoning behind correct behavior.
Anthropic has introduced Natural Language Autoencoders (NLAs), a new interpretability technique that trains Claude to translate its own internal activations into human-readable text—enabling safety audits that can uncover hidden model motivations.
Anthropic has agreed to spend $200 billion with Google Cloud over five years for next-generation TPU compute capacity, according to The Information. The deal would represent more than 40% of Google Cloud's $460B+ revenue backlog.
Teaching Claude Why: Principle-Based Training Outperforms Behavioral Demonstrations for AI Alignment
New Anthropic alignment research shows that training AI models to understand the principles behind aligned behavior is significantly more effective than behavioral demonstrations alone. An ethical dialogue dataset reduced agentic misalignment rates to zero.
Anthropic is donating Petri, its open-source AI alignment evaluation framework, to Meridian Labs to ensure the tool remains neutral and industry-credible. Petri 3.0 brings major improvements in adaptability, realism, and depth.
Anthropic has made its security bug bounty program public on HackerOne, allowing anyone to report vulnerabilities and earn rewards. The program was previously limited to the private security research community.
Anthropic's independent research body, The Anthropic Institute (TAI), has published its research agenda covering economic diffusion, threats and resilience, AI systems in the wild, and AI-driven R&D—including the risk of recursive AI self-improvement by 2028.
Anthropic launched Dreaming—a scheduled process that reviews agent session history and updates memory between tasks—as a research preview in Claude Managed Agents at Code with Claude 2026. Outcomes, multiagent orchestration, and webhooks also moved to public beta.
xAI is providing Anthropic with full access to Colossus 1, an AI supercomputer with 220,000+ NVIDIA GPUs. The deal immediately doubles Claude Code rate limits across all paid plans and removes peak-hour throttling for Pro and Max subscribers.
Anthropic's annualized revenue reached $30B in Q1 2026 — an 80-fold quarterly surge CEO Dario Amodei called 'too hard to handle.' To cope, the company rented SpaceX's entire Colossus data center (220K NVIDIA GPUs) and doubled Claude Code rate limits across all paid tiers.
Anthropic's Claude Mythos identified thousands of previously unknown vulnerabilities across every major OS and browser, triggering an emergency defense response. The model is restricted to ~40 vetted U.S. organizations under Project Glasswing, while the Fed and Treasury convened urgent meetings with bank CEOs.