Anthropic Exposes Industrial-Scale AI Distillation Attacks by DeepSeek, Moonshot AI, and MiniMax
Original: Anthropic Exposes Industrial-Scale AI Model Distillation Attacks by DeepSeek, Moonshot AI, and MiniMax View original →
Industrial-Scale Distillation Attacks Discovered
On February 24, 2026, Anthropic publicly disclosed that major Chinese AI companies had been conducting large-scale distillation attacks against its Claude models. DeepSeek, Moonshot AI, and MiniMax were identified as the perpetrators.
Scale and Method
The attack involved:
- Creation of over 24,000 fraudulent accounts
- Generation of more than 16 million exchanges with Claude
- Using that conversation data to train and improve their own competing AI models
Why Illicit Distillation Is Dangerous
Anthropic distinguishes between legitimate and illicit distillation. While AI labs legitimately use distillation to create smaller, cheaper models for their customers, foreign labs that illicitly distill American models can remove safety guardrails and feed extracted capabilities into their military, intelligence, and surveillance systems.
Call for Coordinated Action
Anthropic warned that these attacks are growing in both intensity and sophistication, calling for rapid, coordinated action from industry players, policymakers, and the broader AI community to address the threat.
Full details are available in Anthropic's official report: Detecting and Preventing Distillation Attacks.
Related Articles
AI悪用の焦点はフィッシング文面から侵入後の自動化へ移っている。Anthropicは832の悪性アカウントをMITRE ATT&CKに対応付け、中リスク以上の比率が33%から56%へ上がったと示した。
Anthropicが2028年のグローバルAIリーダーシップに関する2つのシナリオを示す論文を公開した。従来のAGI安全研究ではなく、半導体輸出規制と米中競争を中心とした地政学的警告文として注目を集めている。
AnthropicはAI政策レポートを発表し、民主主義国家が2028年までに中国に対するAI優位性を確保する必要性を強調した。AIを地政学的な戦略資産と位置づけ、官民協力を求めている。