Project Glasswing: How Anthropic's Mythos AI Chains Vulnerabilities into Working Exploits

What Is Project Glasswing?

Project Glasswing is Anthropic's controlled research program providing select organizations access to Mythos Preview — a security-specialized LLM distinct from general-purpose frontier models. Cloudflare participated and tested the model against their own infrastructure, publishing the full results on their blog.

What Mythos Can Do

Exploit chain construction: Mythos can take multiple low-severity vulnerability primitives and chain them into a single, more severe working exploit. This is senior security researcher reasoning, not automated scanning.

Proof generation: Rather than generating speculative findings, Mythos writes, compiles, and executes code to verify vulnerabilities. When initial hypotheses fail, it iterates independently.

Cloudflare's 8-Stage Architecture

Recon: Architecture mapping and initial task queue
Hunt: ~50 concurrent agents targeting specific attack classes
Validate: Independent adversarial review to filter false positives
Gapfill: Re-queues under-explored areas
Dedupe: Collapses duplicate findings
Trace: Cross-repo exploitability analysis
Feedback: Loops validated findings back into hunting
Report: Structured output

Limitations and Dual-Use Warning

Despite lacking standard guardrails, Mythos exhibited unpredictable refusals on legitimate security tasks — identical requests produced different outcomes across runs. Cloudflare is explicit: these capabilities will eventually reach attackers. Their recommendation shifts emphasis from fast patching to defensive architecture — separating security boundaries, implementing blocking infrastructure, and coordinating simultaneous global deployments.

Project Glasswing: How Anthropic's Mythos AI Chains Vulnerabilities into Working Exploits

What Is Project Glasswing?

What Mythos Can Do

Cloudflare's 8-Stage Architecture

Limitations and Dual-Use Warning

Related Articles

GitHub uses LLM context to cut secret-scanning false positives 75.76%

Anthropic and Mozilla Detail 22 Firefox Vulnerabilities Found by Claude

No AI lab clears C+: safety index puts weakened pledges on the scoreboard

Related Articles

GitHub uses LLM context to cut secret-scanning false positives 75.76%
AI X/Twitter Jun 21, 2026 1 min read

Anthropic and Mozilla Detail 22 Firefox Vulnerabilities Found by Claude
AI X/Twitter Mar 10, 2026 1 min read

No AI lab clears C+: safety index puts weakened pledges on the scoreboard