Anthropic is trying to make AI jailbreaks measurable, not just viral. Its July 2 framework separates minor bypasses from universal failures, adds a HackerOne path for Fable 5 reports, and says one new classifier blocks the Amazon-reported technique in over 99% of cases.