Mythos級セキュリティ解析は frontier model 専用か、Reddit が掘り下げた検証

何が起きたのか

r/LocalLLaMA で 587 upvotes と 120 comments を集めた投稿は、AISLE の AI Cybersecurity After Mythos: The Jagged Frontier を広く共有した。記事の主張は明確だ。Anthropic が Mythos と Project Glasswing の発表で強調した脆弱性解析の一部は、もっと小さくて安い model、あるいは open-weights model でもかなり再現できるという。そこから AISLE は、AI cybersecurity の moat は frontier model 単体よりも、system と scaffold、そしてそこに埋め込まれた security expertise にあるのではないかと論じている。

記事が示す evidence は具体的だ。AISLE は、関連 function を切り出した状態では FreeBSD NFS の代表的 bug を 8/8 の model が見つけたと報告する。さらに OpenBSD SACK bug のような難しいケースでも、より小さな model が核心部分の reasoning をかなり回収したとしている。加えて OWASP の false-positive 判別では、いくつかの小型 model が高価な frontier model を上回ったとされ、これが capability frontier は滑らかではなく "jagged" だという結論につながっている。

なぜ Reddit が食いついたのか

Reddit の重要な反応は実験の否定ではなく、前提条件への突っ込みだった。最も共感を集めたコメントの一つは「本当に難しいのは、その vulnerable code を見つけることだ」というものだ。ここが解釈の分岐点である。isolated な function を与えられて reasoning するのと、大きな repository を探索して attack surface を見つけ、正しい code path を絞り、exploitability を検証して patch や exploit chain に落とし込むのは別の仕事だ。さらに別の上位コメントでは、比較対象に使った model 世代の選び方が偏っているのではないかという批判もあった。

それでも thread が重要なのは、議論の軸を「どの lab の model が一番賢いか」からずらしたことだ。security work は modular であり、broad scanning、vulnerability detection、false-positive triage、patch generation、exploit construction は同じようには scaling しない。もし pipeline の一部で小型 model がすでに十分なら、競争力は orchestration、cost structure、tooling、evaluation design に移っていく。

Insights 読者にとっての要点もそこにある。この thread は frontier model が不要だと証明したわけではないし、小型 model が end-to-end autonomous security system を置き換えられるとも示していない。ただし Mythos の launch narrative よりも、AI security product の economics と architecture が開かれている可能性を示している。原文: r/LocalLLaMA, AISLE blog。

Mythos級セキュリティ解析は frontier model 専用か、Reddit が掘り下げた検証

何が起きたのか

なぜ Reddit が食いついたのか

Related Articles

小型 Open model でも Mythos 級の脆弱性分析は再現できる

Hacker Newsで続いたMythos後の論争: 小さなopen-weight modelでもAI security分析の一部を再現できるのか

Anthropic、Project Glasswingで Mythos Preview を defensive cybersecurity に投入

Related Articles

小型 Open model でも Mythos 級の脆弱性分析は再現できる
AI Hacker News Apr 12, 2026 1 min read

Hacker Newsで続いたMythos後の論争: 小さなopen-weight modelでもAI security分析の一部を再現できるのか
AI Hacker News Apr 13, 2026 1 min read

Anthropic、Project Glasswingで Mythos Preview を defensive cybersecurity に投入
AI Apr 13, 2026 1 min read