Anthropic’s vuln harness is more workshop jig than boxed scanner
Original: Anthropic's open-source framework for AI-powered vulnerability discovery View original →
Anthropic’s Defending Code Reference Harness is not presented as a maintained product. The README says the repository is not maintained and is not accepting contributions. That makes the release more useful as a worked example: how should a team wrap an LLM when the goal is vulnerability discovery rather than ordinary code review?
The reference pipeline follows a recon → find → verify → report → patch loop. Its default target is C/C++ memory vulnerability work with Docker and ASAN, while autonomous runs are expected to happen inside a gVisor sandbox. The repo also includes Claude Code skills for threat modeling, scanning, triage, customization, and patch drafting, separating interactive human-reviewed work from runs that execute target code.
The HN discussion quickly moved to practical questions. One prominent comment compared tools like this to shop jigs: valuable for ideas, but often rebuilt around a team’s own interface, target definitions, and alerting style. Another thread focused on operating cost, pointing to the README’s rough token-rate guidance and asking what parallel agents would cost with stronger models.
That is the real takeaway. A vulnerability agent is only partly about the model. The surrounding harness decides what counts as a finding, how crashes are reproduced, how duplicates are removed, which sandbox boundaries are enforced, and when a patch is considered verified. Anthropic also points readers toward its hosted Claude Security product, but the open repo is most interesting as a blueprint for teams that want to build their own pipeline and make those tradeoffs explicit.
Related Articles
Anthropic’s May 29 platform notes move Claude Managed Agents deeper into AWS operations. Webhooks, multiagent orchestration, and self-hosted sandboxes are now available on Claude Platform on AWS, with new IAM actions and a managed policy for self-hosted execution.
Claude Opus 4.8 is showing its strongest early signal in agentic work, not only coding. Artificial Analysis says the model scored 1890 on GDPval-AA, 121 points ahead of GPT-5.5 xhigh.
Anthropic said on February 25, 2026 that it acquired Vercept to strengthen Claude’s computer use capabilities. The company tied the deal to Sonnet 4.6’s rise to 72.5% on OSWorld and its broader push toward agent systems that can act inside live applications.