AI coding slows down when review becomes the product
Original: Using AI to write better code more slowly View original →
Nolan Lawson’s post argues against the default framing of AI coding as a race to produce more code faster. The more interesting use case is slower: ask LLMs to inspect pull requests, compare findings across models, and let humans decide which issues are real. In that workflow, the gain is not raw output. It is earlier detection of defects, missing tests, accessibility problems, and architectural drift.
The suggested pattern is deliberately redundant. A team can run several reviewers over the same change, such as a Claude sub-agent, Codex, and Cursor Bugbot, then synthesize the findings into one human-reviewed report. The point is not to accept every model complaint. It is to use disagreement and overlap as a filter. If multiple systems find the same failure mode, it deserves attention. If one model invents a problem, the human review step should catch it.
The Hacker News discussion added practical texture. Some developers described a workflow where they first talk through architecture, turn that into a spec, implement against smaller tasks, and then ask separate model sessions to inspect error handling, concurrency, and edge cases. Others pushed back that agentic coding can erase the small design decisions programmers normally make while writing. That objection matters: a tool that reaches the goal too eagerly can still leave maintainers with low confidence in how the code fits the project.
The takeaway is to split “AI coding” into distinct jobs. Drafting code, reviewing code, designing tests, checking security, and challenging architecture are not the same task. Models may be mediocre at autonomous end-to-end implementation while still being useful at targeted critique. Slower loops can look inefficient in the moment, but they make the cost visible before defects reach production.
Source: Nolan Lawson’s post. Community context: Hacker News discussion.
Related Articles
HN did not treat this as abstract legal trivia. Once the Claude Code leak became the hook, the thread turned into a practical question for every team shipping AI-assisted software: if the model wrote the bulk of it, what is actually yours?
Alibaba's Qwen team has released Qwen3.7-Max, an agent-focused frontier LLM. It ranks 5th on Artificial Analysis's Intelligence Index, nearly matching GPT 5.4, and is available as both an API and open weights.
Forge is a new open-source Python framework that applies structured guardrails to self-hosted LLMs. The best config — Ministral-3 8B Q8 — jumps from a 53% baseline to 86.5% on the 26-scenario eval suite, with 99% achievable on agentic tasks.
Comments (0)
No comments yet. Be the first to comment!