A March 17, 2026 Hacker News post about Get Shit Done reached 404 points and 223 comments. The project presents itself as a lightweight context-engineering and spec-driven workflow for Claude Code, Codex, Gemini CLI, Copilot, and other coding-agent runtimes.
#coding-agents
RSS FeedA March 17, 2026 Hacker News post about GPT-5.4 mini and nano reached 236 points and 143 comments. OpenAI is positioning mini as a fast coding and tool-use model for Codex, the API, and ChatGPT, while nano targets cheaper classification, extraction, and subagent workloads.
A March 16, 2026 Hacker News thread pushed Mistral's Leanstral launch to 277 points and 49 comments, focusing attention on an Apache 2.0 Lean 4 model built for proof engineering rather than generic code generation.
Hacker News Pushes “Agentic Engineering” Forward as Simon Willison Defines the Coding-Agent Workflow
An HN discussion on March 16, 2026 lifted Simon Willison’s new guide chapter on “agentic engineering,” framing coding agents as systems that write and execute code in a loop while humans stay responsible for tools, scope, and verification.
A LocalLLaMA release post presents OmniCoder-9B as a Qwen3.5-9B-based coding agent fine-tuned on 425,000-plus agentic trajectories, with commenters focusing on its read-before-write behavior and usefulness at small model size.
A fresh Hacker News thread pushed Chrome DevTools MCP back into view because it lets coding agents attach to an already running Chrome session instead of starting from a clean browser. Google’s December 11, 2025 update combines Chrome 144 remote-debugging controls with an --autoConnect flow in the MCP server, making it easier to hand an in-progress DevTools investigation to an agent.
A Hacker News thread amplified a March 12 analysis arguing that LLM coding progress looks much weaker when measured by maintainer merge decisions rather than test-passing SWE-bench scores.
METR's March 10, 2026 note argues that about half of test-passing SWE-bench Verified PRs from recent agents would still be rejected by maintainers. HN treated it as a warning that benchmark wins do not yet measure scope control, code quality, or repo fit.
Hacker News elevated Bassim Eledath’s eight-level framework, responding to an article that explains coding-agent performance gaps through workflow maturity instead of model benchmarks.
A high-traction Hacker News thread highlighted Simon Willison’s "Agentic Engineering Patterns" guide, which organizes practical workflows for coding agents. The focus is operational discipline: testing-first loops, readable change flow, and reusable prompts.
A LocalLLaMA post reports that a simple “verify after every edit” loop raised Qwen3.5-35B-A3B from 22.2% to 37.8% on SWE-bench Verified Hard, approaching a cited 40% reference for Claude Opus 4.6.
Software developer Manuel Schipper shares a practical workflow for running 4-8 parallel AI coding agents simultaneously using tmux, Markdown Feature Design files, and slash commands — no orchestrators required.