HashiCorp founder Mitchell Hashimoto's warning about companies in 'AI psychosis' hit 1,650 points on Hacker News. His thesis: AI's ability to fix bugs quickly is masking accumulating systemic risk that no one can see in local metrics.
#software-engineering
RSS FeedSimon Willison reflects on how the once-clear line between careless vibe coding and responsible agentic engineering has begun to blur in his own work, raising sharp questions about trust and accountability in AI-assisted software development.
Andrej Karpathy shared highlights from his Sequoia Ascent 2026 fireside chat, arguing that LLMs open genuinely new categories of functionality, not just faster versions of what already existed.
HN latched onto a post that says the real bottleneck in multi-agent coding is coordination, not just model IQ. Once work is split across agents, the old distributed-systems vocabulary starts showing up whether the models are brilliant or not.
Cursor 3 reframes AI coding as multi-agent orchestration, combining local and cloud agents, multi-repo context, and PR-oriented workflows in a single interface.
A March 25, 2026 Hacker News post about Reco's `gnata` rewrite reached 256 points and 237 comments at crawl time. Reco says AI-assisted porting of JSONata 2.x to Go took about 7 hours and $400 in tokens, then removed an RPC-heavy Node fleet and eventually cut roughly $500,000 per year in infrastructure cost.
Anthropic said on March 24, 2026 that a new Engineering Blog post explains how it used a multi-agent harness to improve Claude on frontend design and long-running autonomous software engineering. The write-up separates planning, generation, and evaluation, and reports clear gains over simpler solo-agent runs.
METR's March 10, 2026 note argues that about half of test-passing SWE-bench Verified PRs from recent agents would still be rejected by maintainers. HN treated it as a warning that benchmark wins do not yet measure scope control, code quality, or repo fit.
A LocalLLaMA post pointed to a new Hugging Face dataset of human-written code reviews, pairing before-and-after code changes with inline reviewer comments and negative examples across 37 languages.
Hacker News highlighted SWE-CI, an arXiv benchmark that evaluates whether LLM agents can sustain repository quality across CI-driven iterations, not just land a single passing patch.
The open-source project Memento sparked a heated debate on Hacker News: as AI writes more code, should the AI session itself become part of the commit history? It raises fundamental questions about code provenance in the age of AI-assisted development.
While AI tools have accelerated code production, they have simultaneously expanded engineering responsibilities and raised unspoken expectations, driving burnout and an identity crisis among developers.