#coding-agents

AI Hacker News Apr 7, 2026 2 min read

Launch HN Puts Freestyle’s Live-Forked Agent Sandboxes on the Map

A Launch HN post with around 260 points introduced Freestyle as infrastructure for coding agents, highlighting sub-second VM startup, live forking of running sandboxes, pause-and-resume persistence, built-in git hosting, and full Linux VMs intended for agent platforms rather than lightweight demo containers.

#coding-agents #sandbox #virtualization

LLM Hacker News Apr 6, 2026 2 min read

Hacker News Highlights a Six-Part Blueprint for Coding Agents

Sebastian Raschka's April 4, 2026 article argues that coding-agent quality is shaped as much by the harness as by the base model. He breaks the stack into six components: live repo context, prompt and cache reuse, structured tools, context reduction, session memory, and bounded subagents. Hacker News treated it as a practical framework for understanding why products like Codex and Claude Code feel stronger than plain chat.

#coding-agents #agent-harness #repo-context

LLM sources.twitter Apr 5, 2026 2 min read

Cursor details Composer 2’s training stack, from continued pretraining to real-world RL

Cursor said on March 26, 2026 that real-time reinforcement learning lets it ship improved Composer 2 checkpoints every five hours. Cursor’s March 27 technical report says the model combines continued pretraining on Kimi K2.5 with large-scale RL in realistic Cursor sessions, scores 61.3 on CursorBench, and runs on an asynchronous multi-region RL stack with large sandbox fleets.

#cursor #composer-2 #reinforcement-learning

AI Hacker News Apr 3, 2026 2 min read

Hacker News Pushes Cursor 3 as a Unified Workspace for Coding Agents

Cursor 3 reframes AI coding as multi-agent orchestration, combining local and cloud agents, multi-repo context, and PR-oriented workflows in a single interface.

#cursor #coding-agents #developer-tools

AI Reddit Mar 31, 2026 2 min read

r/singularity Debates Meta-Harness After Its TerminalBench 2 Lead Over Claude Code

A r/singularity post with 286 upvotes and 57 comments spotlighted Meta-Harness claiming a clear TerminalBench 2 lead over Claude Code. The discussion centered on what a harness is, whether AI-designed harnesses can beat manual iteration, and whether open models will get the same treatment.

#reddit #meta-harness #terminalbench-2

LLM sources.twitter Mar 30, 2026 2 min read

Cursor publishes the Composer 2 technical report detailing continued pretraining and large-scale RL for coding agents

Cursor has published the Composer 2 technical report, outlining its code-focused continued pretraining, large-scale reinforcement learning pipeline, and CursorBench-led evaluation strategy. The report offers an unusually detailed first-party look at how a production coding agent is trained and measured.

#cursor #composer-2 #coding-agents

AI Reddit Mar 30, 2026 2 min read

r/singularity Highlights Cursor’s Five-Hour Real-Time RL Loop for Composer

A March 29 r/singularity thread amplified Cursor's claim that Composer checkpoints can now be trained from live user interactions and shipped every five hours, with reward-hacking fixes treated as part of the story rather than an afterthought.

#cursor #reinforcement-learning #coding-agents

LLM Hacker News Mar 28, 2026 2 min read

Hacker News spotlights ATLAS and the economics of local coding agents

A Hacker News post pushed ATLAS into the spotlight by framing a consumer-GPU coding agent as a serious cost challenger to hosted systems. The headline benchmark is interesting, but the repository itself makes clear that its 74.6% result is not a controlled head-to-head against Claude 4.5 Sonnet because the task counts and evaluation protocols differ.

#coding-agents #benchmarks #local-inference

LLM sources.twitter Mar 27, 2026 2 min read

OpenAI expands Codex access and turns plugins into reusable workflow packages

OpenAIDevs said on March 27, 2026 that Codex usage limits had been reset across plans so users could try newly launched plugins. OpenAI's Help Center says Codex is temporarily available on Free and Go, paid plans are getting 2x rate limits, and plugins package reusable workflows built from skills, app integrations, and MCP configurations.

#openai #codex #plugins

LLM Hacker News Mar 26, 2026 2 min read

A public dashboard turns Claude Code’s GitHub footprint into a measurable trend

An independent Claude Code dashboard says its since-launch view now covers more than 20.8 million observed commits, over 1.08 million active repositories, and 114,785 new original repositories in the last seven days. Hacker News drove the link to 274 points and 164 comments as users debated what metrics can actually capture AI coding adoption.

#claude-code #github #coding-agents

LLM Reddit Mar 25, 2026 2 min read

r/artificial highlights ATLAS reaching 74.6% LiveCodeBench on a $500 GPU

r/artificial focused on ATLAS because it shows how planning, verification, and repair infrastructure can push a frozen 14B local model far closer to frontier coding performance.

#atlas #livecodebench #local-inference

AI Hacker News Mar 21, 2026 2 min read

Hacker News Revisits Code Discipline as AI Coding Agents Start Reshaping Repositories

A Hacker News discussion around the “AI Code” manifesto argues that teams adopting AI coding agents need tighter architectural rules, from small semantic functions to data models that make invalid states hard to represent.

#ai-engineering #coding-agents #code-quality