Articles

All AI LLM Humanoid Robots Sciences Gaming Finance

Source:

From To

LLM 21h ago 3 min read

Claude Opus 5 puts near-Fable coding power at half the cost

The new Claude default for high-end daily work shifts the model race toward performance per dollar. Anthropic says Opus 5 approaches Claude Fable 5 on coding and knowledge work while keeping API pricing at $5/M input and $25/M output tokens.

#anthropic #claude #coding-agents

LLM Hacker News 1d ago 1 min read

Why the Software Factory Debate Is Really About Review, Not More Harnesses

The post pushed back on the idea that more loops can remove human judgment from production coding workflows.

#coding-agents #software-factory #code-review

LLM X/Twitter Jul 19, 2026 1 min read

OpenInterpreter brings a Rust Kimi K3 harness to coding agents

Open coding agents are maturing around harnesses, protocols, and SDK compatibility. OpenInterpreter says its Kimi K3 native harness is written in Rust, Apache licensed, and compatible with ACP and the Codex SDK.

#openinterpreter #kimi-k3 #rust

LLM Hacker News Jul 18, 2026 1 min read

LM Studio Bionic turns open models into a desktop agent workflow

HN’s interest landed on the tradeoff Bionic represents: local models, cloud fallback, coding workflows, and a closed-source desktop app all in one package.

#lm-studio #open-models #coding-agents

LLM Hacker News Jul 18, 2026 1 min read

Grok Build goes open source, and the debate jumps straight to trust

HN treated Grok Build less as a feature drop and more as a test of control: an open Rust coding agent is useful, but telemetry, forks, and provider lock-in shaped the discussion.

#grok #xai #coding-agents

LLM Hacker News Jul 14, 2026 1 min read

Clawk gives coding agents a disposable Linux VM instead of your laptop

Coding agents need real execution rights to be useful, but handing them a local machine is uncomfortable. Clawk drew HN interest by moving that work into a disposable, network-restricted VM.

#coding-agents #sandboxing #vm

LLM X/Twitter Jul 10, 2026 2 min read

OpenAI says 30% of SWE-Bench Pro is broken and drops its recommendation

OpenAI says SWE-Bench Pro no longer reliably measures frontier coding capability after finding 30% of its public tasks broken. The cited issues include hidden requirements, contradictory instructions, strict tests and incomplete grading criteria.

#openai #swe-bench #coding-agents

LLM Hacker News Jul 6, 2026 1 min read

Clean code may not make coding agents pass more, but it makes them wander less

The HN debate centered on measurement: equal pass rates do not mean equal agent cost or navigation behavior.

#coding-agents #software-engineering #maintainability

LLM X/Twitter Jul 6, 2026 2 min read

Databricks Omnigent coordinates multiple coding agents in one workflow

AI coding is shifting from picking one assistant to orchestrating several agents. Omnigent is an open-source meta-harness with shared sessions, guardrails, and human-in-the-loop workflows.

#databricks #coding-agents #open-source

LLM X/Twitter Jul 3, 2026 2 min read

GitHub makes Kimi K2.7 Code Copilot's first open-weight choice

Copilot now has its first selectable open-weight model. GitHub says Kimi K2.7 Code starts in VS Code for Pro tiers, with Business and Enterprise admins required to enable it by policy.

#github #copilot #kimi

LLM Hacker News Jun 30, 2026 1 min read

Ornith-1.0 tests the open-model bar for agentic coding

HN interest centered on whether the model feels useful in real coding loops, not just on the benchmark table.

#ornith #coding-agents #open-models

LLM Hacker News Jun 30, 2026 1 min read

GLM 5.2 tops Claude Code in Semgrep security benchmark

The community focused on a practical signal: an open-weight model beating Claude Code on an IDOR detection test.

#glm #security #benchmark