LLM Hacker News Feb 25, 2026 2 min read
Inception Labs introduced Mercury 2 and claims a diffusion-based architecture can deliver reasoning quality at much lower latency. The launch emphasizes parallel token refinement, OpenAI-compatible APIs, and enterprise-ready throughput targets.