LLM Reddit 4d ago 2 min read
#reasoning-models
A high-scoring LocalLLaMA thread surfaced Sarvam AI's release of two Apache 2.0 reasoning models, Sarvam 30B and Sarvam 105B. The company says both were trained from scratch in India, use Mixture-of-Experts designs, and target reasoning, coding, agentic workflows, and Indian-language performance.
LLM Hacker News Feb 25, 2026 2 min read
Inception Labs introduced Mercury 2, claiming its diffusion-based architecture can deliver reasoning quality at much lower latency. The launch emphasizes parallel token refinement, OpenAI-compatible APIs, and enterprise-ready throughput targets.
LLM Reddit Feb 17, 2026 1 min read
A high-ranking r/singularity post shared Google’s Gemini 3 Deep Think update. The announcement includes benchmark claims such as 48.4% on Humanity’s Last Exam (without tools), 84.6% on ARC-AGI-2, and Codeforces Elo 3455, plus Gemini API early access.