Automating alignment research is moving from concept to measured experiment. Anthropic says a Claude Opus 4.6 researcher recovered 97% of the weak-to-strong supervision gap at roughly 1/100 the human time cost.
LLM
RSS FeedLocalLLaMA reacted because the post attacks a very real pain point for running large MoE models on limited VRAM. The author tested a llama.cpp fork that tracks recently routed experts and keeps the hot ones in VRAM for Qwen3.5-122B-A10B, reporting 26.8% faster token generation than layer-based offload at a similar 22GB VRAM budget.
LocalLLaMA reacted because the joke-like idea of an LLM tuning its own runtime came with concrete benchmark numbers. The author says llm-server v2 adds --ai-tune, feeding llama-server help into a tuning loop that searches flag combinations and caches the fastest config; on their rig, Qwen3.5-27B Q4_K_M moved from 18.5 tok/s to 40.05 tok/s.
HN reacted because this was less about one wrapper and more about who gets credit and control in the local LLM stack. The Sleeping Robots post argues that Ollama won mindshare on top of llama.cpp while weakening trust through attribution, packaging, cloud routing, and model storage choices, while commenters pushed back that its UX still solved a real problem.
Lightning OPD attacks a practical bottleneck in on-policy distillation: keeping a live teacher model running throughout training. The paper reports 69.9% on AIME 2024 from Qwen3-8B-Base in 30 GPU hours, a 4.0x speedup over standard OPD.
The Reddit thread is not about mourning TGI. It reads like operators comparing notes after active momentum shifted away from it, with most commenters saying vLLM is now the safer default for general inference serving because the migration path is lighter and the performance case is easier to defend.
HN reacted less to the launch itself than to the question behind it: can AI finally do useful spreadsheet work inside Excel instead of opening one more chat panel? OpenAI’s beta add-in writes directly in the workbook, explains referenced cells, asks permission before edits, and that raised expectations immediately.
HN did not stay on the word steal for long. The real argument was whether an AI agent can spend a user’s paid LLM credits and GitHub identity on upstream maintenance without a hard opt-in, because once that happens the problem stops being clever automation and becomes consent.
Mistral is turning connectors from glue code into a platform feature: built-in connectors and custom MCP servers now sit inside Studio and can be called across conversations, completions, and agents. The April 15 release also adds direct tool calling and requires_confirmation, making enterprise integration and approval flows part of the product instead of application scaffolding.
r/artificial latched onto this because it turned a vague complaint about Claude feeling drier and more evasive into a pile of concrete counts. The post is not an official benchmark, but that is exactly why it traveled: it reads like a field report from someone with enough logs to make the frustration measurable.
LocalLLaMA paid attention because MiniMax tried to cool down the M2.7 license anxiety, but the thread still read the wording as muddy. What people wanted was not a softer tone, it was a clear answer on what self-hosted commercial use actually permits.
Reuters’ new Mythos analysis argues banks are staring at a timing problem, not a distant risk. Officials in the U.S., Canada, and Britain have already met with banking leaders, and Anthropic says the model found thousands of high and critical vulnerabilities.