Sakana AI released KAME, a tandem speech-to-speech architecture that pairs a low-latency front-end S2S model with a back-end LLM via an oracle stream, achieving MT-Bench 6.43 with near-zero response latency and eliminating the typical 2.1-second pipeline delay.
#open-source
RSS FeedPoolside AI released Laguna XS.2 on April 28, 2026 under Apache 2.0 — a 33B total/3B active MoE model purpose-built for agentic coding, scoring 68.2% on SWE-bench Verified and deployable on a single consumer GPU.
Released April 29, 2026 under Modified MIT license, Mistral Medium 3.5 consolidates the company's chat, reasoning, and coding models into one 128B dense open-weight model with 256K context, scoring 77.6% on SWE-bench Verified.
This open-source TypeScript tool converts any codebase or knowledge base into an explorable, searchable, and queryable knowledge graph with Q&A capabilities, integrating with Claude Code, Codex, Cursor, Copilot, and Gemini CLI.
Open-source PFlash uses speculative prefill to dramatically cut time-to-first-token for long-context LLM inference, achieving 10.4x speedup on Qwen3.6-27B Q4_K_M with a consumer RTX 3090.
OpenAI has released Symphony, an open-source specification that turns issue trackers like Linear into a control plane for autonomous coding agents. The system assigns a Codex agent per task, handles CI, rebasing, and PR management without human oversight.
HN read Zig's anti-AI contribution rule as a maintainer-time policy: review is for growing trusted humans, and LLM-shaped PRs break that loop.
Warp is opening more than source code. The terminal company put its client under AGPL, moved product planning into public GitHub issues, and says nearly 1 million active developers can now steer Oz-powered agent builds in the open.
Why it matters: kernel work is what decides whether long-context and edge-side agent systems stay theoretical or become cheap enough to run. Qwen says FlashQLA delivers 2-3x forward speedup and 2x backward speedup over the FLA Triton kernel on NVIDIA Hopper.
HN did not treat this as abstract legal trivia. Once the Claude Code leak became the hook, the thread turned into a practical question for every team shipping AI-assisted software: if the model wrote the bulk of it, what is actually yours?
HN treated Ghostty’s GitHub exit as more than a forge move. What hit people was the subtext: when even a maintainer with deep GitHub history decides the relationship is no longer worth it, reliability and focus stop sounding like background complaints.
This matters because Xiaomi just put a frontier-scale model family behind permissive terms instead of a closed API gate. The MiMo-V2.5 release promises a 1M-token context window, MIT licensing for commercial use and fine-tuning, and a Pro variant Xiaomi says leads open models on GDPVal-AA and ClawEval.