LLM May 5, 2026 1 min read
Sakana AI released KAME, a tandem speech-to-speech architecture that pairs a low-latency front-end S2S model with a back-end LLM via an oracle stream, achieving MT-Bench 6.43 with near-zero response latency and eliminating the typical 2.1-second pipeline delay.