#realtime

LLM X/Twitter Mar 30, 2026 2 min read

Google rolls out Gemini 3.1 Flash Live across Gemini Live, Search Live, and AI Studio

Google DeepMind said on March 26, 2026 that Gemini 3.1 Flash Live is rolling out in Gemini Live and Google Search Live, while developers can access it through Google AI Studio. Google’s announcement positions 3.1 Flash Live as its highest-quality audio model, with lower latency, improved tonal understanding, and benchmark gains including 90.8% on ComplexFuncBench Audio.

#google #gemini #voice-ai

LLM Mar 27, 2026 2 min read

Google ships Gemini 3.1 Flash Live for lower-latency voice agents and global Search Live

Google introduced Gemini 3.1 Flash Live on Mar 26, 2026 as its new real-time audio model for developers, enterprises, and consumer products. The release ties together the Gemini Live API, Gemini Enterprise for Customer Experience, Search Live, and Gemini Live around a single lower-latency voice stack.

#google #gemini #voice-ai

AI Mar 15, 2026 2 min read

Mistral expands its speech stack with Voxtral Realtime and Voxtral Mini Transcribe V2

Mistral has published Voxtral Realtime and Voxtral Mini Transcribe V2, adding sub-200ms streaming transcription, 13-language support, and open weights for the realtime model. The company also paired the launch with an audio playground in Mistral Studio and aggressive API pricing at $0.003/min and $0.006/min.

#mistral #speech #transcription

106

AI X/Twitter Mar 14, 2026 2 min read

Together AI Packages a One-Cloud Voice-Agent Stack for Real-Time Deployment

Together AI said on March 12, 2026 that it is launching a one-cloud stack for real-time voice agents. Its public materials describe co-located STT, LLM, and TTS infrastructure with under-500ms latency, 25+ regions, and separate kernel work that cut time-to-first-64-tokens to 77ms in a voice-agent deployment.

#voice-agents #inference #realtime