Skip to content
LLM X/Twitter Mar 30, 2026 2 min read

OpenAI Developers said on March 30, 2026 that Perplexity has been running voice experiences with the Realtime API in production and published lessons from that work. The post says Perplexity now handles millions of monthly voice sessions and details how the team changed context chunking, standardized audio formats, and tuned turn-taking for noisy real-world environments.

LLM X/Twitter Mar 30, 2026 2 min read

Google DeepMind said on March 26, 2026 that Gemini 3.1 Flash Live is rolling out in Gemini Live and Google Search Live, while developers can access it through Google AI Studio. Google’s announcement positions 3.1 Flash Live as its highest-quality audio model, with lower latency, improved tonal understanding, and benchmark gains including 90.8% on ComplexFuncBench Audio.

LLM Reddit Mar 30, 2026 2 min read

A March 2026 r/LocalLLaMA post with 126 points and 45 comments highlighted a practical guide for running Qwen3.5-27B through llama.cpp and wiring it into OpenCode. The post stands out because it covers the operational details that usually break local coding setups: quant choice, chat-template fixes, VRAM budgeting, Tailscale networking, and tool-calling behavior.