LLM | Hacker News | Apr 4, 2026 | 2 min read

Mintlify says chunked RAG was too limited for docs exploration, so it built ChromaFs, a virtual filesystem over Chroma that cuts assistant session creation from about 46 seconds to about 100 ms. HN readers were notably receptive to the filesystem-first design and to the argument that agent tooling benefits from interpretable, UNIX-like retrieval.
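To make the filesystem-first idea concrete, here is a minimal sketch of what UNIX-like retrieval over a docs store could look like. It is not Mintlify's implementation and does not use Chroma's API: the class `ToyDocFs`, its methods, and the in-memory dict standing in for the backing collection are all hypothetical and exist only for illustration.

```python
import re
from posixpath import normpath


class ToyDocFs:
    """Hypothetical in-memory stand-in for a docs store exposed as a filesystem."""

    def __init__(self, pages):
        # pages maps an absolute path such as "/guides/auth.md" to page text.
        # A real system would back this with a document store, not a dict.
        self.pages = {normpath(path): text for path, text in pages.items()}

    def ls(self, directory="/"):
        """List the immediate children of a directory, like `ls`."""
        prefix = normpath(directory).rstrip("/") + "/"
        children = set()
        for path in self.pages:
            if path.startswith(prefix):
                head, sep, _ = path[len(prefix):].partition("/")
                children.add(head + sep)  # trailing "/" marks a subdirectory
        return sorted(children)

    def cat(self, path):
        """Return the full text of one page, like `cat`."""
        return self.pages[normpath(path)]

    def grep(self, pattern):
        """Return (path, line number, line) for every matching line, like `grep -n`."""
        regex = re.compile(pattern)
        hits = []
        for path in sorted(self.pages):
            for number, line in enumerate(self.pages[path].splitlines(), start=1):
                if regex.search(line):
                    hits.append((path, number, line))
        return hits
```

An agent given `ls`, `cat`, and `grep` can explore the docs tree step by step, and every step is a path or a matching line that a human can inspect. That is the interpretability argument in miniature.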

LLM | X/Twitter | Apr 3, 2026 | 2 min read

Google AI said on March 26, 2026, that Gemini 3.1 Flash Live is launching for developers building real-time voice and vision agents. Google highlighted faster natural dialogue, better task completion in noisy environments, and stronger complex-instruction following, while its Live API docs describe low-latency multimodal streaming with tool use and 70-language support.

LLM | X/Twitter | Apr 3, 2026 | 2 min read

Mistral said on April 2, 2026, that developers can assemble a web-search-enabled speech-to-speech assistant in roughly 150 lines of code using Voxtral for transcription and speech generation plus Mistral Small 4 for agentic reasoning. The post is notable less as a single model launch than as a clear reference architecture for real-time audio agents.
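The shape of such a pipeline can be sketched in a few lines. This is only an illustration of the transcribe, search, reason, speak flow, not Mistral's code: `SpeechAssistant` and its four callables are hypothetical placeholders, no real SDK calls are shown, and the sketch always runs a search, whereas an agentic model would decide for itself when to call the tool.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SpeechAssistant:
    """Hypothetical wiring for a speech-to-speech assistant with web search."""

    transcribe: Callable[[bytes], str]   # speech-to-text stage (assumed interface)
    search: Callable[[str], str]         # web search tool (assumed interface)
    reason: Callable[[str, str], str]    # model turns question + evidence into an answer
    synthesize: Callable[[str], bytes]   # text-to-speech stage (assumed interface)

    def respond(self, audio_in: bytes) -> bytes:
        # Simplification: search on every turn instead of letting the model choose.
        question = self.transcribe(audio_in)
        evidence = self.search(question)
        answer = self.reason(question, evidence)
        return self.synthesize(answer)
```

Keeping each stage behind a plain callable means any one component can be swapped or stubbed in tests without touching the rest of the loop.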