Google DeepMind’s April 2, 2026 X thread introduced Gemma 4 as a new open model family built for reasoning and agentic workflows. Google says the lineup spans E2B, E4B, 26B MoE, and 31B Dense, and adds native function calling, structured JSON output, and longer context windows.
r/LocalLLaMA pushed Gemma 4 into one of the strongest community signals in this crawl as Google shipped an open model family spanning edge devices through workstation-class local servers.
Google said on April 2, 2026 that Gemma 4 is its most capable open model family so far, built from the same technology base as Gemini 3. Google says the family spans E2B, E4B, 26B MoE, and 31B Dense models, adds function-calling and structured JSON support, and offers up to 256K context with an Apache 2.0 license.
Google DeepMind said on March 26, 2026 that Gemini 3.1 Flash Live is rolling out in Gemini Live and Google Search Live, while developers can access it through Google AI Studio. Google’s announcement positions 3.1 Flash Live as its highest-quality audio model, with lower latency, improved tonal understanding, and benchmark gains including 90.8% on ComplexFuncBench Audio.
Google expanded Search Live on March 26, 2026 to every language and location where AI Mode is available. The move pushes multimodal voice-and-camera search to more than 200 countries and territories and gives Gemini’s live audio stack a much larger real-world footprint.
Google said on March 11, 2026 that it has completed the Wiz acquisition and will bring the company into Google Cloud while keeping Wiz available across all major clouds. The company is positioning the deal as a multicloud and AI security move as attackers increasingly use AI to scale threats.
Google DeepMind said on March 26, 2026 that it is releasing a public toolkit to measure harmful manipulation by AI systems. The company says the work spans nine studies with more than 10,000 participants and now informs safety evaluations for models including Gemini 3 Pro.
Google Research said on March 12, 2026 that it is rolling out urban flash flood forecasts with up to 24 hours of advance notice. The system uses a Groundsource dataset built from public reports and extends Google’s flood coverage beyond river flooding toward rapid-onset urban disasters.
Google said on March 10, 2026 that research with Imperial College London and the NHS found its mammography system identified 25% of interval cancers that conventional screening had missed. The company also said a second study suggests AI could reduce screening workload by about 40% when used as a second reader.
Google Research and Google DeepMind published a real-world feasibility study of AMIE in ambulatory primary care with Beth Israel Deaconess Medical Center. The study found AMIE roughly on par with primary care physicians on overall management plans and differential diagnoses, while also documenting important practical limitations.
Google introduced Gemini 3.1 Flash Live on Mar 26, 2026 as its new real-time audio model for developers, enterprises, and consumer products. The release ties together the Gemini Live API, Gemini Enterprise for Customer Experience, Search Live, and Gemini Live around a single lower-latency voice stack.
Google Research introduced TurboQuant on March 24, 2026 as a compression approach for KV cache and vector search bottlenecks. Hacker News pushed the post to 491 points and 129 comments, reflecting how central memory efficiency has become for long-context inference.