#gemini

AI Apr 16, 2026 2 min read

Google’s new speech model moves control from hidden settings into the text itself: audio tags can steer style, pace, and delivery across 70+ languages. Gemini 3.1 Flash TTS is in preview through the Gemini API, Google AI Studio, and Vertex AI, and it reaches Google Vids users. It scores 1,211 Elo on Artificial Analysis and watermarks its outputs with SynthID.

Humanoid Robots Apr 15, 2026 2 min read

Google DeepMind's latest robotics model raises accuracy on a hard industrial task from 23% to 93% when agentic vision is enabled, putting a concrete number on embodied reasoning progress. The April 14 release also brings Gemini Robotics-ER 1.6 to the Gemini API and Google AI Studio, so developers can test the upgrade immediately.

AI X/Twitter Apr 12, 2026 2 min read

Google said on March 27, 2026 that Google Translate's Live translate with headphones is now on iOS and expanding to more countries for both Android and iOS users. Google's official product pages say the feature supports 70+ languages, works with any pair of headphones, and builds on Gemini speech-to-speech translation designed to preserve tone, emphasis, and cadence.

AI Hacker News Apr 10, 2026 2 min read

A Hacker News thread highlighted a GitHub repo claiming it can detect and weaken Gemini image SynthID watermarks using signal processing alone. The more important debate was not about the headline claim itself, but about whether the project had been validated against Google's own detector and what that says about watermark-based provenance overall.