Google’s new speech model moves control from hidden settings into the text itself: audio tags can steer style, pace, and delivery across 70+ languages. Gemini 3.1 Flash TTS is in preview through Gemini API, Google AI Studio, and Vertex AI, reaches Google Vids users, scores 1,211 Elo on Artificial Analysis, and watermarks outputs with SynthID.
#gemini
RSS FeedGoogle has turned Gemini into a native Mac resident instead of a browser tab, with Option + Space bringing up the assistant and screen sharing extending it to whatever is open on the desktop. The April 15 launch also makes the app free globally for machines running macOS 15 or later.
Google DeepMind's latest robotics model pushes a hard industrial task from 23% to 93% accuracy when agentic vision is enabled, putting a concrete number on embodied reasoning progress. The April 14 release also puts Gemini Robotics-ER 1.6 into the Gemini API and Google AI Studio, so developers can test the upgrade immediately.
Google DeepMind is pushing embodied reasoning closer to deployable robotics, not just lab demos. In the linked thread and blog post, Gemini Robotics-ER 1.6 reaches 93% on instrument reading with agentic vision and improves injury-risk detection in video by 10% over Gemini 3.0 Flash.
Google is no longer treating AI memory as a niche add-on. By bringing Gemini Personal Intelligence to India, it is testing whether a model that reads Gmail, Photos, and watch history can become a daily assistant in one of its biggest markets.
On April 8, 2026, Gemini introduced notebooks as a new project layer for grouping past chats, files and instructions. Google says notebooks sync with NotebookLM and are rolling out first on the web for Google AI Ultra, Pro and Plus subscribers.
On April 9, 2026, Gemini said it can now generate interactive visualizations directly in chat. Google’s product page says the rollout adds functional simulations, adjustable parameters and 3D exploration to the Gemini app for global users on the Pro model.
Google introduced notebooks in Gemini on April 8, 2026, adding a shared workspace that syncs chats and source files with NotebookLM. The initial rollout starts on the web for Google AI Ultra, Pro, and Plus subscribers, with mobile, more European countries, and free users to follow.
Google said on March 27, 2026 that Google Translate's Live translate with headphones is now on iOS and expanding to more countries for both Android and iOS users. Google's official product pages say the feature supports 70+ languages, works with any pair of headphones, and builds on Gemini speech-to-speech translation designed to preserve tone, emphasis, and cadence.
Google on April 8 began rolling out Gemini for Home early access in Japan. The update moves Google Home from fixed commands toward conversational control, AI camera summaries, and natural-language video search.
Google said on March 26, 2026 that Search Live is expanding to every language and country where AI Mode is already available. The rollout reaches more than 200 countries and territories and uses Gemini 3.1 Flash Live to make search more conversational, voice-first, and camera-aware.
A Hacker News thread pushed a GitHub repo claiming it can detect and weaken Gemini image SynthID watermarks using signal processing alone. The more important debate was not the headline claim itself, but whether the project had been validated against Google's own detector and what that says about watermark-based provenance overall.