The enterprise AI fight is shifting from model selection to stack design. In its April 24, 2026 Cloud Next recap, Google Cloud packaged Gemini Enterprise Agent Platform, Workspace Intelligence, TPU 8t and 8i, and Virgo Network as one coordinated operating layer for AI agents.
#google-cloud
Enterprise AI gets more useful when teams can reuse and inspect workflows instead of rebuilding them in chat every time. Google Cloud said Gemini Enterprise now saves workflows as shared Skills, a day after announcing that Agent Designer can test and approve each step before execution.
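Google has not published an API for Skills, so the following is only a minimal sketch of the pattern the announcement describes: a workflow of named steps that a reviewer must approve before any step executes. All names (`Skill`, `Step`, `approve`) are invented for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class Step:
    name: str
    run: Callable[[dict], dict]
    approved: bool = False

@dataclass
class Skill:
    """A reusable, inspectable workflow: steps run only once approved."""
    name: str
    steps: list[Step] = field(default_factory=list)

    def approve(self, step_name: str) -> None:
        for step in self.steps:
            if step.name == step_name:
                step.approved = True

    def execute(self, context: dict) -> dict:
        # Refuse to run until every step has been reviewed and approved.
        unapproved = [s.name for s in self.steps if not s.approved]
        if unapproved:
            raise PermissionError(f"unapproved steps: {unapproved}")
        for step in self.steps:
            context = step.run(context)
        return context

# Example: a two-step "summarize and route" workflow saved for reuse.
skill = Skill("triage-ticket", [
    Step("summarize", lambda ctx: {**ctx, "summary": ctx["text"][:20]}),
    Step("route", lambda ctx: {**ctx, "queue": "billing"}),
])
skill.approve("summarize")
skill.approve("route")
result = skill.execute({"text": "Customer reports double charge on invoice."})
```

The point of the shape is the gate in `execute`: the workflow is a shared artifact that can be inspected and signed off step by step, rather than an ad-hoc chat transcript.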
Google says its AI business has crossed from pilots to operations: 75% of Cloud customers now use AI products, 330 customers processed more than 1 trillion tokens each in the past year, and model traffic exceeds 16 billion tokens per minute. The company used Cloud Next ’26 to turn that scale into a product pitch for Gemini Enterprise Agent Platform, a full runtime and governance layer for enterprise agents.
Google has redesigned its TPU roadmap around agent workloads instead of one-size-fits-all acceleration. TPU 8t targets giant training runs with nearly 3x per-pod compute and 121 exaflops, while TPU 8i focuses on low-latency inference with 19.2 Tb/s interconnect and up to 5x lower on-chip latency for collectives.
This is less about one more cloud partnership and more about the infrastructure shape of the next agent wave. NVIDIA and Google Cloud say A5X Rubin systems can scale to 80,000 GPUs per site and 960,000 across multisite clusters, while cutting inference cost per token and boosting token throughput per megawatt by up to 10x versus the prior generation.
HN treated TPU 8t and 8i as more than giant datacenter numbers. The thread focused on the bigger shift: agent-era infrastructure is splitting training and inference into separate hardware bets.
Why it matters: Google is turning Vertex AI from a collection of services into a governed agent platform. The linked Google Cloud post says Model Garden gives access to more than 200 models, including Gemini 3.1 Pro, Lyria 3, Gemma 4, and Claude families.
Why it matters: AI infrastructure is moving from single accelerator rentals to managed clusters that resemble supercomputers. Google Cloud said A4X Max bare-metal instances support up to 50,000 GPUs and twice the network bandwidth of earlier generations.
Why it matters: Google Cloud is moving analytics assistants beyond SQL explanation into model-backed analysis. The tweet names two concrete AI functions now reachable from chat: forecasting and anomaly detection.
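BigQuery's real functions are model-backed and run as SQL; purely as an illustration of what an anomaly-detection call returns per row, here is a toy z-score version in Python (the threshold, column names, and method are invented stand-ins, not Google's implementation):

```python
from statistics import mean, pstdev

def detect_anomalies(values: list[float], threshold: float = 2.0) -> list[dict]:
    """Flag points whose z-score exceeds the threshold.

    A toy stand-in for a model-backed anomaly function: real services fit a
    time-series model; this just measures distance from the sample mean.
    """
    mu = mean(values)
    sigma = pstdev(values) or 1.0  # avoid division by zero on a flat series
    return [
        {"index": i, "value": v, "is_anomaly": abs(v - mu) / sigma > threshold}
        for i, v in enumerate(values)
    ]

# A week of order counts with one obvious spike on day 6.
daily_orders = [120, 118, 125, 122, 119, 121, 410, 117]
flags = [row["index"] for row in detect_anomalies(daily_orders) if row["is_anomaly"]]
```

What the chat interface changes is not the math but the entry point: the analyst asks for anomalies and gets back rows shaped like this, without writing the query.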
In an April 10, 2026 X post, Google Cloud Tech resurfaced its Java SDK for the MCP Toolbox for Databases as a path to enterprise-grade agent integrations. The linked blog argues that Java teams can keep Spring Boot, transactional controls, and stateful service patterns while connecting agents to databases through MCP instead of custom glue code.
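The Toolbox SDK in the post is Java, and its actual API is not reproduced here; this Python sketch with sqlite3 only illustrates the pattern the blog argues for: the agent invokes a declared, parameterized tool, and never writes SQL or glue code itself. The tool name and schema below are invented.

```python
import sqlite3

# Each tool is a named, parameterized query declared up front, so the agent
# supplies arguments but never raw SQL (the transactional controls stay
# on the service side, as the blog argues for Spring Boot services).
TOOLS = {
    "find-customer-by-email": {
        "sql": "SELECT id, name FROM customers WHERE email = ?",
        "params": ["email"],
    },
}

def invoke_tool(conn: sqlite3.Connection, tool: str, args: dict) -> list[tuple]:
    spec = TOOLS[tool]
    bound = [args[p] for p in spec["params"]]  # bind only declared params
    return conn.execute(spec["sql"], bound).fetchall()

# Demo against an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, name TEXT, email TEXT)")
conn.execute("INSERT INTO customers VALUES (1, 'Ada', 'ada@example.com')")
rows = invoke_tool(conn, "find-customer-by-email", {"email": "ada@example.com"})
```

The declared-tool shape is what MCP standardizes: the model sees tool names and parameter schemas, while the database connection, bindings, and permissions live in the server.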
Google Cloud Tech highlighted BigQuery’s autonomous embedding generation preview on April 10, 2026, positioning it as a way to keep vector data in sync without separate ETL glue. The documentation shows automatically maintained embedding columns backed by Vertex AI models, plus a preview built-in model path inside BigQuery.
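BigQuery's preview does this declaratively inside the warehouse; as a very loose illustration of what "automatically maintained embedding columns" means, here is a toy table whose write path recomputes the embedding whenever the source text changes, using a deterministic fake in place of a Vertex AI model. Every name here is invented.

```python
import hashlib

def fake_embed(text: str, dims: int = 4) -> list[float]:
    """Deterministic stand-in for a Vertex AI embedding model."""
    digest = hashlib.sha256(text.encode()).digest()
    return [b / 255 for b in digest[:dims]]

class EmbeddedColumnTable:
    """Keeps an embedding column in lockstep with its source text column.

    Mimics (very loosely) the maintained-column idea: there is no separate
    ETL job, because the write path updates text and embedding together.
    """

    def __init__(self):
        self.rows: dict[int, dict] = {}

    def upsert(self, row_id: int, text: str) -> None:
        self.rows[row_id] = {"text": text, "embedding": fake_embed(text)}

table = EmbeddedColumnTable()
table.upsert(1, "quarterly revenue report")
before = table.rows[1]["embedding"]
table.upsert(1, "annual revenue report")   # edit triggers re-embedding
after = table.rows[1]["embedding"]
```

The contrast with "separate ETL glue" is the invariant: at no point can a row's text and its embedding drift apart, because no second pipeline is responsible for the sync.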
GoogleCloudTech posted a demo on March 27, 2026 showing Gemini CLI using Model Context Protocol (MCP) servers to migrate and deploy a full-stack application. Google's September 11, 2025 Gemini CLI extensions post and December 11, 2025 MCP support announcement show that the demo is built on /deploy for Cloud Run, managed MCP endpoints for Google services, and enterprise controls such as IAM, audit logs, and Model Armor.