South Korea is no longer treating AI infrastructure as a private-sector side quest. A ₩400 billion loan from the Financial Services Commission will expand Naver’s Gak Sejong facility, bankroll GPU deployment and give HyperCLOVA X a bigger domestic base just as AI sovereignty becomes industrial policy.
#ai-infrastructure
Alphabet just rewired the AI capital race: $10 billion goes to Anthropic now, at a $350 billion valuation, with another $30 billion tied to performance targets. Coming days after Amazon’s own pledge, the deal shows that frontier labs are no longer raising money in rounds so much as pre-buying compute at planetary scale.
Google has redesigned its TPU roadmap around agent workloads instead of one-size-fits-all acceleration. TPU 8t targets giant training runs with nearly 3x per-pod compute and 121 exaflops, while TPU 8i focuses on low-latency inference with 19.2 Tb/s interconnect and up to 5x lower on-chip latency for collectives.
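The split is easy to picture as a scheduling decision. Below is a minimal Python sketch, assuming hypothetical pool names (`tpu-8t-pod`, `tpu-8i-cluster`) and job fields rather than any real Google Cloud API, that routes throughput-bound training to one pool and latency-bound agent serving to the other.

```python
# Illustrative only: pool names and job fields are invented, not a real API.
from dataclasses import dataclass

@dataclass
class Job:
    name: str
    kind: str               # "train" or "serve"
    latency_slo_ms: float   # latency target, meaningful for serving jobs

def pick_pool(job: Job) -> str:
    # Training runs want pod-scale throughput; agent serving wants
    # low-latency collectives, so each goes to a different pool.
    return "tpu-8t-pod" if job.kind == "train" else "tpu-8i-cluster"

for job in (Job("pretrain-run", "train", 0.0), Job("agent-step", "serve", 20.0)):
    print(job.name, "->", pick_pool(job))
```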
This is less about one more cloud partnership and more about the infrastructure shape of the next agent wave. NVIDIA and Google Cloud say A5X Rubin systems can scale to 80,000 GPUs per site and 960,000 across multisite clusters, while cutting inference cost per token and boosting token throughput per megawatt by up to 10x versus the prior generation.
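To see what a 10x jump in tokens per megawatt means for unit economics, here is a back-of-envelope Python calculation; the baseline throughput and power price are invented for illustration, not NVIDIA or Google figures.

```python
# Back-of-envelope only: both inputs below are assumed, not vendor numbers.
prior_tokens_per_s_per_mw = 100_000   # assumed baseline throughput per MW
power_price_usd_per_mwh = 80.0        # assumed wholesale electricity price

def energy_cost_per_million_tokens(tokens_per_s_per_mw: float) -> float:
    """Electricity cost alone, in USD per 1M tokens, for 1 MW of load."""
    tokens_per_hour = tokens_per_s_per_mw * 3600
    return power_price_usd_per_mwh / tokens_per_hour * 1_000_000

baseline = energy_cost_per_million_tokens(prior_tokens_per_s_per_mw)
rubin = energy_cost_per_million_tokens(prior_tokens_per_s_per_mw * 10)
print(f"baseline: ${baseline:.3f} per 1M tokens, 10x gen: ${rubin:.3f}")
```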
HN treated TPU 8t and 8i as more than giant datacenter numbers. The thread focused on the bigger shift: agent-era infrastructure is splitting training and inference into separate hardware bets.
HN latched onto the RAM shortage because the uncomfortable link is physical: HBM for AI data centers competes for the same DRAM fab capacity as consumer memory, so data-center demand is now shaping prices for phones, laptops, and handhelds.
Why it matters: AI infrastructure is moving from single-accelerator rentals to managed clusters that resemble supercomputers. Google Cloud said A4X Max bare-metal instances support up to 50,000 GPUs and twice the network bandwidth of earlier generations.
HN treated rising GPU costs as more than infrastructure trivia. If frontier access tightens and inference gets pricier, startups may have to compete on procurement, routing, caching, evaluation, and smaller-model strategy rather than assuming abundant calls to the strongest model.
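A minimal sketch of that playbook, assuming a hypothetical `call_model` stub, made-up model names and prices, and a toy in-memory cache: repeated prompts are served from cache, and easy requests are routed to the cheaper model.

```python
# Sketch only: model names, prices, and call_model are hypothetical stand-ins.
import hashlib

PRICES = {"small-model": 0.1, "frontier-model": 3.0}  # assumed $ per 1M tokens
cache: dict[str, str] = {}

def call_model(model: str, prompt: str) -> str:
    return f"[{model} answer to: {prompt[:30]}]"  # placeholder for a real API

def route(prompt: str, hard: bool) -> str:
    key = hashlib.sha256(prompt.encode()).hexdigest()
    if key in cache:                  # caching: repeat prompts cost nothing
        return cache[key]
    model = "frontier-model" if hard else "small-model"  # route by difficulty
    cache[key] = call_model(model, prompt)
    return cache[key]

print(route("Summarize this changelog.", hard=False))
print(route("Summarize this changelog.", hard=False))  # served from cache
```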
Anthropic said on April 7, 2026 that it has signed a deal with Google and Broadcom for multiple gigawatts of next-generation TPU capacity coming online from 2027. The company also said run-rate revenue has surpassed $30 billion and that more than 1,000 business customers now spend over $1 million annually.
A `r/singularity` post highlighted reporting that roughly half of planned U.S. data center projects have been delayed or canceled because transformers, switchgear, batteries, and related power equipment remain supply constrained. The story resonated because it reframes AI expansion as a grid and industrial logistics problem, not only a chip problem.
OpenAI said on March 31, 2026 that it closed a $122 billion funding round at an $852 billion post-money valuation. The company used the announcement to present consumer reach, enterprise growth, API usage, Codex adoption, and compute access as one reinforcing AI platform flywheel.
On March 17, 2026, the NVIDIADC account on X described Groq 3 LPX as a new rack-scale, low-latency inference accelerator for the Vera Rubin platform. NVIDIA’s March 16 press release and technical blog say LPX brings 256 LPUs, 128 GB of on-chip SRAM, and 640 TB/s of scale-up bandwidth into a heterogeneous inference path with Vera Rubin NVL72 for agentic AI workloads.
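Dividing the stated aggregates gives a rough per-LPU picture; the even split across units is an assumption for illustration, not a published spec.

```python
# Simple division of the stated aggregates, assuming resources are spread
# evenly across LPUs; the per-unit figures are arithmetic, not a spec sheet.
lpus = 256
sram_gb = 128
scale_up_tb_s = 640

print(f"SRAM per LPU:      {sram_gb / lpus:.2f} GB")         # 0.50 GB
print(f"Bandwidth per LPU: {scale_up_tb_s / lpus:.2f} TB/s")  # 2.50 TB/s
```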