Hugging Face has launched Storage Buckets, a mutable and non-versioned object storage layer for checkpoints, processed data, logs, and agent traces on the Hub. The company says Xet-based deduplication and cloud pre-warming should make large ML workflows faster and cheaper to operate.
#infrastructure
RSS FeedMeta said on March 11, 2026 that it is developing and deploying four new generations of MTIA custom chips within the next two years. The company is positioning MTIA as a central part of its AI infrastructure strategy for ranking, recommendations, and GenAI inference workloads.
OpenAI said it raised $110B at a $730B pre-money valuation and added Amazon and NVIDIA to a broader infrastructure push. The company tied the financing to fast growth across Codex, ChatGPT, and enterprise deployments.
Meta said on March 11, 2026 that it is accelerating its in-house MTIA roadmap across four generations, from MTIA 300 through MTIA 500. The company is using custom silicon to push harder on ranking, recommendation, and especially GenAI inference economics at Meta scale.
On March 6, 2026, OpenAI reposted a message from Sachin Katti saying construction is underway in Port Washington, Wisconsin. The post turns OpenAI’s previously announced Stargate and partner-led compute strategy into a visible on-the-ground build milestone.
Together AI said on March 12, 2026 that it is launching a one-cloud stack for real-time voice agents. Its public materials describe co-located STT, LLM, and TTS infrastructure with under-500ms latency, 25+ regions, and separate kernel work that cut time-to-first-64-tokens to 77ms in a voice-agent deployment.
Anthropic said on March 10, 2026 that Sydney will become its fourth office in Asia-Pacific. The expansion is aimed at enterprise, startup, and research customers in Australia and New Zealand, with local compute options under consideration for data residency needs.
Meta says custom silicon is critical to scaling next-generation AI and has published a roadmap update for its MTIA family. The company says it accelerated development enough to release four generations in two years as model architectures keep changing faster than traditional chip cycles.
OpenAI and Amazon announced a multi-year strategic partnership on February 27, 2026. The deal combines a new Amazon Bedrock-based Stateful Runtime Environment, exclusive third-party AWS distribution for OpenAI Frontier, approximately 2 gigawatts of Trainium capacity, and a $50 billion Amazon investment.
NVIDIAAI says it is partnering with Thinking Machines to deploy at least 1 gigawatt of NVIDIA Vera Rubin systems for frontier AI training. Thinking Machines frames the partnership as infrastructure for both frontier model training and platforms for customizable AI.
Meta said its long-term AMD agreement will provide up to 6GW of AMD Instinct GPU capacity for AI infrastructure. First shipments are planned for the second half of 2026 on Helios rack-scale systems.
Google used AI Impact Summit 2026 to bundle infrastructure, public-sector grants, and workforce programs into one AI adoption package. The announcements include $15 billion for Indian cloud and AI infrastructure, two new $30 million initiatives for government and science, and a training effort aimed at 720,000 workers.