Why it matters: long-running agents need memory that survives beyond one prompt without replaying every message. Cloudflare says Agent Memory is in private beta and keeps useful state available without filling the context window.
#workers-ai
RSS FeedCloudflare moved Workers AI into larger-model territory on March 19, 2026 by adding Moonshot AI’s Kimi K2.5. The company is pitching a single stack for durable agent execution, large-context inference, and lower-cost open-model deployment.
Cloudflare said on April 10, 2026 that its global network passed 500 Tbps of external capacity across 330+ cities and now protects more than 20% of the web. The company frames the milestone as both DDoS headroom and a response to the changing traffic mix of AI crawlers and autonomous agents.
Cloudflare said on X on March 19 that Kimi K2.5 is now available on Workers AI. The launch pairs a frontier open-source model with platform features aimed at lowering latency and cost for agent workloads.
Cloudflare said on March 20, 2026 that Kimi K2.5 is now available on Workers AI so developers can run agents end-to-end on its platform. The linked Cloudflare blog says the model ships with a 256K context window, multi-turn tool calling, vision, and structured outputs, and that one internal agent workload cut costs by 77% after the switch.
Cloudflare said on March 20, 2026 that Kimi K2.5 was available on Workers AI so developers could build end-to-end agents on Cloudflare’s platform. Its launch post says the model brings a 256k context window, multi-turn tool calling, vision inputs, and structured outputs, while an internal security-review agent processing 7B tokens per day cut costs by 77% after the switch.
Cloudflare said on March 19, 2026 that Workers AI now supports Moonshot AI's Kimi K2.5. The company is using the model to argue that a unified agent platform can offer both strong tool use and much lower production cost.