LLM 2d ago 2 min read
Cloudflare says Workers AI has made Kimi K2.5 3x faster for agent workloads. The technical change pushed p90 time per token from roughly 100 ms to 20-30 ms and raised peak input-token cache hit ratios from 60% to 80% with heavy internal users.