LLM Apr 16, 2026 2 min read

Cloudflare is trying to make model choice less sticky: AI Gateway now routes Workers AI calls to 70+ models across 12+ providers through one interface. For agent builders, the important part is not the catalog alone but spend controls, retry behavior, and failover in workflows that may chain ten inference calls for one task.

LLM Hacker News Apr 16, 2026 2 min read

HN liked the ambition but went straight for the weak points: marketplace demand, MDM trust, Mac privacy claims, and whether the operator economics are believable. Darkbloom says idle Apple Silicon can serve OpenAI-compatible private inference at lower cost; the thread treated that as an architecture and incentives problem, not just a landing page.