Skip to content
LLM Apr 16, 2026 2 min read

Cloudflare is trying to make model choice less sticky: AI Gateway now routes Workers AI calls to 70+ models across 12+ providers through one interface. For agent builders, the important part is not the catalog alone but spend controls, retry behavior, and failover in workflows that may chain ten inference calls for one task.

LLM Hacker News Apr 16, 2026 2 min read

HN liked the ambition but went straight for the weak points: marketplace demand, MDM trust, Mac privacy claims, and whether the operator economics are believable. Darkbloom says idle Apple Silicon can serve OpenAI-compatible private inference at lower cost; the thread treated that as an architecture and incentives problem, not just a landing page.

LLM Apr 16, 2026 2 min read

OpenAI’s updated Agents SDK adds a model-native harness and native sandbox execution so agents can inspect files, run commands, edit code, and continue across longer tasks. It launches generally available in Python with support for sandbox providers including Blaxel, Cloudflare, Daytona, E2B, Modal, Runloop, and Vercel.