Cloudflare is trying to make model choice less sticky: AI Gateway now routes Workers AI calls to 70+ models across 12+ providers through one interface. For agent builders, the important part is not the catalog alone but spend controls, retry behavior, and failover in workflows that may chain ten inference calls for one task.
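The retry-then-failover pattern that matters in those multi-call workflows can be sketched in a few lines. This is a generic illustration, not Cloudflare's actual gateway API: the provider names and callables below are stand-ins for gateway-routed model backends.

```python
import time

def call_with_failover(providers, prompt, retries=1, backoff=0.1):
    """Try each backend in order; retry transient failures before failing over.

    `providers` is an ordered list of (name, callable) pairs. Names and shapes
    here are illustrative only -- a real gateway handles this server-side.
    """
    errors = []
    for name, call in providers:
        for attempt in range(retries + 1):
            try:
                return name, call(prompt)
            except Exception as exc:  # in practice, catch provider-specific errors
                errors.append((name, attempt, str(exc)))
                time.sleep(backoff * (2 ** attempt))  # exponential backoff
    raise RuntimeError(f"all providers failed: {errors}")

# Stub backends: the primary always times out, the fallback succeeds.
def flaky(prompt):
    raise TimeoutError("upstream timeout")

def stable(prompt):
    return f"echo: {prompt}"

name, result = call_with_failover(
    [("provider-a", flaky), ("provider-b", stable)], "ping", retries=1, backoff=0.0
)
# name == "provider-b", result == "echo: ping"
```

The point of routing through one interface is that this loop (and the spend accounting around it) lives in one place instead of being reimplemented per provider SDK.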
OpenAI’s updated Agents SDK adds a model-native harness and native sandbox execution so agents can inspect files, run commands, edit code, and continue across longer tasks. It launches generally available in Python with support for sandbox providers including Blaxel, Cloudflare, Daytona, E2B, Modal, Runloop, and Vercel.
Amjad Masad said on April 10, 2026 that Accenture is investing in Replit, adopting it internally, and partnering to bring secure vibecoding to enterprises globally. The public first-party disclosure is brief, but the combination of capital, internal deployment, and access to a consulting organization with 700,000-plus employees makes this a meaningful enterprise distribution move for AI-assisted software creation.
OpenAI said on March 24, 2026 that it is publishing prompt-based teen-safety policies designed for gpt-oss-safeguard and other reasoning models. The initial release covers six risk areas and was developed with input from Common Sense Media and everyone.ai.
Google introduced Gemini 3.1 Flash-Lite on March 3, 2026 as the fastest and most cost-efficient model in the Gemini 3 series, targeting high-volume developer workloads. It is rolling out in preview through the Gemini API in Google AI Studio and Vertex AI at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with Google claiming a 2.5x faster time to first answer token, 45% higher output speed, and stronger benchmark scores than the prior 2.5 Flash tier.
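At those listed rates, per-request cost is simple arithmetic. A back-of-envelope estimator using the announced preview prices (the function and workload numbers are illustrative):

```python
# Listed preview prices for Gemini 3.1 Flash-Lite (per the announcement).
INPUT_PER_M = 0.25   # USD per 1M input tokens
OUTPUT_PER_M = 1.50  # USD per 1M output tokens

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Back-of-envelope cost in USD at the listed per-million-token rates."""
    return (input_tokens / 1_000_000) * INPUT_PER_M \
         + (output_tokens / 1_000_000) * OUTPUT_PER_M

# Example: a 10M-input / 1M-output daily workload.
daily = estimate_cost(10_000_000, 1_000_000)  # 10 * 0.25 + 1 * 1.50 = 4.00 USD
```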
Google introduced Project Spend Caps, revamped Usage Tiers, and new billing dashboards for Gemini API developers in AI Studio. The update is aimed at making cost control and scaling behavior more predictable for teams moving into paid usage.
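A project-level cap is enforced server-side, but teams often pair it with a client-side soft limit so requests stop (or degrade) before the hard cap trips. A minimal sketch of that pattern; the `SpendGuard` class is hypothetical and unrelated to Google's actual API:

```python
class SpendGuard:
    """Client-side budget check to complement a server-enforced spend cap.

    Hypothetical helper: tracks a local cost estimate and refuses further
    calls once a soft limit would be exceeded.
    """

    def __init__(self, cap_usd: float):
        self.cap_usd = cap_usd
        self.spent_usd = 0.0

    def charge(self, cost_usd: float) -> bool:
        """Record the cost if it fits under the cap; return False otherwise."""
        if self.spent_usd + cost_usd > self.cap_usd:
            return False  # over budget: caller should stop or fall back
        self.spent_usd += cost_usd
        return True

guard = SpendGuard(cap_usd=5.00)
ok = guard.charge(4.00)       # within the $5 cap
blocked = guard.charge(2.00)  # would exceed the cap, so it is refused
```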
On March 5, 2026, OpenAI introduced GPT-5.4 as a flagship model focused on relevance, contextual understanding, and instruction following. In the API, it pairs a 1M-token context window with stronger tool search for long, multi-tool workflows.
OpenAI Developers posted on March 12, 2026 that the Video API now supports a broader Sora 2 workflow. The update adds reusable characters, video extensions, longer clips, portrait and landscape exports, and batch processing for studio-style pipelines.
Google introduced the Developer Knowledge API and an open-source MCP Server on February 4, 2026. The tools are meant to connect internal documentation, public URLs, and other team knowledge sources to Gemini Code Assist and AI-agent workflows with less custom plumbing.
Google DeepMind said Gemini 3.1 Flash-Lite is rolling out in preview through the Gemini API and Google AI Studio. The company positioned it as the most cost-efficient Gemini 3 model, with lower price, faster performance, and tunable thinking levels.
Anthropic said Claude Code now includes Code Review, a feature that dispatches multiple agents on every pull request. The feature is in research preview for Team and Enterprise plans, with the company pitching depth-first reviews rather than lightweight skims.