Google introduced Project Spend Caps, revamped Usage Tiers, and new billing dashboards for Gemini API developers in AI Studio. The update is aimed at making cost control and scaling behavior more predictable for teams moving into paid usage.
Perplexity said on March 11, 2026 that its new Agent API combines search, tool execution, and multi-model orchestration behind one managed runtime. The launch positions Perplexity less as a single-answer interface and more as infrastructure for production agent workflows.
OpenAI posted on March 5, 2026 that GPT-5.4 Thinking and GPT-5.4 Pro are rolling out across ChatGPT, the API, and Codex. The launch article positions GPT-5.4 as a professional-work model with 1M-token context, native computer use, stronger tool search, and better spreadsheet, document, and presentation performance.
OpenAI says GPT-5.4 Thinking and Pro are rolling out gradually across ChatGPT, the API, and Codex. The company positions GPT-5.4 as a unified frontier model for professional work with stronger coding, tool use, and 1M-token context.
Google DeepMind said Gemini 3.1 Flash-Lite is rolling out in preview through the Gemini API and Google AI Studio. The company positioned it as the most cost-efficient Gemini 3 model, with lower price, faster performance, and tunable thinking levels.
OpenAI says GPT-5.4 Thinking is shipping in ChatGPT, with GPT-5.4 also live in the API and Codex and GPT-5.4 Pro available for harder tasks. The launch packages reasoning, coding, and native computer use into a single professional-work model with up to 1M tokens of context.
OpenAI Developers has updated its GPT-5.4 API prompting guide. The new guidance focuses on tool use, structured outputs, verification loops, and long-running workflows for production-grade agents.
Google AI shared practical Gemini 3.1 Flash-Lite examples, including high-volume image sorting and business automation scenarios. The thread also points developers to preview access via Gemini API, Google AI Studio, and Vertex AI.
OpenAI announced that GPT-5.4 Thinking and GPT-5.4 Pro are rolling out in ChatGPT, while GPT-5.4 is already available in the API and Codex. The launch positions GPT-5.4 as a unified frontier model for reasoning, coding, and agentic workflows.
Google AI Developers announced that Gemini 3.1 Flash-Lite is rolling out in preview via the Gemini API and Google AI Studio. The post positions it as the fastest and most cost-efficient model in the Gemini 3 line, now adding dynamic thinking for task-adaptive reasoning.
OpenAI released GPT-5.3 Instant on March 3, 2026 as a faster, cheaper derivative of GPT-5.3 for day-to-day use. The company says it matches GPT-4.1 latency and pricing while improving instruction following and reducing hallucinations.
OpenAI announced GPT-5 on August 7, 2025 for both ChatGPT and API usage. Launch highlights include a reported 45% reduction in hallucinations compared with GPT-4o and major benchmark gains, such as a HealthBench Hard score of 44.6.