Meta's legal team sent a notice to the Heretic Free Software Project for distributing Llama model derivatives. Heretic responded with sardonic compliance — invoking Galileo — while immediately setting up a Codeberg mirror in Germany and announcing preservation measures.
LLM
RSS FeedOpenAI launched a personal finance preview for ChatGPT Pro users in the US on May 15, powered by Plaid to connect over 12,000 financial institutions. Users get a live dashboard of spending, investments, and subscriptions analyzed by GPT-5.5.
Anthropic and KPMG announced a global strategic alliance on May 19, embedding Claude into KPMG's Digital Gateway platform for all 276,000 employees, with priority rollout in tax, private equity, and cybersecurity workflows.
Google launched Gemini 3.5 Flash at I/O 2026 on May 19, making it generally available the same day. It outperforms Gemini 3.1 Pro on coding and agentic benchmarks while running 4x faster at 40% lower cost.
Alibaba's Qwen team has released Qwen3.7-Max, an agent-focused frontier LLM. It ranks 5th on Artificial Analysis's Intelligence Index, nearly matching GPT 5.4, and is available as both an API and open weights.
Forge is a new open-source Python framework that applies structured guardrails to self-hosted LLMs. The best config — Ministral-3 8B Q8 — jumps from a 53% baseline to 86.5% on the 26-scenario eval suite, with 99% achievable on agentic tasks.
Google has released Gemini 3.5 Flash, optimized for agentic workflows and complex tasks. It claims 4x faster output than competing frontier models at under half the cost, with top-tier scores on Terminal-Bench, MCP Atlas, and reasoning benchmarks.
For months, Claude has been spontaneously telling users to go to sleep during active conversations, sometimes at 8:30 AM. Anthropic acknowledges the issue but hasn't identified the root cause, calling it 'a bit of a character tic.'
OpenAI and Dell Technologies announced a partnership on May 18 to bring Codex to hybrid and on-premises enterprise environments via the Dell AI Data Platform and AI Factory. The deal targets regulated industries — finance, healthcare, government — where data cannot leave private infrastructure. Codex currently serves over 4 million developers per week.
Google DeepMind unveiled Gemini 3.5 Flash at Google I/O 2026, their strongest model yet for agents and coding. It runs 4x faster than comparable frontier models while matching or exceeding their intelligence on benchmarks.
OpenAI launched Codex in the ChatGPT iOS and Android app, turning phones into a remote control for Codex sessions running on a desktop or devbox. The preview rolls out across all plans including Free, with 4 million weekly Codex users.
Google unveiled Gemini 3.5 Flash at I/O 2026, combining frontier performance with agentic capabilities. The new Managed Agents API provisions a full sandboxed agent environment with a single API call.