Skip to content

Open-Weight LLM Race (May 2026): Mistral, Poolside, Gemma, and Grok

4 articles Updated May 8, 2026 #benchmark#open-source#product-launch#api

Current state

A burst of open-weight model releases in the first week of May 2026: Mistral Medium 3.5 (128B unifying chat, reasoning, and coding), Poolside's Laguna XS.2 (68.2% SWE-bench on a single GPU), Gemma 4 Multi-Token Prediction drafters for faster inference, and xAI's Grok 4.3 topping agentic tool-calling benchmarks.

What changed recently

  • xAI Launches Grok 4.3 on API: Tops Agentic Tool Calling Benchmarks
  • Google Releases Multi-Token Prediction Drafters for Gemma 4: Up to 3x Speedup
  • Poolside Releases Laguna XS.2: First Open-Weight Coding Model That Runs on a Single GPU

Key tensions

Optimistic case: Open-Weight LLM Race (May 2026): Mistral, Poolside, Gemma, and Grok unlocks real, compounding leverage.
Skeptical case: reliability, cost, and control around Open-Weight LLM Race (May 2026): Mistral, Poolside, Gemma, and Grok remain unresolved.

Signals to watch

  • Momentum and new coverage around “benchmark”
  • Momentum and new coverage around “open-source”
  • Momentum and new coverage around “product-launch”

Timeline

Latest
Recent development
Recent development
Recent development
Share: Long