Mistral Medium 3.5: A Single 128B Open-Weight Model That Replaces Three Separate Models

One Model, Three Jobs

Mistral AI released Mistral Medium 3.5 on April 29, 2026 under a Modified MIT license. The model consolidates three previously separate Mistral offerings: Mistral Medium 3.1 for instruction-following, Magistral for reasoning, and Devstral 2 for coding. Users now switch between modes with a single toggle.

Specifications

Parameters: 128B (dense, not MoE)
Context window: 256K tokens
License: Modified MIT
SWE-bench Verified: 77.6%
API input price: $1.50 per million tokens
Self-hostable: On 4 GPUs

Ships with Vibe

Medium 3.5 launches alongside Vibe, a cloud-based coding agent that autonomously submits pull requests to GitHub. Together, they represent Mistral's push into agentic coding workflows.

Competitive Position

Among open-weight models, 77.6% on SWE-bench Verified is top-tier. The ability to self-host on four GPUs lowers the enterprise barrier significantly for teams with data residency constraints.

Source: Mistral AI, Winbuzzer

LLM May 5, 2026 1 min read

Poolside Releases Laguna XS.2: First Open-Weight Coding Model That Runs on a Single GPU

Poolside AI released Laguna XS.2 on April 28, 2026 under Apache 2.0 — a 33B total/3B active MoE model purpose-built for agentic coding, scoring 68.2% on SWE-bench Verified and deployable on a single consumer GPU.

#open-source #coding #benchmark

LLM X/Twitter 2d ago 1 min read

GLM 5.2 hits 64% on Vibe Code Bench as open weights close in

Open-weight coding models crossed a new practical threshold. Vals AI says GLM 5.2 scored 64% on Vibe Code Bench v1.1, at least 14 percentage points ahead of the next open-weight model.

#glm-5-2 #open-weights #benchmark

LLM Reddit Mar 3, 2026 1 min read

Qwen 2.5 → 3 → 3.5: How Alibaba's Smallest Models Have Transformed Across Generations

A widely-shared r/LocalLLaMA comparison of Qwen's smallest models across three generations (score: 681) reveals extraordinary efficiency gains. The Qwen 3.5 9B now outperforms the previous-generation 80B on several benchmarks, while the 2B handles video understanding better than many 7B models.

#qwen #alibaba #open-source