Poolside Releases Laguna XS.2: First Open-Weight Coding Model That Runs on a Single GPU
Poolside's First Open-Weight Release
On April 28, 2026, Poolside AI released Laguna XS.2 and Laguna M.1 simultaneously. XS.2 is the company's first model with open weights, released under Apache 2.0.
Specifications
- Architecture: 33B total / 3B active MoE
- Training: 30T tokens, fully in-house infrastructure
- SWE-bench Verified: 68.2%
- SWE-bench Pro: 44.5%
- SWE-bench Multilingual: 62.4%
- Terminal-Bench 2.0: 30.1%
- Hardware: Single GPU with 36GB RAM (runs on Apple M-series via Ollama)
- License: Apache 2.0
Architecture
XS.2 uses sigmoid gating with per-layer rotary scales and a mixed Sliding Window Attention / global attention layout in a 3:1 ratio across 40 layers. The model was trained entirely on Poolside's own in-house stack.
How to Access
Weights are on Hugging Face. Hosted inference is available via Poolside's API and OpenRouter. Local deployment works via Ollama on Mac hardware.
Source: Poolside AI, VentureBeat
Related Articles
Open-weight coding models crossed a new practical threshold. Vals AI says GLM 5.2 scored 64% on Vibe Code Bench v1.1, at least 14 percentage points ahead of the next open-weight model.
Released April 29, 2026 under Modified MIT license, Mistral Medium 3.5 consolidates the company's chat, reasoning, and coding models into one 128B dense open-weight model with 256K context, scoring 77.6% on SWE-bench Verified.
A widely-shared r/LocalLLaMA comparison of Qwen's smallest models across three generations (score: 681) reveals extraordinary efficiency gains. The Qwen 3.5 9B now outperforms the previous-generation 80B on several benchmarks, while the 2B handles video understanding better than many 7B models.