Liquid AI Releases LFM2.5: 8B MoE Model Trained on 38T Tokens
Original: Liquid AI reveals 8B-A1B MoE trained on 38T View original →
A New Edge AI Benchmark
Liquid AI has released LFM2.5 8B-A1B, a full-scale upgrade to its October 2025 predecessor. The model is a Mixture-of-Experts architecture optimized for on-device AI across edge hardware. Training data scaled from 12T to 38T tokens—more than tripling the previous version.
Key Technical Improvements
The context window expands 4x from 32K to 128K tokens, and vocabulary doubles from 65K to 128K. Multilingual tokenization saw major gains: Hindi +120.4%, Thai +238.2%, Vietnamese +117.9%. The model introduces targeted probability redistribution to address reasoning loop failures, plus knowledge-boundary optimization for hallucination mitigation.
Benchmark Performance
The AA-Omniscience Index improved 53 points from the prior version. Key scores: IFEval 91.84, MATH500 88.76, AIME25 42.53. The model outperforms similarly-sized dense alternatives and competes with Gemma-4-26B despite being three times smaller.
Inference Speed
On CPU hardware, the M5 Max delivers 253 tokens per second, with approximately 30 tokens per second on mobile devices at under 6GB memory. On GPU, a single NVIDIA H100 achieves 18,500 output tokens per second at high concurrency—translating to over 1.6 billion tokens processed daily.
Deployment and Ecosystem
The model ships with support for llama.cpp, MLX, vLLM, SGLang, and ONNX, covering Apple, AMD, Intel, Qualcomm, and Nvidia hardware. The LocalCowork demo showcases 67 tools across 13 servers executing entirely on-device with sub-second dispatch latency—no cloud dependency required.
Related Articles
A new DELEGATE-52 benchmark study finds that even frontier LLMs like Gemini 3.1 Pro, Claude 4.6 Opus, and GPT 5.4 corrupt an average of 25% of document content during long delegated workflows, with errors compounding silently.
DeepSeek V4 Pro tied with GPT-5.2 on FoodTruck Bench, a 30-day agentic benchmark using 34 tools, arriving roughly 10 weeks after GPT-5.2 was tested at approximately 17x lower cost.
OpenAI replaced GPT-5.3 Instant with GPT-5.5 Instant as ChatGPT's default model on May 5. The update cuts hallucinations by 52.5% in high-stakes domains and trims response length by 30%.
Comments (0)
No comments yet. Be the first to comment!