#llm

LLM Feb 23, 2026 1 min read

Alibaba Releases Qwen3.5: Open-Weight MoE Model Claims to Beat US Rivals

Alibaba launched Qwen3.5, a 397B-parameter open-weight multimodal model supporting 201 languages. The company claims it outperforms GPT-5.2, Claude Opus 4.5, and Gemini 3 on benchmarks, while costing 60% less than its predecessor.

#alibaba #qwen #open-source

LLM Reddit Feb 23, 2026 1 min read

Gemini 3.1 Pro Built a Fully Playable Space Game Through Natural Language Alone

A user created a fully playable space exploration game using only natural language instructions to Gemini 3.1 Pro over a few hours. The AI handled performance optimization, soundtrack generation, and UI design entirely from plain language requests, producing around 1,800 lines of HTML code.

#gemini #google #code-generation

LLM X/Twitter Feb 22, 2026 1 min read

Google DeepMind Releases Gemini 3.1 Pro: 2x Reasoning Boost and Record Benchmark Scores

Google DeepMind has released Gemini 3.1 Pro with over 2x reasoning performance versus Gemini 3 Pro. The model scores 77.1% on ARC-AGI-2 (up from 31.1%), 80.6% on SWE-bench Verified, and tops 12 of 18 tracked benchmarks at unchanged $2/$12 per million token pricing.

#gemini #google-deepmind #llm

107

LLM Hacker News Feb 22, 2026 2 min read

Taalas Prints LLM Weights into Silicon: 17,000 Tokens/sec at 10x Lower Cost

Taalas has released an ASIC chip that physically etches Llama 3.1 8B model weights into silicon, achieving 17,000 tokens per second—10x faster, 10x cheaper, and 10x more power-efficient than GPU-based inference systems.

#taalas #asic #llm

104

LLM Feb 22, 2026 1 min read

ByteDance Launches Doubao 2.0 — Frontier-Level AI at One-Tenth the Cost

ByteDance released Doubao 2.0 ahead of Lunar New Year, claiming GPT-5.2 and Gemini 3 Pro parity with 98.3 on AIME 2025, a 3020 Codeforces rating, and pricing 10x cheaper than Western rivals.

#bytedance #llm #product-launch

AI X/Twitter Feb 22, 2026 1 min read

Karpathy: LLMs Are Rewriting the Rules of Software — All Code Will Be Rewritten Many Times Over

AI researcher Andrej Karpathy argues that LLMs fundamentally change software constraints, excelling at code translation. He predicts large fractions of all software ever written will be rewritten many times over as AI reshapes the programming landscape.

#karpathy #llm #software-engineering

LLM X/Twitter Feb 22, 2026 1 min read

Google DeepMind Launches Gemini 3.1 Pro with Significantly Improved Overall Intelligence

Google DeepMind announced Gemini 3.1 Pro, featuring major improvements to overall model intelligence for tackling tougher problems. Rolling out to Google AI Pro and Ultra subscribers in the Gemini app and NotebookLM, with API preview in Google AI Studio.

#google-deepmind #gemini #gemini-3.1

LLM Hacker News Feb 20, 2026 2 min read

Taalas proposes model-specific silicon for low-latency AI inference

A high-engagement Hacker News thread spotlights Taalas’ claim that model-specific silicon can cut inference latency and cost, including a hard-wired Llama 3.1 8B deployment reportedly reaching 17K tokens/sec per user.

#llm #inference #ai-hardware

LLM Feb 20, 2026 2 min read

Anthropic Commits to Keeping Claude Ad-Free, Framing AI Chats as a Trust Surface

In a February 4, 2026 post, Anthropic said Claude conversations will remain ad-free and not include unsolicited product placements. The company argues that conversational AI requires clearer trust incentives than ad-supported feed or search models.

#anthropic #claude #llm

LLM Hacker News Feb 20, 2026 2 min read

Gemini 3.1 Pro Launches as Google Targets Complex Reasoning Work

A top Hacker News discussion tracked Google’s Gemini 3.1 Pro rollout. Google positions it as a stronger reasoning baseline, highlighting a 77.1% ARC-AGI-2 score and broad preview availability across developer, enterprise, and consumer channels.

#gemini #google #llm

101

LLM Hacker News Feb 19, 2026 2 min read

HN Spotlights Step 3.5 Flash: Open-Source 196B MoE Model Aiming for Fast Agentic Reasoning

A high-signal Hacker News post highlighted StepFun's Step 3.5 Flash launch, describing a 196B-parameter MoE foundation model with about 11B active parameters, 256K context, and vendor-reported coding/agent benchmarks.

#stepfun #open-source #llm

105

LLM Hacker News Feb 18, 2026 2 min read

Claude Sonnet 4.6 launched: 1M context, same pricing, stronger real-world automation

Anthropic introduced Claude Sonnet 4.6 with a 1M token context window (beta), stronger coding/computer-use performance, and unchanged API pricing at $3/$15 per million tokens.

#anthropic #claude #sonnet

124