Google DeepMind Releases Gemini 3.1 Pro: 2x Reasoning Boost and Record Benchmark Scores

Overview

Google DeepMind released Gemini 3.1 Pro on February 19, 2026, reporting more than double the reasoning performance of Gemini 3 Pro. The model scores 77.1% on ARC-AGI-2 (up from 31.1%) and 80.6% on SWE-bench Verified, and it ranks first on 12 of 18 tracked benchmarks, all at the same price as its predecessor.

Benchmark Performance

ARC-AGI-2: 77.1% (up from 31.1%)
SWE-bench Verified: 80.6%
GPQA Diamond: 94.3%
LiveCodeBench Pro Elo: 2887
Humanity's Last Exam: 44.4%
#1 on 12 of 18 tracked benchmarks

Key Features

1M token context window: Supports text, images, audio, and video
Three thinking levels: Low, Medium, and High, to trade latency against reasoning depth
64K token output capacity: Suited for complex, long-form tasks
Multimodal: Processes text, audio, images, video, and entire code repositories

Pricing and Access

Gemini 3.1 Pro keeps the same pricing as Gemini 3 Pro: $2 per million input tokens and $12 per million output tokens. It is available through the Gemini API, Vertex AI, the Gemini app, and NotebookLM.
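
To make the per-token rates concrete, here is a minimal sketch of a per-request cost estimate. It assumes the two quoted rates apply as flat prices, with no tiering, caching discounts, or other adjustments, none of which the announcement summary above addresses. The function name and the example token counts are hypothetical.

```python
# Rates quoted in the article, in USD per one million tokens.
INPUT_USD_PER_MILLION = 2.00
OUTPUT_USD_PER_MILLION = 12.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming flat per-token rates.

    Assumption: no tiered pricing or discounts; real bills may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 200,000 input tokens and 8,000 output tokens.
    print(f"${estimate_cost_usd(200_000, 8_000):.3f}")  # prints $0.496
```

Because output tokens cost six times as much as input tokens at these rates, long generations can dominate the bill even when the prompt is much larger than the response.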

Source: Google DeepMind (@GoogleDeepMind) on X
