Google DeepMind Releases Gemini 3.1 Pro: 2x Reasoning Boost and Record Benchmark Scores

Overview

Google DeepMind released Gemini 3.1 Pro on February 19, 2026, delivering over 2x reasoning improvement compared to Gemini 3 Pro. The model achieves 77.1% on ARC-AGI-2 (up from 31.1%), 80.6% on SWE-bench Verified, and tops 12 of 18 tracked benchmarks — all at the same price point as its predecessor.

Benchmark Performance

ARC-AGI-2: 77.1% (up from 31.1%)
SWE-bench Verified: 80.6%
GPQA Diamond: 94.3%
LiveCodeBench Pro Elo: 2887
Humanity's Last Exam: 44.4%
#1 on 12 of 18 tracked benchmarks

Key Features

1M token context window: Supports text, images, audio, and video
Three thinking levels: Low, Medium, High — tune latency vs. reasoning depth
64K output capacity: Suited for complex, long-form tasks
Multimodal: Processes text, audio, images, video, and entire code repositories

Pricing and Access

Gemini 3.1 Pro maintains the same pricing as Gemini 3 Pro at $2 per million input tokens and $12 per million output tokens. Available through the Gemini API, Vertex AI, the Gemini app, and NotebookLM.

Source: Google DeepMind (@GoogleDeepMind) on X

LLM Feb 28, 2026 2 min read

Google DeepMind Launches Gemini 3.1 Pro for Complex Reasoning Workloads

Google DeepMind announced Gemini 3.1 Pro on February 19, 2026 as an upgraded core model for harder tasks. The company highlighted a verified 77.1% score on ARC-AGI-2 and broad rollout across developer, enterprise, and consumer surfaces.

#gemini #google-deepmind #llm

LLM May 22, 2026 1 min read

Google I/O 2026: Gemini 3.5 Flash Arrives with Flagship Performance at Flash Speed

Google launched Gemini 3.5 Flash at I/O 2026 on May 19, making it generally available the same day. It outperforms Gemini 3.1 Pro on coding and agentic benchmarks while running 4x faster at 40% lower cost.

#google #gemini #product-launch

LLM X/Twitter Mar 8, 2026 1 min read

Google releases Android Bench to measure LLM performance on Android development

Google AI Developers has released Android Bench, an official leaderboard for LLMs on Android development tasks. In the first results, Gemini 3.1 Pro ranks first, and Google is also publishing the benchmark, dataset, and test harness.

#google #android #benchmark