Google Releases Gemini 3.1 Pro: 77.1% ARC-AGI-2, Doubled Reasoning Performance

Google DeepMind Launches Gemini 3.1 Pro

Google DeepMind officially launched Gemini 3.1 Pro on February 19, 2026, marking a significant advancement in multimodal reasoning and agentic AI capabilities. The model is the latest in the Gemini 3 series.

Benchmark Results

The model achieves 77.1% on ARC-AGI-2—more than doubling its predecessor Gemini 3 Pro's score of 31.1%. On SWE-Bench Verified, it scores 80.6%, and on GPQA Diamond (graduate-level science reasoning), it reaches 94.3%. Terminal-Bench 2.0 performance stands at 68.5%, and Humanity's Last Exam (with tools) scores 51.4%.

Technical Specifications

The model supports a 1 million token input context window, enabling processing of over 1,500 pages of text or entire code repositories in a single prompt. Maximum output is 65,536 tokens. Gemini 3.1 Pro natively handles text, images, audio, video, and code in unified multimodal workflows.

Key Improvements

A new granular thinking mode lets users select reasoning depth from minimal to high, balancing speed and accuracy. Hallucination rates dropped significantly from 88% to 50% on the AA-Omniscience benchmark. Agentic reliability has been improved for autonomous multi-step workflows.

Pricing and Access

Pricing remains unchanged: $2/1M tokens (input up to 200K), $4/1M (input beyond 200K), $12/1M (output up to 200K), $18/1M (output beyond 200K). Available through Google AI Studio, Vertex AI, Gemini API, NotebookLM, and Microsoft Foundry. Deep Think mode is available to Google AI Ultra subscribers.

Source: Google DeepMind — Gemini 3.1 Pro Model Card

LLM Feb 23, 2026 1 min read

Google Releases Gemini 3.1 Pro with 77.1% on ARC-AGI-2

Google's Gemini 3.1 Pro achieves 77.1% on ARC-AGI-2—more than doubling the previous Gemini 3 Pro's score. The mid-cycle upgrade brings Deep Think-level reasoning capabilities to all users and developers.

#google #gemini #benchmark

100

LLM May 23, 2026 2 min read

Google I/O 2026: Gemini Spark personal AI agent launches with Gemini 3.5 Flash

At Google I/O 2026 on May 19, Google unveiled Gemini 3.5 Flash—which outperforms Gemini 3.1 Pro across all benchmarks at 4× the speed and half the API cost—alongside Gemini Spark, a 24/7 personal AI agent that works in the background and can be reached directly via Gmail. Spark enters beta for Google AI Ultra subscribers in the US starting the week of May 26.

#google #gemini #product-launch

LLM May 22, 2026 1 min read

Google I/O 2026: Gemini 3.5 Flash Arrives with Flagship Performance at Flash Speed

Google launched Gemini 3.5 Flash at I/O 2026 on May 19, making it generally available the same day. It outperforms Gemini 3.1 Pro on coding and agentic benchmarks while running 4x faster at 40% lower cost.