Gemini 3.1 Pro

Google's flagship in 2026 with a 2M context window; tiered pricing above 200K tokens. Context caching can cut cached reads to ~$0.20-$0.40/1M.

Gemini 3.1 Pro strengths

Largest mainstream context (2M)
Strong multimodal reasoning
Native Google ecosystem integration
Competitive pricing

Pricing & context

Context window	2M tokens
Input price /1M	$2.00 (under 200K; $4.00 above)
Output price /1M	$12.00 (under 200K; $18.00 above)
Modalities	text, image, audio, video

Cost guide: a typical call of ~10K input + 2K output tokens runs roughly $2.00 (under 200K; $4.00 above) × 0.01 + $12.00 (under 200K; $18.00 above) × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.

When to choose Gemini 3.1 Pro

Gemini 3.1 Pro is best for Long-document and multimodal workloads, RAG over huge corpora, and Google Cloud-native apps. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.

Gemini 3.1 Pro FAQ

How much does Gemini 3.1 Pro cost?

Gemini 3.1 Pro is priced at $2.00 (under 200K; $4.00 above) per 1M input tokens and $12.00 (under 200K; $18.00 above) per 1M output tokens (public API list price), with a 2M tokens context window.

What is Gemini 3.1 Pro best for?

Gemini 3.1 Pro by Google is best for Long-document and multimodal workloads, RAG over huge corpora, and Google Cloud-native apps.

Is Gemini 3.1 Pro multimodal?

Gemini 3.1 Pro supports text, image, audio, video.

Tools that use Gemini 3.1 Pro

Google Gemini (Nano Banana / Imagen)Google Veo 3.1 Google Antigravity Google Gemini

01	GPT-5.5	OpenAI	$5.00	→
02	GPT-5.4	OpenAI	$2.50	→
03	GPT-5.4 mini	OpenAI	$0.75	→
04	Claude Opus 4.8	Anthropic	$5.00	→