Gemini 3.1 Pro

Googletextimageaudiovideo

Google's flagship in 2026 with a 2M context window; tiered pricing above 200K tokens. Context caching can cut cached reads to ~$0.20-$0.40/1M.

Gemini 3.1 Pro strengths

  • Largest mainstream context (2M)
  • Strong multimodal reasoning
  • Native Google ecosystem integration
  • Competitive pricing

Pricing & context

Context window2M tokens
Input price /1M$2.00 (under 200K; $4.00 above)
Output price /1M$12.00 (under 200K; $18.00 above)
Modalitiestext, image, audio, video

Cost guide: a typical call of ~10K input + 2K output tokens runs roughly $2.00 (under 200K; $4.00 above) × 0.01 + $12.00 (under 200K; $18.00 above) × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.

When to choose Gemini 3.1 Pro

Gemini 3.1 Pro is best for Long-document and multimodal workloads, RAG over huge corpora, and Google Cloud-native apps. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.

Gemini 3.1 Pro FAQ

How much does Gemini 3.1 Pro cost?

Gemini 3.1 Pro is priced at $2.00 (under 200K; $4.00 above) per 1M input tokens and $12.00 (under 200K; $18.00 above) per 1M output tokens (public API list price), with a 2M tokens context window.

What is Gemini 3.1 Pro best for?

Gemini 3.1 Pro by Google is best for Long-document and multimodal workloads, RAG over huge corpora, and Google Cloud-native apps.

Is Gemini 3.1 Pro multimodal?

Gemini 3.1 Pro supports text, image, audio, video.

Tools that use Gemini 3.1 Pro

Other models

All models →
01GPT-5.5OpenAI$5.00
02GPT-5.4OpenAI$2.50
03GPT-5.4 miniOpenAI$0.75
04Claude Opus 4.8Anthropic$5.00