Gemini 3.1 Pro
Google's flagship in 2026 with a 2M context window; tiered pricing above 200K tokens. Context caching can cut cached reads to ~$0.20-$0.40/1M.
Gemini 3.1 Pro strengths
- Largest mainstream context (2M)
- Strong multimodal reasoning
- Native Google ecosystem integration
- Competitive pricing
Pricing & context
| Context window | 2M tokens |
| Input price /1M | $2.00 (under 200K; $4.00 above) |
| Output price /1M | $12.00 (under 200K; $18.00 above) |
| Modalities | text, image, audio, video |
Cost guide: a typical call of ~10K input + 2K output tokens runs roughly $2.00 (under 200K; $4.00 above) × 0.01 + $12.00 (under 200K; $18.00 above) × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.
When to choose Gemini 3.1 Pro
Gemini 3.1 Pro is best for Long-document and multimodal workloads, RAG over huge corpora, and Google Cloud-native apps. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.
Gemini 3.1 Pro FAQ
How much does Gemini 3.1 Pro cost?
Gemini 3.1 Pro is priced at $2.00 (under 200K; $4.00 above) per 1M input tokens and $12.00 (under 200K; $18.00 above) per 1M output tokens (public API list price), with a 2M tokens context window.
What is Gemini 3.1 Pro best for?
Gemini 3.1 Pro by Google is best for Long-document and multimodal workloads, RAG over huge corpora, and Google Cloud-native apps.
Is Gemini 3.1 Pro multimodal?
Gemini 3.1 Pro supports text, image, audio, video.
Tools that use Gemini 3.1 Pro
Other models
All models →| 01 | GPT-5.5 | OpenAI | $5.00 | → |
| 02 | GPT-5.4 | OpenAI | $2.50 | → |
| 03 | GPT-5.4 mini | OpenAI | $0.75 | → |
| 04 | Claude Opus 4.8 | Anthropic | $5.00 | → |