Gemini 3.5 Flash: AI price, benchmarks
Gemini 3.5 Flash by Google: 91.0 overall (preliminary), $7.12/M blended cost, 1M context. New 2026-05 release.
Gemini 3.5 Flash is a Google model released in 2026-05, listed in the Benchquill record with a preliminary overall score of 91.0, a blended cost of $7.12/M tokens, and a 1M context window. Its full Benchquill sub-scores are pending review. It launched at Google I/O (May 2026) and is the first Flash-tier model to beat a Pro-tier model on agentic and coding benchmarks: it scores 76.2% on Terminal-Bench 2.1, 83.6% on MCP Atlas, and 84.2% on CharXiv Reasoning, all above Gemini 3.1 Pro, at roughly 4x faster output.
| Rank | Model | Provider | Overall | Blended cost | Context |
|---|---|---|---|---|---|
| 6 | Gemini 3.5 Flash | Google | 91.0 | $7.12/M | 1M |
| Rank | Model | Provider | Overall | Blended cost | Context |
|---|---|---|---|---|---|
| 1 | GPT-5.5 | OpenAI | 94.6 | $23.75/M | 1.05M |
| 2 | Claude Opus 4.8 | Anthropic | 94.0 | $20.00/M | 1M |
| 3 | Claude Opus 4.7 | Anthropic | 93.8 | $20.00/M | 1M |
| 4 | Gemini 3.1 Pro Preview | Google | 92.4 | $9.50/M | 1M |
Gemini 3.5 Flash is a 2026-05 release with a preliminary Benchquill overall score of 91.0 (pending full review), a blended cost of $7.12/M, a 1M context window, and a closed license. It launched at Google I/O (May 2026) as the first Flash-tier model to beat a Pro-tier model on agentic and coding benchmarks (Terminal-Bench 2.1 76.2%, MCP Atlas 83.6%, CharXiv Reasoning 84.2%, all above Gemini 3.1 Pro), at roughly 4x faster output.
| Metric | Score / value |
|---|---|
| Overall (Benchquill composite) | 91.0 / 100 (preliminary) |
| Coding | Pending review |
| Reasoning | Pending review |
| Math | Pending review |
| Vision / multimodal | Pending review |
| Speed (estimated) | 340 tokens/sec |
| Input price | $1.50 / 1M tokens |
| Output price | $9.00 / 1M tokens |
| Blended price | $7.12 / 1M tokens |
| Context window | 1M |
| License | Closed |
| Modalities | Text, Vision, Audio, Video |
| Released | 2026-05 |
| Provider | Google |
Preliminary estimate (June 2026) - full Benchquill sub-scores pending review; figures from vendor/public benchmarks. Benchmarks tracked across Benchquill: SWE-Bench Verified, HumanEval, LiveBench, BFCL v3, GPQA Diamond, MMLU, MMMU, AIME 2025, MATH-500.
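The blended price listed above is consistent with a weighted average of the input and output prices. As a rough sketch, assuming a 1:3 input-to-output token weighting (an inference from the listed figures, not a formula stated by Benchquill), the arithmetic can be reproduced as follows. The function name `blended_price` and the `output_share` parameter are hypothetical, chosen only for illustration.

```python
def blended_price(input_price: float, output_price: float,
                  output_share: float = 0.75) -> float:
    """Weighted-average price in USD per 1M tokens.

    output_share is an ASSUMED weighting (75% output, 25% input);
    the source does not state how the blend is actually computed.
    """
    if not 0.0 <= output_share <= 1.0:
        raise ValueError("output_share must be between 0 and 1")
    return (1.0 - output_share) * input_price + output_share * output_price


# Listed Gemini 3.5 Flash prices, in USD per 1M tokens.
flash_blended = blended_price(input_price=1.50, output_price=9.00)
print(f"${flash_blended:.3f} per 1M tokens")
```

Under that assumed weighting the result is $7.125, which agrees with the listed $7.12 blended price to within rounding. A workload with a different input-to-output ratio would see a different effective cost, so `output_share` is best treated as a knob rather than a fixed constant.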