Qwen3.7 Max
Alibaba's flagship (API-only on DashScope, no open weights). 90% cache discount on input; frequent promo pricing.
Qwen3.7 Max strengths
- 1M context
- Strong multilingual and coding
- Deep cache discount
- Competitive frontier quality
Pricing & context
| Context window | 1M tokens (64K max output) |
| Input price /1M | $2.50 ($0.25 cached) |
| Output price /1M | $7.50 |
| Modalities | text, image |
Cost guide: a typical call of ~10K input + 2K output tokens costs roughly $2.50 × 0.01 + $7.50 × 0.002 = $0.04 at list price, or $0.25 × 0.01 + $7.50 × 0.002 ≈ $0.018 when the input is fully cached. Model this against cheaper tiers before committing high-volume traffic.
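The cost-guide arithmetic can be sketched as a small helper. This is a minimal sketch that assumes the list prices quoted above. The function name `estimate_cost` and the `cached_fraction` parameter are hypothetical, introduced only for illustration. Real billing (promo pricing, cache eligibility) may differ.

```python
# Assumed list prices in USD per 1M tokens, taken from the table above.
INPUT_PRICE = 2.50
CACHED_INPUT_PRICE = 0.25
OUTPUT_PRICE = 7.50


def estimate_cost(input_tokens: int, output_tokens: int,
                  cached_fraction: float = 0.0) -> float:
    """Estimate the USD cost of one call.

    cached_fraction is the share of input tokens billed at the cached
    rate (0.0 = nothing cached, 1.0 = everything cached). It is a
    hypothetical simplification of how cache hits are billed.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = input_tokens * cached_fraction
    uncached = input_tokens - cached
    input_cost = (uncached * INPUT_PRICE + cached * CACHED_INPUT_PRICE) / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # The ~10K input + 2K output call from the cost guide.
    print(f"uncached:     ${estimate_cost(10_000, 2_000):.4f}")
    print(f"fully cached: ${estimate_cost(10_000, 2_000, cached_fraction=1.0):.4f}")
```

Multiplying the per-call figure by expected daily call volume gives a rough budget to compare against cheaper tiers.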
When to choose Qwen3.7 Max
Qwen3.7 Max is best for long-context agentic workloads, multilingual apps, and Asia-Pacific deployments. If your workload is more cost-sensitive, weigh it against Llama 4 Scout first (~$0.08 per 1M input tokens, varies by host).
Qwen3.7 Max FAQ
How much does Qwen3.7 Max cost?
Qwen3.7 Max is priced at $2.50 per 1M input tokens ($0.25 cached) and $7.50 per 1M output tokens (public API list price), with a 1M-token context window and a 64K-token maximum output.
What is Qwen3.7 Max best for?
Qwen3.7 Max by Alibaba is best for long-context agentic workloads, multilingual apps, and Asia-Pacific deployments.
Is Qwen3.7 Max multimodal?
Yes. Qwen3.7 Max supports the text and image modalities.
Other models
| 01 | GPT-5.5 | OpenAI | $5.00 | → |
| 02 | GPT-5.4 | OpenAI | $2.50 | → |
| 03 | GPT-5.4 mini | OpenAI | $0.75 | → |
| 04 | Claude Opus 4.8 | Anthropic | $5.00 | → |