Qwen3.7 Max

Alibaba's flagship (API-only on DashScope, no open weights). 90% cache discount on input; frequent promo pricing.

Qwen3.7 Max strengths

1M context
Strong multilingual and coding
Deep cache discount
Competitive frontier quality

Pricing & context

Context window	1M tokens (64K max output)
Input price /1M	$2.50 ($0.25 cached)
Output price /1M	$7.50
Modalities	text, image

Cost guide: a typical call of ~10K input + 2K output tokens runs roughly $2.50 ($0.25 cached) × 0.01 + $7.50 × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.

When to choose Qwen3.7 Max

Qwen3.7 Max is best for Long-context agentic workloads, multilingual apps, and Asia-Pacific deployments. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.

Qwen3.7 Max FAQ

How much does Qwen3.7 Max cost?

Qwen3.7 Max is priced at $2.50 ($0.25 cached) per 1M input tokens and $7.50 per 1M output tokens (public API list price), with a 1M tokens (64K max output) context window.

What is Qwen3.7 Max best for?

Qwen3.7 Max by Alibaba is best for Long-context agentic workloads, multilingual apps, and Asia-Pacific deployments.

Is Qwen3.7 Max multimodal?

Qwen3.7 Max supports text, image.

01	GPT-5.5	OpenAI	$5.00	→
02	GPT-5.4	OpenAI	$2.50	→
03	GPT-5.4 mini	OpenAI	$0.75	→
04	Claude Opus 4.8	Anthropic	$5.00	→