Z.ai GLM-5.2
Released June 16, 2026. 753B-parameter open-weights MoE tuned for long-horizon agentic coding; ~$0.26/1M cached input. The open-weights price/performance leader of mid-2026.
Z.ai GLM-5.2 strengths
- Open weights
- Long-horizon agentic coding
- Strong price/performance
- 1M context
- Cheap cached input
Pricing & context
| Context window | 1M tokens |
| Input price /1M | $0.95 |
| Output price /1M | $3.00 |
| Modalities | text |
Cost guide: a typical call of ~10K input + 2K output tokens runs roughly $0.95 × 0.01 + $3.00 × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.
When to choose Z.ai GLM-5.2
Z.ai GLM-5.2 is best for Cost-efficient agentic coding and self-hosted deployments where open weights matter. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.
Z.ai GLM-5.2 FAQ
How much does Z.ai GLM-5.2 cost?
Z.ai GLM-5.2 is priced at $0.95 per 1M input tokens and $3.00 per 1M output tokens (public API list price), with a 1M tokens context window.
What is Z.ai GLM-5.2 best for?
Z.ai GLM-5.2 by Z.ai (Zhipu AI) is best for Cost-efficient agentic coding and self-hosted deployments where open weights matter.
Is Z.ai GLM-5.2 multimodal?
Z.ai GLM-5.2 supports text.
Other models
All models →| 01 | GPT-5.5 | OpenAI | $5.00 | → |
| 02 | GPT-5.4 | OpenAI | $2.50 | → |
| 03 | GPT-5.4 mini | OpenAI | $0.75 | → |
| 04 | Claude Opus 4.8 | Anthropic | $5.00 | → |