MiniMax M3
MiniMax's open-weight flagship (June 1, 2026): a ~428B-parameter MoE (23B active) using MiniMax Sparse Attention for efficient 1M-token context. Strong coding, agentic and computer-use performance at roughly 5–10% of closed-frontier cost; a ~50% launch promo applied at release. Benchmarks are vendor-reported — verify.
MiniMax M3 strengths
- Open weights (self-hostable)
- 1M context via sparse attention
- Frontier coding + agentic / computer-use
- Native multimodal (incl. video input)
- Very low cost vs closed frontier models
Pricing & context
| Context window | 1M tokens (min 512K) |
| Input price /1M | $0.60 |
| Output price /1M | $2.40 |
| Modalities | text, image, video |
Cost guide: a typical call of ~10K input + 2K output tokens runs roughly $0.60 × 0.01 + $2.40 × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.
When to choose MiniMax M3
MiniMax M3 is best for Long-context coding agents, computer-use automation and high-volume tasks wanting frontier quality with open weights. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.
MiniMax M3 FAQ
How much does MiniMax M3 cost?
MiniMax M3 is priced at $0.60 per 1M input tokens and $2.40 per 1M output tokens (public API list price), with a 1M tokens (min 512K) context window.
What is MiniMax M3 best for?
MiniMax M3 by MiniMax is best for Long-context coding agents, computer-use automation and high-volume tasks wanting frontier quality with open weights.
Is MiniMax M3 multimodal?
MiniMax M3 supports text, image, video.
Other models
All models →| 01 | GPT-5.5 | OpenAI | $5.00 | → |
| 02 | GPT-5.4 | OpenAI | $2.50 | → |
| 03 | GPT-5.4 mini | OpenAI | $0.75 | → |
| 04 | Claude Opus 4.8 | Anthropic | $5.00 | → |