Llama 4 Maverick
Open-weights natively multimodal MoE (17B active, 128 experts). Pricing varies by host (Together, Groq, Fireworks).
Llama 4 Maverick strengths
- Open weights / self-hostable
- Natively multimodal
- Strong price-performance
- Wide host availability
Pricing & context
| Context window | 1M tokens |
| Input price /1M | ~$0.15 (varies by host) |
| Output price /1M | ~$0.60 (varies by host) |
| Modalities | text, image |
Cost guide: a typical call of ~10K input + 2K output tokens runs roughly ~$0.15 (varies by host) × 0.01 + ~$0.60 (varies by host) × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.
When to choose Llama 4 Maverick
Llama 4 Maverick is best for Teams wanting an open, multimodal model they can self-host or run cheaply via multiple providers. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.
Llama 4 Maverick FAQ
How much does Llama 4 Maverick cost?
Llama 4 Maverick is priced at ~$0.15 (varies by host) per 1M input tokens and ~$0.60 (varies by host) per 1M output tokens (public API list price), with a 1M tokens context window.
What is Llama 4 Maverick best for?
Llama 4 Maverick by Meta is best for Teams wanting an open, multimodal model they can self-host or run cheaply via multiple providers.
Is Llama 4 Maverick multimodal?
Llama 4 Maverick supports text, image.
Tools that use Llama 4 Maverick
Other models
All models →| 01 | GPT-5.5 | OpenAI | $5.00 | → |
| 02 | GPT-5.4 | OpenAI | $2.50 | → |
| 03 | GPT-5.4 mini | OpenAI | $0.75 | → |
| 04 | Claude Opus 4.8 | Anthropic | $5.00 | → |