Llama 4 Maverick

Open-weights natively multimodal MoE (17B active, 128 experts). Pricing varies by host (Together, Groq, Fireworks).

Llama 4 Maverick strengths

Open weights / self-hostable
Natively multimodal
Strong price-performance
Wide host availability

Pricing & context

Context window	1M tokens
Input price /1M	~$0.15 (varies by host)
Output price /1M	~$0.60 (varies by host)
Modalities	text, image

Cost guide: a typical call of ~10K input + 2K output tokens runs roughly ~$0.15 (varies by host) × 0.01 + ~$0.60 (varies by host) × 0.002 — worth modelling against cheaper tiers before committing high-volume traffic.

When to choose Llama 4 Maverick

Llama 4 Maverick is best for Teams wanting an open, multimodal model they can self-host or run cheaply via multiple providers. If your workload is more cost-sensitive, weigh it against Llama 4 Scout (~$0.08 (varies by host) input /1M) first.

Llama 4 Maverick FAQ

How much does Llama 4 Maverick cost?

Llama 4 Maverick is priced at ~$0.15 (varies by host) per 1M input tokens and ~$0.60 (varies by host) per 1M output tokens (public API list price), with a 1M tokens context window.

What is Llama 4 Maverick best for?

Llama 4 Maverick by Meta is best for Teams wanting an open, multimodal model they can self-host or run cheaply via multiple providers.

Is Llama 4 Maverick multimodal?

Llama 4 Maverick supports text, image.

Tools that use Llama 4 Maverick

Meta AI

01	GPT-5.5	OpenAI	$5.00	→
02	GPT-5.4	OpenAI	$2.50	→
03	GPT-5.4 mini	OpenAI	$0.75	→
04	Claude Opus 4.8	Anthropic	$5.00	→