← All models
L
Llama 4 Maverick
Meta
Meta's powerful Llama 4 model balancing performance and cost with 1M context
Input price$0.50 / 1M tokens
Output price$1.10 / 1M tokens
Context window1M tokens
Last updated2026-04-22
Quick calculator
Per request
$0.001050
Daily
$10.50
Monthly
$315.00
per month · 30-day estimate
Yearly
$3,832.50
Tips to reduce cost
- →Use prompt caching to reuse repeated system prompts
- →Trim whitespace and reduce verbose instructions
- →Use a smaller model for classification or routing tasks
- →Batch async requests to get 50% discount (OpenAI/Anthropic)
- →Cache identical requests at the application layer
Similar models from Meta
Compared at your current token settings
About Llama 4 Maverick
Llama 4 Maverick is a mid-range large language model from meta, priced at $0.5/1M input tokens and $1.1/1M output tokens. It is 81% cheaper than the market average and best suited for balanced open-source tasks. The 1M context window makes it suitable for very long documents, large codebases, and book-length inputs.
For most production workloads, the cost breakdown is dominated by input tokens (system prompts, context, retrieved documents) rather than output. At this price point, Llama 4 Maverick is a solid choice when balancing quality and cost at scale.
Frequently Asked Questions
How much does Llama 4 Maverick cost per 1,000 tokens?
Llama 4 Maverick costs $0.0005 per 1,000 input tokens and $0.0011 per 1,000 output tokens.
What is Llama 4 Maverick's context window?
Llama 4 Maverick supports a context window of 1M tokens, which is suitable for very long documents, large codebases, and extended multi-turn conversations.
How does Llama 4 Maverick compare to GPT-4o on price?
Llama 4 Maverick is 81% cheaper than the market average on input tokens. At $0.5/1M input vs $2.50/1M for GPT-4o, the cost difference becomes significant at scale — 10,000 requests/day with 1,000 input tokens each costs $150/month with Llama 4 Maverick vs $750/month with GPT-4o.