Mistral Medium 3
Mistral
State-of-the-art performance at 8x lower cost than previous generation
Input price: $0.40 / 1M tokens
Output price: $2.00 / 1M tokens
Context window: 128k tokens
Last updated: 2026-04-20
Quick calculator
Per request: $0.001400
Daily: $14.00
Monthly: $420.00 (30-day estimate)
Yearly: $5,110.00
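The calculator figures can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the page does not state the token settings behind these numbers, so it assumes 1,000 input tokens, 500 output tokens, and 10,000 requests per day, which happen to yield the same totals. The function names `request_cost` and `project` are made up for this example.

```python
# Prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.40
OUTPUT_PRICE_PER_M = 2.00


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a single request with the given token counts."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


def project(input_tokens: int, output_tokens: int, requests_per_day: int) -> dict:
    """Scale the per-request cost to daily, 30-day, and 365-day totals."""
    per_request = request_cost(input_tokens, output_tokens)
    daily = per_request * requests_per_day
    return {
        "per_request": per_request,
        "daily": daily,
        "monthly": daily * 30,   # 30-day estimate, as on the page
        "yearly": daily * 365,
    }


# Assumed settings (not stated on the page).
estimate = project(input_tokens=1_000, output_tokens=500, requests_per_day=10_000)
print(f"Per request: ${estimate['per_request']:.6f}")
print(f"Daily:       ${estimate['daily']:,.2f}")
print(f"Monthly:     ${estimate['monthly']:,.2f}")
print(f"Yearly:      ${estimate['yearly']:,.2f}")
```

Changing the three arguments to `project` gives estimates for other workloads at the same prices.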
Tips to reduce cost
- Use prompt caching to reuse repeated system prompts
- Trim whitespace and reduce verbose instructions
- Use a smaller model for classification or routing tasks
- Batch asynchronous requests for a 50% discount where the provider offers one (e.g. OpenAI, Anthropic)
- Cache identical requests at the application layer
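As one illustration of the last tip, an application-layer cache can be as simple as a dictionary keyed by a hash of the request. This is a minimal in-memory sketch, not a production design: `ResponseCache` is a hypothetical class, and `call_model` stands in for whatever function your application uses to call the model API.

```python
import hashlib
import json
from typing import Callable


class ResponseCache:
    """In-memory cache that returns a stored response for identical requests."""

    def __init__(self, call_model: Callable[[str, list], str]):
        # call_model is a placeholder for the application's real API call.
        self._call_model = call_model
        self._store: dict[str, str] = {}
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(model: str, messages: list) -> str:
        # Serialize deterministically so equal requests hash to the same key.
        payload = json.dumps({"model": model, "messages": messages}, sort_keys=True)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def complete(self, model: str, messages: list) -> str:
        key = self._key(model, messages)
        if key in self._store:
            self.hits += 1          # no API call, no token cost
            return self._store[key]
        self.misses += 1
        response = self._call_model(model, messages)
        self._store[key] = response
        return response
```

Only exact repeats are served from the cache, so this helps most with deterministic, frequently repeated prompts. A real deployment would likely add expiry and a size limit.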
About Mistral Medium 3
Mistral Medium 3 is a mid-range large language model from Mistral, priced at $0.40/1M input tokens and $2.00/1M output tokens. It is 84% cheaper than the market average on input tokens and is best suited to tasks that need near-frontier quality on a budget. The 128k context window handles long documents, extended conversations, and large code files comfortably.
In many production workloads, input tokens (system prompts, context, retrieved documents) far outnumber output tokens, so they can dominate the bill even though output tokens cost five times as much per token. At this price point, Mistral Medium 3 is a solid choice when balancing quality and cost at scale.
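How much of the bill comes from input depends on the token mix of the workload, since output tokens are priced at five times the input rate. The sketch below computes the input share of cost for a given mix. The function name `input_cost_share` and the example token counts are illustrative assumptions, not figures from this page.

```python
# Prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.40
OUTPUT_PRICE_PER_M = 2.00


def input_cost_share(input_tokens: int, output_tokens: int) -> float:
    """Fraction of a request's cost that comes from input tokens (0.0 to 1.0)."""
    input_cost = input_tokens * INPUT_PRICE_PER_M
    output_cost = output_tokens * OUTPUT_PRICE_PER_M
    total = input_cost + output_cost
    if total == 0:
        raise ValueError("request has no tokens")
    return input_cost / total


# Hypothetical workloads: (input tokens, output tokens) per request.
workloads = {
    "short chat": (1_000, 500),
    "retrieval-heavy": (8_000, 300),
    "break-even": (5_000, 1_000),
}
for name, (n_in, n_out) in workloads.items():
    print(f"{name}: input is {input_cost_share(n_in, n_out):.0%} of cost")
```

At these prices, input accounts for more than half the cost only when a request carries more than five input tokens per output token.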
Frequently Asked Questions
How much does Mistral Medium 3 cost per 1,000 tokens?
Mistral Medium 3 costs $0.0004 per 1,000 input tokens and $0.0020 per 1,000 output tokens.
What is Mistral Medium 3's context window?
Mistral Medium 3 supports a context window of 128k tokens, which is suitable for long documents and multi-turn conversations.
How does Mistral Medium 3 compare to GPT-4o on price?
Mistral Medium 3 is 84% cheaper than the market average on input tokens. At $0.40/1M input tokens versus $2.50/1M for GPT-4o, the cost difference becomes significant at scale: 10,000 requests/day with 1,000 input tokens each costs $120/month with Mistral Medium 3 versus $750/month with GPT-4o.
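The monthly figures in this answer follow directly from the input prices and request volume. As a quick check, the sketch below recomputes them for input tokens only, using a 30-day month. The function name `monthly_input_cost` is illustrative.

```python
def monthly_input_cost(price_per_m: float, requests_per_day: int,
                       input_tokens_per_request: int, days: int = 30) -> float:
    """Input-token cost in USD over `days`, given a price per 1M tokens."""
    total_tokens = requests_per_day * input_tokens_per_request * days
    return total_tokens / 1_000_000 * price_per_m


# Input prices quoted in this answer, in USD per 1M tokens.
mistral = monthly_input_cost(0.40, requests_per_day=10_000, input_tokens_per_request=1_000)
gpt4o = monthly_input_cost(2.50, requests_per_day=10_000, input_tokens_per_request=1_000)
saving = 1 - mistral / gpt4o

print(f"Mistral Medium 3: ${mistral:,.2f}/month")
print(f"GPT-4o:           ${gpt4o:,.2f}/month")
print(f"Input-cost saving: {saving:.0%}")
```

Output tokens are left out here because the answer compares input prices only. A full comparison would add each model's output price.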