o3

OpenAI

Advanced reasoning model at a significantly reduced price (80% cut from launch)

Input price: $0.40 / 1M tokens
Output price: $1.60 / 1M tokens
Context window: 200k tokens
Last updated: 2026-04-20

Quick calculator

Inputs: input tokens, output tokens, requests per day
Per request: $0.001200
Daily: $12.00
Monthly: $360.00 (30-day estimate)
Yearly: $4,380.00
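
The arithmetic behind the calculator can be sketched as follows. This is a minimal illustration, not the page's actual implementation: the function names are hypothetical, and the example inputs (1,000 input tokens, 500 output tokens, 10,000 requests per day) are an assumption chosen because they are consistent with the figures shown above; the page itself does not state them.

```python
# Prices from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.40
OUTPUT_PRICE_PER_M = 1.60


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def projected_costs(input_tokens: int, output_tokens: int, requests_per_day: int) -> dict:
    """Project per-request, daily, 30-day, and 365-day costs in USD."""
    per_request = request_cost(input_tokens, output_tokens)
    daily = per_request * requests_per_day
    return {
        "per_request": per_request,
        "daily": daily,
        "monthly": daily * 30,   # 30-day estimate, as on the page
        "yearly": daily * 365,
    }


# Assumed example inputs (not stated on the page).
costs = projected_costs(input_tokens=1_000, output_tokens=500, requests_per_day=10_000)
for period, usd in costs.items():
    print(f"{period}: ${usd:,.4f}")
```

Note that the yearly figure uses 365 days while the monthly figure uses 30, so yearly is not simply 12 times monthly.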

Tips to reduce cost

  • Use prompt caching to reuse repeated system prompts
  • Trim whitespace and reduce verbose instructions
  • Use a smaller model for classification or routing tasks
  • Batch async requests to get 50% discount (OpenAI/Anthropic)
  • Cache identical requests at the application layer
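
As an illustration of the last tip, here is a minimal sketch of an application-layer cache. It assumes a hypothetical `call_model(system_prompt, user_prompt)` function that wraps whatever API client you use; nothing here is part of a real SDK. Identical requests are keyed by a hash of their prompts and served from memory instead of being billed again.

```python
import hashlib
import json
from typing import Callable, Dict


def make_cached_client(call_model: Callable[[str, str], str]) -> Callable[[str, str], str]:
    """Wrap a model-calling function so identical requests are answered from memory."""
    cache: Dict[str, str] = {}

    def cached(system_prompt: str, user_prompt: str) -> str:
        # Hash the full request so the key stays small even for long prompts.
        payload = json.dumps([system_prompt, user_prompt]).encode("utf-8")
        key = hashlib.sha256(payload).hexdigest()
        if key not in cache:
            cache[key] = call_model(system_prompt, user_prompt)
        return cache[key]

    return cached


# Stand-in for a real API call (hypothetical), used only to show the behaviour.
calls = []


def fake_model(system_prompt: str, user_prompt: str) -> str:
    calls.append(user_prompt)
    return f"echo: {user_prompt}"


client = make_cached_client(fake_model)
first = client("You are helpful.", "Hi")
second = client("You are helpful.", "Hi")  # served from the cache, no second call
print(len(calls))
```

A production cache would also need an eviction policy and a decision about whether identical prompts should always return identical answers, which this sketch leaves out.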

About o3

o3 is a mid-range large language model from OpenAI, priced at $0.40 per 1M input tokens and $1.60 per 1M output tokens. It is 84% cheaper than the market average and best suited to complex reasoning. The 200k-token context window comfortably handles long documents, extended conversations, and large code files.

As a reasoning model, o3 generates internal thinking tokens before responding. These are billed at the output token rate and can add 3–10x to effective output cost. For tasks requiring deep reasoning (math, complex coding, multi-step analysis), this overhead is usually justified by fewer errors and retries.
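
To make the overhead concrete, the sketch below estimates effective output cost per request. It assumes one reading of the "3–10x" figure: that hidden reasoning tokens amount to 3 to 10 times the visible output tokens and are billed on top of them. The function name and the 500-token example are illustrative assumptions, not values from the page.

```python
# Output price from the listing above, in USD per 1M tokens.
OUTPUT_PRICE_PER_M = 1.60


def effective_output_cost(visible_output_tokens: int, reasoning_multiplier: float) -> float:
    """Return USD output cost when reasoning tokens are billed as output.

    reasoning_multiplier is the assumed ratio of hidden reasoning tokens
    to visible output tokens (0 means no reasoning overhead).
    """
    reasoning_tokens = visible_output_tokens * reasoning_multiplier
    billed_tokens = visible_output_tokens + reasoning_tokens
    return billed_tokens * OUTPUT_PRICE_PER_M / 1_000_000


# Assumed example: a 500-token visible answer at three overhead levels.
for multiplier in (0, 3, 10):
    cost = effective_output_cost(500, multiplier)
    print(f"reasoning x{multiplier}: ${cost:.4f} per request")
```

Under these assumptions, the same 500-token answer costs between 4 and 11 times the no-reasoning baseline, which is worth factoring into any budget built from visible output alone.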

Frequently Asked Questions

How much does o3 cost per 1,000 tokens?
o3 costs $0.0004 per 1,000 input tokens and $0.0016 per 1,000 output tokens.
What is o3's context window?
o3 supports a context window of 200k tokens, which is suitable for long documents and multi-turn conversations.
How does o3 compare to GPT-4o on price?
o3 is 84% cheaper than the market average on input tokens. At $0.40/1M input versus $2.50/1M for GPT-4o (also 84% lower), the difference becomes significant at scale: 10,000 requests/day with 1,000 input tokens each costs $120/month with o3 versus $750/month with GPT-4o, counting input tokens only over a 30-day month.
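
The comparison above can be checked with a few lines of arithmetic. This is a sketch using only the prices and volumes quoted in the answer; the function name is hypothetical, and output tokens are deliberately excluded, as they are in the FAQ figures.

```python
def monthly_input_cost(price_per_m: float, tokens_per_request: int,
                       requests_per_day: int, days: int = 30) -> float:
    """Return the USD cost of input tokens alone over the given number of days."""
    total_tokens = tokens_per_request * requests_per_day * days
    return price_per_m * total_tokens / 1_000_000


# Prices and volumes as quoted in the FAQ answer.
o3_monthly = monthly_input_cost(0.40, 1_000, 10_000)
gpt4o_monthly = monthly_input_cost(2.50, 1_000, 10_000)
savings = 1 - o3_monthly / gpt4o_monthly

print(f"o3: ${o3_monthly:.2f}/month, GPT-4o: ${gpt4o_monthly:.2f}/month, "
      f"savings: {savings:.0%}")
```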

Compare o3 with other models

  • o3 vs Mistral Medium 3
  • o3 vs Gemini 3 Flash
  • o3 vs Mistral Large 3
  • o3 vs Llama 4 Maverick
  • o3 vs Gemini 2.5 Flash
  • o3 vs Codestral