← All models
O
GPT-3.5 Turbo
OpenAI
Classic fast model — still cost-effective for simple chat tasks and legacy integrations
Input price$0.50 / 1M tokens
Output price$1.50 / 1M tokens
Context window16k tokens
Last updated2026-04-22
Quick calculator
Per request
$0.001250
Daily
$12.50
Monthly
$375.00
per month · 30-day estimate
Yearly
$4,562.50
Tips to reduce cost
- →Use prompt caching to reuse repeated system prompts
- →Trim whitespace and reduce verbose instructions
- →Use a smaller model for classification or routing tasks
- →Batch async requests to get 50% discount (OpenAI/Anthropic)
- →Cache identical requests at the application layer
Similar models from OpenAI
Compared at your current token settings
About GPT-3.5 Turbo
GPT-3.5 Turbo is a mid-range large language model from openai, priced at $0.5/1M input tokens and $1.5/1M output tokens. It is 81% cheaper than the market average and best suited for legacy chat applications. The 16k context window covers most standard production workloads.
For most production workloads, the cost breakdown is dominated by input tokens (system prompts, context, retrieved documents) rather than output. At this price point, GPT-3.5 Turbo is a solid choice when balancing quality and cost at scale.
Frequently Asked Questions
How much does GPT-3.5 Turbo cost per 1,000 tokens?
GPT-3.5 Turbo costs $0.0005 per 1,000 input tokens and $0.0015 per 1,000 output tokens.
What is GPT-3.5 Turbo's context window?
GPT-3.5 Turbo supports a context window of 16k tokens, which is suitable for standard use cases and moderate-length conversations.
How does GPT-3.5 Turbo compare to GPT-4o on price?
GPT-3.5 Turbo is 81% cheaper than the market average on input tokens. At $0.5/1M input vs $2.50/1M for GPT-4o, the cost difference becomes significant at scale — 10,000 requests/day with 1,000 input tokens each costs $150/month with GPT-3.5 Turbo vs $750/month with GPT-4o.