← All models
L
Llama 3.1 8B
Meta
Meta's efficient open-source model. Cheapest option for high-volume tasks via third-party API providers.
Input price$0.02 / 1M tokens
Output price$0.05 / 1M tokens
Context window128k tokens
Last updated2026-04-21
Quick calculator
Per request
$0.000045
Daily
$0.4500
Monthly
$13.50
per month · 30-day estimate
Yearly
$164.25
Tips to reduce cost
- →Use prompt caching to reuse repeated system prompts
- →Trim whitespace and reduce verbose instructions
- →Use a smaller model for classification or routing tasks
- →Batch async requests to get 50% discount (OpenAI/Anthropic)
- →Cache identical requests at the application layer
Similar models from Meta
Compared at your current token settings