Llama 3.1 405B

Meta

Meta's largest open-source model — frontier-class intelligence available via third-party API providers

Input price: $3.50 / 1M tokens
Output price: $3.50 / 1M tokens
Context window: 128k tokens
Last updated: 2026-04-22

Quick calculator

Example figures, which correspond to 1,500 total tokens per request (input plus output) at 10,000 req/day:

Per request: $0.005250
Daily: $52.50
Monthly: $1,575.00 (30-day estimate)
Yearly: $19,162.50
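
The figures above are consistent with 1,500 total tokens per request and 10,000 requests per day, with the yearly total computed over 365 days. A minimal sketch of that arithmetic follows. The function name `estimate_costs` and the 1,000 input / 500 output split are assumptions for illustration; because input and output prices are equal here, only the 1,500-token total is implied by the page.

```python
from decimal import Decimal

# Prices taken from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("3.50")
OUTPUT_PRICE_PER_M = Decimal("3.50")


def estimate_costs(input_tokens, output_tokens, requests_per_day):
    """Return per-request, daily, monthly (30-day), and yearly (365-day) cost in USD."""
    per_request = (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / Decimal(1_000_000)
    daily = per_request * requests_per_day
    return {
        "per_request": per_request,
        "daily": daily,
        "monthly": daily * 30,   # 30-day estimate, as on the page
        "yearly": daily * 365,   # 365 days, inferred from the page's yearly figure
    }


# Assumed split: 1,000 input + 500 output tokens, 10,000 requests/day.
costs = estimate_costs(1_000, 500, 10_000)
for period, amount in costs.items():
    print(f"{period}: ${amount}")
```

`Decimal` is used instead of floats so that small per-request amounts do not pick up rounding noise when scaled to monthly or yearly totals.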

Tips to reduce cost

  • Use prompt caching to reuse repeated system prompts
  • Trim whitespace and reduce verbose instructions
  • Use a smaller model for classification or routing tasks
  • Batch async requests where your provider offers a batch discount (OpenAI and Anthropic offer 50%)
  • Cache identical requests at the application layer
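
The last tip, caching identical requests at the application layer, can be sketched roughly as below. This is an illustrative in-memory version only: `ResponseCache` and the `call_model` callable are hypothetical names, not part of any provider SDK, and a production setup would likely add expiry and a shared store.

```python
import hashlib
import json


class ResponseCache:
    """In-memory cache keyed on the exact (model, prompt) pair."""

    def __init__(self, call_model):
        # call_model is a hypothetical callable: (model, prompt) -> str.
        self._call_model = call_model
        self._store = {}
        self.hits = 0
        self.misses = 0

    def _key(self, model, prompt):
        # Hash a canonical JSON encoding so identical requests map to one key.
        payload = json.dumps({"model": model, "prompt": prompt}, sort_keys=True)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def complete(self, model, prompt):
        key = self._key(model, prompt)
        if key in self._store:
            self.hits += 1          # served from cache, no tokens billed
            return self._store[key]
        self.misses += 1            # first time seen: pay for one real call
        result = self._call_model(model, prompt)
        self._store[key] = result
        return result


# Usage with a stand-in for the real API call.
def fake_model(model, prompt):
    return f"[{model}] answer to: {prompt}"


cache = ResponseCache(fake_model)
cache.complete("llama-3.1-405b", "Classify: great product")
cache.complete("llama-3.1-405b", "Classify: great product")
print(cache.hits, cache.misses)
```

Only byte-identical requests hit the cache, so this helps most with deterministic, repeated prompts rather than free-form conversations.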

About Llama 3.1 405B

Llama 3.1 405B is a premium large language model from Meta, priced at $3.50 per 1M input tokens and $3.50 per 1M output tokens. It is priced above the market average and is best suited to workloads that need the strongest open-source performance. The 128k context window handles long documents, extended conversations, and large code files comfortably.

For most production workloads, cost is dominated by input tokens (system prompts, context, retrieved documents) rather than output. Because input and output tokens cost the same for this model, the split depends only on token volume. At this price point, Llama 3.1 405B is positioned for use cases where quality justifies the premium over cheaper alternatives.

Frequently Asked Questions

How much does Llama 3.1 405B cost per 1,000 tokens?
Llama 3.1 405B costs $0.0035 per 1,000 input tokens and $0.0035 per 1,000 output tokens.
What is Llama 3.1 405B's context window?
Llama 3.1 405B supports a context window of 128k tokens, which is suitable for long documents and multi-turn conversations.
How does Llama 3.1 405B compare to GPT-4o on price?
On input tokens, Llama 3.1 405B costs more than GPT-4o: $3.50 vs $2.50 per 1M tokens. The difference becomes significant at scale: 10,000 requests/day with 1,000 input tokens each costs $1,050/month with Llama 3.1 405B vs $750/month with GPT-4o (input tokens only, 30-day month).
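
The monthly comparison above can be reproduced with a few lines of arithmetic. This is a sketch covering input tokens only, using the two input prices quoted in the answer; the helper name `monthly_input_cost` is an assumption for illustration.

```python
def monthly_input_cost(price_per_million, input_tokens_per_request,
                       requests_per_day, days=30):
    """Monthly input-token cost in USD for a given price per 1M tokens."""
    tokens_per_month = input_tokens_per_request * requests_per_day * days
    return tokens_per_month * price_per_million / 1_000_000


# Input prices in USD per 1M tokens, as quoted on this page.
llama_cost = monthly_input_cost(3.50, 1_000, 10_000)
gpt4o_cost = monthly_input_cost(2.50, 1_000, 10_000)

print(f"Llama 3.1 405B: ${llama_cost:,.2f}/month")
print(f"GPT-4o:         ${gpt4o_cost:,.2f}/month")
print(f"Difference:     ${llama_cost - gpt4o_cost:,.2f}/month")
```

Output tokens are left out on purpose, since the answer above compares input pricing only; a full comparison would add each model's output price.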

Compare Llama 3.1 405B with other models

  • Llama 3.1 405B vs Gemini 3 Pro
  • Llama 3.1 405B vs Claude Sonnet 4.6
  • Llama 3.1 405B vs Claude 3.5 Sonnet
  • Llama 3.1 405B vs Grok 3
  • Llama 3.1 405B vs GPT-4o
  • Llama 3.1 405B vs GPT-4.1