All Models

Browse 51 models across 8 providers.

O

OpenAI

12 models
OGPT-5

OpenAI's next-generation flagship — significant capability jump over GPT-4.1 with 1M context

$8 in$32 out1M
OGPT-5 Mini

Affordable GPT-5 intelligence — brings GPT-5 capability to cost-sensitive workloads

$0.6 in$2.4 out512k
OGPT-4.1

Latest flagship with 1M context window and strong coding/instruction following

$2 in$8 out1M
OGPT-4.1 Mini

Affordable intelligence with 1M context — best cost/performance in the 4.1 family

$0.4 in$1.6 out1M
OGPT-4.1 Nano

Smallest and cheapest GPT-4.1 model — ideal for simple tasks needing 1M context

$0.1 in$0.4 out1M
Oo4-mini

Fast, efficient reasoning model optimized for STEM and coding tasks

$1.1 in$4.4 out200k
Oo3

Advanced reasoning model at significantly reduced price (80% cut from launch)

$0.4 in$1.6 out200k
Oo1

OpenAI's original frontier reasoning model — deep thinking for the hardest problems

$15 in$60 out200k
OGPT-4o

Multimodal model with strong vision, audio, and text capabilities

$2.5 in$10 out128k
OGPT-4o Mini

Ultra-affordable model for high-volume tasks with good quality

$0.15 in$0.6 out128k
OGPT-4 Turbo

Previous generation GPT-4 Turbo — powerful but superseded by GPT-4o in cost-efficiency

$10 in$30 out128k
OGPT-3.5 Turbo

Classic fast model — still cost-effective for simple chat tasks and legacy integrations

$0.5 in$1.5 out16k
A

Anthropic

8 models
AClaude Opus 4.7

Most capable Claude model — step-change improvement in agentic coding

$5 in$25 out1M
AClaude Sonnet 4.6

Optimal balance of intelligence, cost, and speed with 1M context

$3 in$15 out1M
AClaude Opus 4.6

Previous flagship — strong reasoning and extended thinking support

$5 in$25 out1M
AClaude Opus 4.5

Most capable Claude 4 model with extended thinking — top performance on complex reasoning and coding

$15 in$75 out200k
AClaude Haiku 4.5

Fastest and most cost-efficient Claude with near-frontier intelligence

$1 in$5 out200k
AClaude 3.5 Sonnet

Previous generation Sonnet — high intelligence at moderate cost, widely used in production

$3 in$15 out200k
AClaude 3.5 Haiku

Previous generation fast model — great balance of speed and intelligence at low cost

$0.8 in$4 out200k
AClaude 3 Opus

Third-generation flagship — powerful reasoning, still used for demanding legacy workloads

$15 in$75 out200k
G

Google

10 models
GGemini 3 Ultra

Google's most powerful model — frontier reasoning, native multimodal, 2M context window

$10 in$30 out2M
GGemini 3 Pro

Gemini 3's balanced model — strong reasoning at a fraction of Ultra cost

$3.5 in$14 out1M
GGemini 3 Flash

Fast and capable Gemini 3 — ideal for real-time applications needing 1M context

$0.5 in$2 out1M
GGemini 3 Flash-Lite

Most affordable Gemini 3 model — high-volume tasks with 1M context at near-zero cost

$0.12 in$0.48 out1M
GGemini 2.5 Pro

Most capable Gemini model with deep reasoning and multimodal support

$1.25 in$10 out1M
GGemini 2.5 Flash

Best-in-class speed and efficiency for diverse tasks

$0.3 in$2.5 out1M
GGemini 2.5 Flash-Lite

Most cost-efficient Gemini model for high-volume, latency-sensitive workloads

$0.1 in$0.4 out1M
GGemini 2.0 Flash

Previous gen workhorse — fast multimodal model with excellent price-to-performance

$0.1 in$0.4 out1M
GGemini 2.0 Flash-Lite

Ultra-cheap previous gen model — suitable for high-volume simple generation tasks

$0.075 in$0.3 out1M
GGemini 1.5 Pro

First model with 2M token context window — great for massive document analysis

$1.25 in$5 out2M
M

Mistral

6 models
MMagistral Medium

Mistral's reasoning model — strong for complex analytical and math tasks

$2 in$5 out128k
MMistral Large 3

Frontier-level MoE model (675B total / 41B active params) at competitive price

$0.5 in$1.5 out256k
MMistral Medium 3

State-of-the-art performance at 8x lower cost than previous generation

$0.4 in$2 out128k
MMistral Small 3.1

Efficient small model for simple, high-volume tasks

$0.1 in$0.3 out128k
MCodestral

Code-specialized model with 256k context — optimized for fill-in-the-middle

$0.3 in$0.9 out256k
MMistral Nemo

Compact multilingual model with 128k context — great budget option for EU-compliance workloads

$0.1 in$0.3 out128k
D

DeepSeek

3 models
DDeepSeek R2

DeepSeek's second-generation reasoning model — stronger than R1 across all benchmarks at similar cost

$0.8 in$3.2 out128k
DDeepSeek R1

Open-source reasoning model matching o1-level performance at a fraction of the cost. Ideal for math, coding, logic.

$0.55 in$2.19 out64k
DDeepSeek Chat

Cost-efficient chat model with strong multilingual performance. Best price-to-quality for Asian languages.

$0.27 in$1.1 out64k
L

Meta

5 models
LLlama 4 Maverick

Meta's powerful Llama 4 model balancing performance and cost with 1M context

$0.5 in$1.1 out1M
LLlama 4 Scout

Meta's latest efficient model with a massive 10M token context window at extremely low cost

$0.17 in$0.17 out10M
LLlama 3.1 405B

Meta's largest open-source model — frontier-class intelligence available via third-party API providers

$3.5 in$3.5 out128k
LLlama 3.3 70B

Meta's large open-source model with strong reasoning. Great value via providers like Together AI or Fireworks.

$0.23 in$0.4 out128k
LLlama 3.1 8B

Meta's efficient open-source model. Cheapest option for high-volume tasks via third-party API providers.

$0.02 in$0.05 out128k
X

xAI

3 models
XGrok 3

xAI's flagship model with real-time web access and strong performance on coding and analysis.

$3 in$15 out128k
XGrok 3 Fast

xAI's high-throughput variant of Grok 3 — same intelligence as flagship with faster response times

$5 in$25 out128k
XGrok 3 Mini

xAI's efficient model optimized for fast responses and cost-sensitive workloads.

$0.3 in$0.5 out128k
Q

Qwen

4 models
QQwen3.5 Flash

Alibaba's fastest model with 256k context window at near-zero cost. Best for ultra high-volume tasks.

$0.01 in$0.05 out256k
QQwen3 235B

Alibaba's large MoE model with exceptional price — $0.06/1M for both input and output tokens.

$0.06 in$0.06 out32k
QQwen3 30B

Alibaba's mid-size model — solid performance at near-zero cost for everyday tasks

$0.1 in$0.15 out32k
QQwen3 8B

Alibaba's small efficient model — one of the cheapest options for simple classification and generation

$0.05 in$0.1 out32k