← All models
G

Gemini 2.5 Flash-Lite

Google

Most cost-efficient Gemini model for high-volume, latency-sensitive workloads

Input price$0.10 / 1M tokens
Output price$0.40 / 1M tokens
Context window1M tokens
Last updated2026-04-20

Quick calculator

tokens
tokens
req/day
Per request
$0.000300
Daily
$3.00
Monthly
$90.00
per month · 30-day estimate
Yearly
$1,095.00

Tips to reduce cost

  • Use prompt caching to reuse repeated system prompts
  • Trim whitespace and reduce verbose instructions
  • Use a smaller model for classification or routing tasks
  • Batch async requests to get 50% discount (OpenAI/Anthropic)
  • Cache identical requests at the application layer

Similar models from Google

Compared at your current token settings