Question 1

How much does Gemini 2.0 Flash cost per 1,000 tokens?

Accepted Answer

Gemini 2.0 Flash costs $0.0001 per 1,000 input tokens and $0.0004 per 1,000 output tokens.

Question 2

What is Gemini 2.0 Flash's context window?

Accepted Answer

Gemini 2.0 Flash supports a context window of 1M tokens, which is suitable for very long documents, large codebases, and extended multi-turn conversations.

Question 3

How does Gemini 2.0 Flash compare to GPT-4o on price?

Accepted Answer

Gemini 2.0 Flash is 96% cheaper than the market average on input tokens. At $0.1/1M input vs $2.50/1M for GPT-4o, the cost difference becomes significant at scale — 10,000 requests/day with 1,000 input tokens each costs $30/month with Gemini 2.0 Flash vs $750/month with GPT-4o.

Question 4

Does Gemini 2.0 Flash support prompt caching?

Accepted Answer

Yes. Gemini 2.0 Flash supports prompt caching at $0.025/1M tokens — a 75% discount on repeated input. This is especially effective for RAG pipelines and chatbots with large system prompts that repeat across requests.

Gemini 2.0 Flash

Quick calculator

Tips to reduce cost

Similar models from Google

About Gemini 2.0 Flash

Frequently Asked Questions

Compare Gemini 2.0 Flash with other models