Google DeepMind · Released February 5, 2025

Gemini 2.0 Flash

#8 Arena Rank · Undisclosed parameters

Gemini 2.0 Flash ranks in the top 10 on the Arena leaderboard. Context window: 1M tokens.

Context: 1M tokens

Input: $0.10 per 1M tokens

Key Specifications

Arena Rank: #8

Context Window: 1M tokens

Input Price: $0.10 per 1M tokens

Output Price: $0.40 per 1M tokens

Parameters: Undisclosed

Open Source: No

Best For

Agentic tasks, multimodal, tool use

About Gemini 2.0 Flash

Gemini 2.0 Flash, developed by Google DeepMind, is a fast multimodal model with a 1 million token context window and enhanced agentic capabilities. The model processes text, images, and audio while supporting tool use, code execution, and multi-step workflows. Its architecture is optimized for applications requiring autonomous decision-making and real-time responsiveness. Gemini 2.0 Flash introduced improved function calling and native Google Search integration, enabling grounded responses with current information. Priced at $0.10 per million input tokens and $0.40 per million output tokens, it delivers strong capability at accessible pricing. Gemini 2.0 Flash ranks #8 on the Chatbot Arena leaderboard, reflecting substantial performance improvements over its predecessor while maintaining the speed characteristics that define the Flash model line.

Pricing per 1M tokens

Input Tokens: $0.10

Output Tokens: $0.40

Frequently Asked Questions

What is Gemini 2.0 Flash?
Gemini 2.0 Flash is a fast multimodal model from Google DeepMind with a 1 million token context window. It processes text, images, and audio, and supports tool use, code execution, and multi-step workflows. It also introduced improved function calling and native Google Search integration for grounded responses.
How much does Gemini 2.0 Flash cost?
Input pricing for Gemini 2.0 Flash is $0.10 per million tokens; output runs $0.40. Token-based pricing means you can scale up or down without a fixed commitment.
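To make the per-token arithmetic concrete, here is a minimal Python sketch that estimates the cost of a request from the two prices listed on this page. The function name `estimate_cost` and the constant names are illustrative, not part of any Google API, and the token counts are assumed to be known already (for example, from a usage report).

```python
from decimal import Decimal

# Prices as listed on this page, in US dollars per 1M tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.10")
OUTPUT_PRICE_PER_MILLION = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in US dollars for one request.

    Hypothetical helper: it only applies the listed per-token prices
    and ignores any discounts, free tiers, or other billing rules.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_PRICE_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_MILLION
    return input_cost + output_cost
```

For instance, `estimate_cost(200_000, 50_000)` combines $0.02 of input with $0.02 of output, for a total of $0.04. `Decimal` is used instead of `float` so that small currency amounts do not pick up binary rounding error.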
What is Gemini 2.0 Flash's context window?
The context window for Gemini 2.0 Flash is 1M tokens. That's the maximum amount of text you can feed into a single prompt, including system instructions, conversation history, and the actual query.
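A rough budget check can show how the parts of a prompt add up against the window. The sketch below assumes the limit is exactly 1,000,000 tokens (the page only says "1M", so the precise figure is an assumption) and that token counts for each part are supplied by the caller. The function names are hypothetical.

```python
# "1M" as stated on this page; the exact limit is an assumption.
CONTEXT_WINDOW_TOKENS = 1_000_000


def remaining_context(system_tokens: int, history_tokens: int, query_tokens: int) -> int:
    """Return how many tokens of the window are left after the prompt parts.

    A negative result means the prompt exceeds the assumed window.
    """
    used = system_tokens + history_tokens + query_tokens
    return CONTEXT_WINDOW_TOKENS - used


def fits_in_context(system_tokens: int, history_tokens: int, query_tokens: int) -> bool:
    """Return True if the combined prompt fits within the assumed window."""
    return remaining_context(system_tokens, history_tokens, query_tokens) >= 0
```

With 2,000 system tokens, 150,000 tokens of history, and a 500-token query, 847,500 tokens of the assumed window remain. In practice the counts would come from the provider's own tokenizer, since character or word counts only approximate token usage.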
Is Gemini 2.0 Flash open source?
Gemini 2.0 Flash is closed-source: its weights are not published, and the model is accessible only through Google's hosted API. Proprietary models trade deployment flexibility for convenience, since Google handles the infrastructure.
What is Gemini 2.0 Flash best for?
Gemini 2.0 Flash is best suited to agentic tasks, multimodal input, and tool use. If your workload fits one of these categories, it's worth benchmarking against alternatives.