Gemini 2.5 Flash Lite

Google

google/gemini-2.5-flash-lite

Lowest cost Gemini

Context Window

1.0M

1,000,000 tokens

Max Output

66K

65,536 tokens

About this model

Gemini 2.5 Flash Lite is the lowest-cost model in the Gemini series, suitable for large-scale deployments with extreme price sensitivity. It maintains the core capabilities of the Flash series on simple tasks.

Highlights

Lowest Gemini price

1M context

Ideal for scale

Best For

Large-scale classificationSimple Q&AText extraction

2025-04-17MoE TransformerProprietary

Capabilities

ChatVisionReasoningCodepdftoolscache

Pricing (per 1M tokens)

Pricing (per 1M tokens)	/ 1M tokens
Input / 1M	$0.105
Output / 1M	$0.420
Cache Read	$0.011

Final prices shown

Quick Start

main.py

from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="google/gemini-2.5-flash-lite",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

Gemini 2.5 Flash Lite

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

Related Models

Gemini 2.5 Pro

Gemini 2.5 Flash

Gemini 3.5 Flash

Gemini 3 Flash Preview

Gemini 3.1 Pro Preview

Gemini 3.1 Flash Image Preview

Gemini 2.5 Flash Lite

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

How do I get an API Key?

How does billing work?

What payment methods are supported?

Are there rate limits?

Is Google Search grounding supported?

What about Gemini's 1M/2M context window?

Related Models

Gemini 2.5 Pro

Gemini 2.5 Flash

Gemini 3.5 Flash

Gemini 3 Flash Preview

Gemini 3.1 Pro Preview

Gemini 3.1 Flash Image Preview