Gemma 3 4b

Gemma

google/gemma-3-4b

128K context, vision

Context Window: 128K (128,000 tokens)

Max Output: 8,192 tokens

About this model

Google's Gemma 3 4B is an ultra-compact, fast model.

This model supports up to 128K tokens of context. It includes native vision understanding for analyzing images and documents.
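As an illustration of the vision support described above, here is a minimal sketch of sending an image alongside a text prompt. It assumes the Chuizi.AI endpoint accepts OpenAI-style `image_url` content parts with a base64 data URL, which this page does not confirm. The helper names `build_vision_messages` and `describe_image` are hypothetical.

```python
import base64

MODEL_ID = "google/gemma-3-4b"  # model ID listed on this page


def build_vision_messages(prompt: str, image_bytes: bytes,
                          mime_type: str = "image/png") -> list:
    """Build one OpenAI-style user message carrying text plus an inline image.

    Assumption: the endpoint accepts `image_url` content parts holding a
    base64 data URL, as the OpenAI Chat Completions API does.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime_type};base64,{encoded}"
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url", "image_url": {"url": data_url}},
            ],
        }
    ]


def describe_image(path: str, prompt: str = "Describe this image.") -> str:
    """Send one local image to the model (makes a network call when invoked)."""
    # Imported lazily so build_vision_messages has no third-party dependency.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.chuizi.ai/v1",
        api_key="ck-your-key-here",  # replace with your own key
    )
    with open(path, "rb") as f:
        messages = build_vision_messages(prompt, f.read())
    response = client.chat.completions.create(model=MODEL_ID, messages=messages)
    return response.choices[0].message.content
```

Calling `describe_image("receipt.png", "What is the total on this receipt?")` would then return the model's answer as a string, provided the endpoint handles image parts as assumed.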

Access it through Chuizi.AI with a single API key (prefixed ck-). No separate Google account is needed.

Highlights

128K context window

8K max output

Native vision support
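The two limits above interact when a prompt is long. The sketch below picks a `max_tokens` value that stays within both. It assumes prompt and completion tokens share the same 128,000-token window, which this page does not state explicitly. `output_budget` is a hypothetical helper, and the prompt token count has to come from a tokenizer or a prior response.

```python
CONTEXT_WINDOW = 128_000  # tokens, from this page
MAX_OUTPUT = 8_192        # tokens, from this page


def output_budget(prompt_tokens: int, requested_output: int = MAX_OUTPUT) -> int:
    """Return the largest usable max_tokens value for a given prompt size.

    Assumption: prompt and completion tokens count against one shared
    128,000-token context window.
    """
    if prompt_tokens < 0 or requested_output < 1:
        raise ValueError("token counts must be non-negative, with output >= 1")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining < 1:
        raise ValueError("prompt leaves no room for output in the context window")
    # Respect the caller's request, the model's output cap, and the space left.
    return min(requested_output, MAX_OUTPUT, remaining)
```

The result can be passed as `max_tokens` in a chat completion request. A prompt of 125,000 tokens, for example, leaves room for at most 3,000 output tokens under this assumption.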

Best For

Image analysis

Document OCR

Visual Q&A

Multimodal chat

2025-03-12

Capabilities

Chat

Vision

Pricing (per 1M tokens)

Type	Price per 1M tokens
Input	$0.052
Output	$0.126

Prices shown are final.
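To make the per-million pricing concrete, the following sketch estimates the cost of a single request from its token counts. The prices are the ones listed on this page. `estimate_cost` is a hypothetical helper, and it assumes billing is strictly proportional to token counts with no minimums or rounding.

```python
from decimal import Decimal

# Listed prices per 1M tokens, taken from the pricing on this page.
INPUT_PRICE_PER_M = Decimal("0.052")
OUTPUT_PRICE_PER_M = Decimal("0.126")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in the listed currency.

    Assumption: cost scales linearly with tokens; no minimum charge or rounding.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_M
    return input_cost + output_cost


# 10,000 input tokens and 2,000 output tokens:
# 0.052 * 0.01 + 0.126 * 0.002 = 0.00052 + 0.000252 = 0.000772
print(estimate_cost(10_000, 2_000))
```

If the endpoint returns OpenAI-style usage data, `response.usage.prompt_tokens` and `response.usage.completion_tokens` could be passed straight into this function.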

Quick Start

main.py

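from openai import OpenAI

# Point the OpenAI SDK at the Chuizi.AI endpoint.
client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",  # replace with your own ck- key
)

# Send a single-turn chat request to Gemma 3 4B.
response = client.chat.completions.create(
    model="google/gemma-3-4b",
    messages=[{"role": "user", "content": "Hello!"}],
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)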

FAQ

How do I get an API Key?

How does billing work?

What payment methods are supported?

Are there rate limits?

Related Models

Nova Micro

Nova Lite

Nova Pro

Nova Premier

Nova 2 Lite

Nova 2 Pro