Gemma 3 4B
Gemma
google/gemma-3-4b
128K context, vision
Context Window
128K
128,000 tokens
Max Output
8K
8,192 tokens
About this model
Google Gemma 3 4B, ultra-compact and fast
This model supports up to 128K tokens of context. It includes native vision understanding for analyzing images and documents.
Access it through Chuizi.AI with a single ck- API key; no separate Google account is needed.
Highlights
128K context window
8K max output
Native vision support
Best For
Image analysis
Document OCR
Visual Q&A
Multimodal chat
2025-03-12
Capabilities
Chat
Vision
Aliases
gemma-3-4b
Pricing (per 1M tokens)
| Token type | Price per 1M tokens |
|---|---|
| Input | $0.05 |
| Output | $0.13 |
Prices shown are final.
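As a rough illustration of how the per-million-token rates above translate into a per-request cost, the sketch below multiplies token counts by the listed input and output prices. The function name `estimate_cost` is hypothetical, and the calculation assumes billing is strictly linear in tokens; how image inputs are counted toward tokens is not stated on this page.

```python
from decimal import Decimal

# Rates copied from the pricing table (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.13")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token counts at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# A request that uses the full 128,000-token context figure as input
# and produces the 8,192-token maximum output.
print(estimate_cost(128_000, 8_192))
```

Using `Decimal` rather than floats keeps small per-request amounts exact when they are summed over many calls.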
Quick Start
main.py
from openai import OpenAI

# Point the OpenAI SDK at the Chuizi.AI endpoint and authenticate with a ck- key.
client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="google/gemma-3-4b",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.choices[0].message.content)
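The Quick Start request sends text only. Since the model lists vision as a capability, an image request might look like the sketch below, which builds a message containing a text part and a base64-encoded image part. This follows the OpenAI SDK's `image_url` content-part format; whether the Chuizi.AI endpoint accepts that format (and data URLs in particular) is an assumption not confirmed by this page, and `build_vision_messages` is a hypothetical helper name.

```python
import base64


def build_vision_messages(
    prompt: str, image_bytes: bytes, mime_type: str = "image/png"
) -> list[dict]:
    """Build a single-turn message list with a text part and an image part.

    Hypothetical helper: assumes the endpoint accepts OpenAI-style
    `image_url` content parts carrying a base64 data URL.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime_type};base64,{encoded}"
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url", "image_url": {"url": data_url}},
            ],
        }
    ]


# Placeholder bytes (the PNG file signature); in practice, read a real image file.
messages = build_vision_messages("Describe this image.", b"\x89PNG\r\n\x1a\n")
print(messages[0]["content"][0])
```

The resulting list could then be passed as `messages=` in the same `client.chat.completions.create(...)` call used in the Quick Start, with `model="google/gemma-3-4b"`.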