GLM 4.5 Air
Zhipu
zhipu/glm-4.5-air
131K context
Context Window
131K
131,072 tokens
Max Output
16K
16,384 tokens
About this model
Lightweight GLM-4.5 for fast inference
This model supports up to 131K tokens of context. It provides strong code generation and debugging capabilities.
Access it through Chuizi.AI with a single ck- API key β no separate Zhipu AI account needed.
Highlights
131K context window
16K max output
Strong code generation
Best For
Code generationRefactoringDebuggingDocumentation
2025-07-28
Capabilities
ChatCodetools
Aliases
glm-4.5-airPricing (per 1M tokens)
| Pricing (per 1M tokens) | / 1M tokens |
|---|---|
| Input / 1M | $0.29 |
| Output / 1M | $1.16 |
Final prices shown
Quick Start
main.py
from openai import OpenAI client = OpenAI( base_url="https://api.chuizi.ai/v1", api_key="ck-your-key-here", ) response = client.chat.completions.create( model="zhipu/glm-4.5-air", messages=[{"role": "user", "content": "Hello!"}], ) print(response.choices[0].message.content)