GLM 5 Turbo

Zhipu
zhipu/glm-5-turbo

Reasoning model, 131K context

Context Window

131K

131,072 tokens

Max Output

16K

16,384 tokens

About this model

GLM-5 Turbo: a fast and cost-effective coding model

This model supports up to 131K tokens of context and excels at complex reasoning, mathematical problems, and multi-step tasks, with strong code generation and debugging capabilities.

Access it through Chuizi.AI with a single ck- API key; no separate Zhipu AI account needed.

Highlights

131K context window
16K max output
Advanced reasoning
Strong code generation
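The 131K context window is shared between the prompt and the 16K max output, which leaves a fixed input budget. A minimal sketch of that arithmetic, assuming the common ~4 characters per token heuristic (not Zhipu's actual tokenizer) and helper names of our own:

```python
# Rough input budget for zhipu/glm-5-turbo (numbers from this page).
CONTEXT_WINDOW = 131_072   # total tokens the model can attend to
MAX_OUTPUT = 16_384        # tokens reserved for the completion
INPUT_BUDGET = CONTEXT_WINDOW - MAX_OUTPUT  # tokens left for the prompt

def rough_token_count(text: str) -> int:
    """Heuristic: ~4 characters per token. Use a real tokenizer in production."""
    return max(1, len(text) // 4)

def fits_in_context(prompt: str) -> bool:
    """True if the prompt likely fits alongside a full-length completion."""
    return rough_token_count(prompt) <= INPUT_BUDGET

print(INPUT_BUDGET)               # 114688
print(fits_in_context("Hello!"))  # True
```

For real workloads, replace the character heuristic with the tokenizer your SDK exposes; the budget subtraction itself is exact.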

Best For

Complex coding tasks
Algorithm design
Code review
Technical architecture
2026-01-01

Capabilities

Chat
Reasoning
Code
Tools
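The tools capability is exposed through the OpenAI-compatible `tools` parameter. A sketch of assembling such a request; the `get_weather` schema and the `build_tool_request` helper are illustrative, not part of the model card or any SDK:

```python
# Illustrative tool schema in the OpenAI-compatible "function" format.
# The function name and parameters are made up for this example.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

def build_tool_request(user_message: str) -> dict:
    """Assemble kwargs for client.chat.completions.create(**kwargs)."""
    return {
        "model": "zhipu/glm-5-turbo",
        "messages": [{"role": "user", "content": user_message}],
        "tools": [WEATHER_TOOL],
        "tool_choice": "auto",
    }

request = build_tool_request("What's the weather in Beijing?")
print(request["model"])  # zhipu/glm-5-turbo
```

Pass the resulting kwargs to `client.chat.completions.create(**request)`; if the model decides to call the tool, the response's message carries `tool_calls` instead of plain content.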

Aliases

glm-5-turbo

Pricing (per 1M tokens)

Input: $0.59 / 1M tokens
Output: $2.31 / 1M tokens

Final prices shown
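With the per-token prices above, estimating a request's cost is simple arithmetic. A short sketch; the `estimate_cost` helper name is ours, not part of any SDK:

```python
# Prices from this page, in USD per 1M tokens.
INPUT_PRICE = 0.59
OUTPUT_PRICE = 2.31

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed rates."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000

# e.g. 100K input tokens and 10K output tokens:
print(round(estimate_cost(100_000, 10_000), 4))  # 0.0821
```

Token counts for a finished request are reported in the API response's `usage` field, so the same formula works for post-hoc accounting.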

Quick Start

main.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="zhipu/glm-5-turbo",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
