Qwen Flash

Qwen

qwen/qwen-flash

1M context

Context Window

1.0M

1,048,576 tokens

Max Output

66K

65,536 tokens

About this model

Ultra-fast Qwen Flash, upgraded to Qwen3.5

This model supports up to 1M tokens of context. It provides strong code generation and debugging capabilities.

Access it through Chuizi.AI with a single ck- API key — no separate Alibaba account needed.

Highlights

1M context window

66K max output

Strong code generation

Best For

Code generationRefactoringDebuggingDocumentation

2025-07-29

Capabilities

ChatCodetools

Pricing (per 1M tokens)

Pricing (per 1M tokens)	/ 1M tokens
Input / 1M	$0.029
Output / 1M	$0.294

Final prices shown

Quick Start

main.py

from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="qwen/qwen-flash",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

Qwen Flash

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

Related Models

Qwen Max

Qwen Plus

Qwen Turbo

Qwen2.5 Coder 32b

Qwen VL Max

Qwen3 Max

Qwen Flash

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

How do I get an API Key?

How does billing work?

What payment methods are supported?

Are there rate limits?

Related Models

Qwen Max

Qwen Plus

Qwen Turbo

Qwen2.5 Coder 32b

Qwen VL Max

Qwen3 Max