Qwen Flash

Qwen
qwen/qwen-flash

1M context

Context Window

1.0M

1,048,576 tokens

Max Output

66K

65,536 tokens
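
The two limits above can be checked before a request is sent. The following is a minimal sketch, not part of the Chuizi.AI API: `fits_token_budget` is a hypothetical helper, the caller is expected to supply token counts from a tokenizer of their choice, and the sketch assumes that prompt and output tokens share the 1,048,576-token window, which this listing does not state.

```python
# Limits copied from the listing above.
CONTEXT_WINDOW_TOKENS = 1_048_576
MAX_OUTPUT_TOKENS = 65_536


def fits_token_budget(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Return True if a request stays within both listed limits.

    Assumption (not stated in the listing): the prompt and the requested
    output must fit in the context window together.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if max_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW_TOKENS
```

Under that assumption, a request that reserves the full 65,536 output tokens leaves 983,040 tokens for the prompt.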

About this model

Ultra-fast Qwen Flash, upgraded to Qwen3.5

This model supports up to 1M tokens of context. It provides strong code generation and debugging capabilities.

Access it through Chuizi.AI with a single ck- API key; no separate Alibaba account is needed.

Highlights

1M context window
66K max output
Strong code generation

Best For

Code generation
Refactoring
Debugging
Documentation
2025-07-29

Capabilities

Chat
Code
Tools

Aliases

qwen-flash

Pricing (per 1M tokens)

Input: $0.03
Output: $0.29

Final prices shown
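
As a rough illustration of how the per-1M-token prices translate into a per-request cost, here is a small sketch. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes billing is linear in token count with no minimum charge or rounding, which the listing does not confirm.

```python
# Prices copied from the listing above, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.03
OUTPUT_USD_PER_MILLION = 0.29


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD.

    Assumption: cost scales linearly with token counts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 4,000-token reply.
    print(f"${estimate_cost_usd(200_000, 4_000):.6f}")
```

For the example in the sketch, the estimate comes to about $0.0072, with the input side accounting for $0.006 of that.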

Quick Start

main.py
from openai import OpenAI

# Point the OpenAI SDK at the Chuizi.AI endpoint.
client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",  # replace with your own ck- key
)

# Send a single-turn chat request to Qwen Flash.
response = client.chat.completions.create(
    model="qwen/qwen-flash",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
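
Because the listing names tools as a capability, a tool-calling request might look like the sketch below. It assumes the endpoint accepts the OpenAI-style `tools` parameter for this model, which this page does not spell out. The `get_weather` tool and the helpers `build_tool_request` and `print_tool_calls` are hypothetical names introduced for illustration.

```python
import json


def build_tool_request(question: str) -> dict:
    """Build keyword arguments for client.chat.completions.create."""
    # Hypothetical tool, described in the OpenAI function-tool format.
    get_weather_tool = {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Look up the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
    return {
        "model": "qwen/qwen-flash",
        "messages": [{"role": "user", "content": question}],
        "tools": [get_weather_tool],
    }


def print_tool_calls(client, question: str) -> None:
    """Send the request and print any tool calls the model asks for."""
    response = client.chat.completions.create(**build_tool_request(question))
    message = response.choices[0].message
    # tool_calls is None when the model answers directly.
    for call in message.tool_calls or []:
        print(call.function.name, json.loads(call.function.arguments))
    if message.content:
        print(message.content)
```

With the `client` from main.py, this could be invoked as `print_tool_calls(client, "What is the weather in Hangzhou?")`. Running the tool and sending its result back to the model is left out of the sketch.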
