Qwen Flash
Qwen
qwen/qwen-flash
1M context
Context Window
1.0M
1,048,576 tokens
Max Output
66K
65,536 tokens
About this model
Ultra-fast Qwen Flash, upgraded to Qwen3.5
This model supports up to 1M tokens of context. It provides strong code generation and debugging capabilities.
Access it through Chuizi.AI with a single ck- API key; no separate Alibaba account is needed.
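As a rough illustration of how the context and output limits listed above interact, the sketch below picks a safe `max_tokens` value for a request. The helper name `output_budget` is hypothetical, and the code assumes that prompt and completion tokens share the same 1,048,576-token window, which the model card does not state explicitly.

```python
# Limits taken from the model card above.
CONTEXT_WINDOW = 1_048_576  # tokens
MAX_OUTPUT = 65_536         # tokens


def output_budget(prompt_tokens: int, requested: int = MAX_OUTPUT) -> int:
    """Return a max_tokens value that respects both limits.

    Assumption (not confirmed by the model card): prompt and completion
    tokens are counted against the same context window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills the context window")
    # The tightest of the three limits wins.
    return min(requested, MAX_OUTPUT, remaining)
```

For a short prompt the output cap is the binding limit; only prompts within 65,536 tokens of the full window reduce the budget further.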
Highlights
1M context window
66K max output
Strong code generation
Best For
Code generation
Refactoring
Debugging
Documentation
2025-07-29
Capabilities
Chat
Code
Tools
Aliases
qwen-flash
Pricing (per 1M tokens)
| Token type | Price per 1M tokens |
|---|---|
| Input | $0.03 |
| Output | $0.29 |
Final prices shown
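To make the rates in the table above concrete, here is a minimal cost estimate. The function name `estimate_cost` is hypothetical, and the sketch assumes that "1M" means 1,000,000 tokens and that prices are in US dollars.

```python
from decimal import Decimal

# Rates from the pricing table above (assumed USD per 1,000,000 tokens).
INPUT_PRICE_PER_M = Decimal("0.03")
OUTPUT_PRICE_PER_M = Decimal("0.29")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request from its token counts."""
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# 200,000 input tokens and 10,000 output tokens:
# 200,000 * 0.03 / 1M + 10,000 * 0.29 / 1M = 0.006 + 0.0029 = 0.0089
print(estimate_cost(200_000, 10_000))
```

`Decimal` is used here instead of `float` so that small per-request amounts add up without binary rounding noise.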
Quick Start
main.py
from openai import OpenAI

# Point the OpenAI SDK at the Chuizi.AI endpoint.
client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",  # replace with your own ck- key
)

# Send a single-turn chat request to Qwen Flash.
response = client.chat.completions.create(
    model="qwen/qwen-flash",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
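The capabilities list mentions tools, so a tool-calling request might look like the sketch below. It assumes the endpoint accepts the OpenAI-style `tools` parameter, which this page does not confirm. The `get_weather` tool and both helper functions are hypothetical and serve only as illustration.

```python
import json


def build_tool_request(question: str) -> dict:
    """Build keyword arguments for client.chat.completions.create()."""
    return {
        "model": "qwen/qwen-flash",
        "messages": [{"role": "user", "content": question}],
        "tools": [
            {
                "type": "function",
                "function": {
                    "name": "get_weather",  # hypothetical tool, not a real API
                    "description": "Look up the current weather for a city.",
                    "parameters": {
                        "type": "object",
                        "properties": {"city": {"type": "string"}},
                        "required": ["city"],
                    },
                },
            }
        ],
    }


def requested_tool_calls(response) -> list[tuple[str, dict]]:
    """Extract (tool name, parsed arguments) pairs from a chat completion."""
    # tool_calls is None when the model answers in plain text.
    calls = response.choices[0].message.tool_calls or []
    return [(c.function.name, json.loads(c.function.arguments)) for c in calls]
```

With the `client` from the quick start, the request would be sent as `client.chat.completions.create(**build_tool_request("What is the weather in Paris?"))`, and `requested_tool_calls` applied to the result.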