Qwen3 VL Flash
Qwen
qwen/qwen3-vl-flash
Fast vision, ultra-low cost
Context Window
131K
131,072 tokens
Max Output
8K
8,192 tokens
About this model
Qwen3 VL Flash is the fast and affordable variant of the Qwen vision-language series, ideal for cost-sensitive visual understanding.
Access via Chuizi.AI with a ck- API key.
Highlights
Ultra-fast
Ultra-low cost
Visual understanding
131K context
Best For
Batch image processingSimple classificationCost-firstHigh throughput
2026-03-01
Capabilities
ChatVisiontoolspdf
Aliases
qwen3-vl-flashPricing (per 1M tokens)
| Pricing (per 1M tokens) | / 1M tokens |
|---|---|
| Input / 1M | $0.21 |
| Output / 1M | $0.63 |
Final prices shown
Quick Start
main.py
from openai import OpenAI client = OpenAI( base_url="https://api.chuizi.ai/v1", api_key="ck-your-key-here", ) response = client.chat.completions.create( model="qwen/qwen3-vl-flash", messages=[{"role": "user", "content": "Hello!"}], ) print(response.choices[0].message.content)