Qwen3 VL Flash

Qwen
qwen/qwen3-vl-flash

Fast vision, ultra-low cost

Context Window

131K

131,072 tokens

Max Output

8K

8,192 tokens

About this model

Qwen3 VL Flash is the fast and affordable variant of the Qwen vision-language series, ideal for cost-sensitive visual understanding.

Access via Chuizi.AI with a ck- API key.

Highlights

Ultra-fast
Ultra-low cost
Visual understanding
131K context

Best For

Batch image processingSimple classificationCost-firstHigh throughput
2026-03-01

Capabilities

ChatVisiontoolspdf

Aliases

qwen3-vl-flash

Pricing (per 1M tokens)

Pricing (per 1M tokens)/ 1M tokens
Input / 1M$0.21
Output / 1M$0.63

Final prices shown

Quick Start

main.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="qwen/qwen3-vl-flash",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

FAQ

Related Models

Qwen3 Vl Flash β€” Pricing, Context, Capabilities | Chuizi AI