Qwen3.5 Omni Flash

Qwen
qwen/qwen3.5-omni-flash

Qwen3.5 fast multimodal

Context Window

131K

131,072 tokens

Max Output

8K

8,192 tokens
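
The two limits above can be combined into a quick budget check before a request is sent. This is a minimal sketch, assuming the 131,072-token context window is shared by prompt and completion tokens (common for OpenAI-compatible APIs, but not confirmed on this page). `output_budget` is a hypothetical helper name, not part of any SDK.

```python
CONTEXT_WINDOW = 131_072  # tokens, as listed on this page
MAX_OUTPUT = 8_192        # tokens, as listed on this page


def output_budget(prompt_tokens: int, requested: int = MAX_OUTPUT) -> int:
    """Return a max_tokens value that respects both listed limits.

    Assumption: prompt and completion tokens share the context window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills or exceeds the context window")
    # The tightest of: caller's request, model output cap, remaining window.
    return min(requested, MAX_OUTPUT, remaining)
```

The result can be passed as `max_tokens` in a chat completion request; the prompt token count itself has to come from a tokenizer or a usage report, which this sketch does not cover.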

About this model

Qwen3.5 Omni Flash is the fast variant of the Qwen3.5 multimodal series, supporting text, image, and audio for both input and output.

It offers multimodal capabilities at very low cost, which makes it well suited to high-throughput scenarios. Access it through Chuizi.AI with a ck- API key.

Highlights

Ultra-fast
Very low cost
Full multimodal I/O
131K context

Best For

Batch processing
Voice assistants
Multimodal apps
High throughput
2026-03-01

Capabilities

Chat
Vision
Audio
Tools

Aliases

qwen3.5-omni-flash

Pricing (per 1M tokens)

Input: $0.21
Output: $0.63

Final prices shown
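
To turn the listed rates ($0.21 input and $0.63 output per 1M tokens) into a per-request or per-batch figure, the arithmetic can be sketched as below. `estimate_cost` is a hypothetical helper. It assumes billing is strictly linear in token count, with no minimum charge or separate rates for image or audio tokens, none of which this page states.

```python
INPUT_PRICE_PER_M = 0.21   # dollars per 1M input tokens, from this page
OUTPUT_PRICE_PER_M = 0.63  # dollars per 1M output tokens, from this page


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in dollars, assuming purely linear per-token pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000
```

For example, one million input tokens plus one million output tokens comes to 0.21 + 0.63 = 0.84 dollars under these assumptions.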

Quick Start

main.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="qwen/qwen3.5-omni-flash",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
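
Since the model lists Vision among its capabilities, an image can presumably be sent alongside text. The sketch below builds a user message in the OpenAI chat format with an `image_url` content part. Whether the Chuizi.AI endpoint accepts this exact shape is an assumption, and `build_image_message` is a hypothetical helper.

```python
def build_image_message(prompt: str, image_url: str) -> dict:
    """Build an OpenAI-style user message with one text part and one image part.

    Assumption: the endpoint accepts OpenAI-format multimodal content parts.
    """
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }
```

The returned dict would take the place of the plain-text message in the quick-start call, for example `messages=[build_image_message("Describe this image.", "https://example.com/cat.png")]`, where the URL is a placeholder.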
