Qwen3.5 Omni Flash

Qwen

qwen/qwen3.5-omni-flash

Qwen3.5 fast multimodal

Context Window

131K

131,072 tokens

Max Output

8,192 tokens

About this model

Qwen 3.5 Omni Flash is the fast variant of the Qwen 3.5 multimodal series, supporting text, image, and audio for both input and output.

Offers multimodal capabilities at very low cost, ideal for high-throughput scenarios. Access via Chuizi.AI with a ck- API key.

Highlights

Ultra-fast

Very low cost

Full multimodal I/O

131K context

Best For

Batch processingVoice assistantsMultimodal appsHigh throughput

2026-03-01

Capabilities

ChatVisionAudiotools

Pricing (per 1M tokens)

Pricing (per 1M tokens)	/ 1M tokens
Input / 1M	$0.210
Output / 1M	$0.630

Final prices shown

Quick Start

main.py

from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="qwen/qwen3.5-omni-flash",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

Qwen3.5 Omni Flash

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

Related Models

Qwen Max

Qwen Plus

Qwen Turbo

Qwen2.5 Coder 32b

Qwen VL Max

Qwen3 Max

Qwen3.5 Omni Flash

About this model

Highlights

Best For

Capabilities

Pricing (per 1M tokens)

Quick Start

FAQ

How do I get an API Key?

How does billing work?

What payment methods are supported?

Are there rate limits?

Related Models

Qwen Max

Qwen Plus

Qwen Turbo

Qwen2.5 Coder 32b

Qwen VL Max

Qwen3 Max