GPT 4o Audio Preview

OpenAI

openai/gpt-4o-audio-preview

Speech-to-text model

Context Window

128K

128,000 tokens

Max Output

16K

16,384 tokens

About this model

A multimodal model that supports audio input and output.

This model includes native vision understanding for analyzing images and documents.

Access it through Chuizi.AI with a single ck- API key; no separate OpenAI account is needed.

Highlights

Native vision support

Multi-cloud failover

Unified billing via Chuizi.AI

Best For

Image analysis

Document OCR

Visual Q&A

Multimodal chat

2024-10-01

Capabilities

Chat

Vision

Audio

Tools

Pricing (per 1M tokens)

Token type	Price per 1M tokens
Input	$2.63
Output	$10.50

Final prices shown
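As a rough illustration of how the per-1M-token prices above translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost` and the example token counts are hypothetical, and it assumes every token is billed at the flat input or output rate; actual billing may count audio tokens differently.

```python
from decimal import Decimal

# Prices listed above, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("2.63")
OUTPUT_PRICE_PER_M = Decimal("10.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts (hypothetical helper)."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


# Hypothetical request: 2,000 input tokens and 500 output tokens.
print(estimate_cost(2_000, 500))
```

Using `Decimal` instead of floats keeps the arithmetic exact, which matters when many small per-request amounts are summed.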

Quick Start

main.py

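from openai import OpenAI

# Point the OpenAI SDK at the Chuizi.AI endpoint and authenticate with a ck- key.
client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",  # replace with your own key
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="openai/gpt-4o-audio-preview",
    messages=[{"role": "user", "content": "Hello!"}],
)
# Print the text of the first returned choice.
print(response.choices[0].message.content)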
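Because this is an audio model, a request that actually carries audio may be more representative than the text-only greeting. The following is a minimal sketch, assuming the Chuizi.AI endpoint forwards the `input_audio` content part and the `modalities` parameter of the OpenAI Chat Completions API unchanged; this page does not confirm that. The helper names `build_audio_message` and `transcribe` and the file name `sample.wav` are hypothetical.

```python
import base64


def build_audio_message(wav_bytes: bytes, prompt: str) -> dict:
    """Build a user message holding a text prompt plus base64-encoded WAV audio."""
    encoded = base64.b64encode(wav_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "input_audio", "input_audio": {"data": encoded, "format": "wav"}},
        ],
    }


def transcribe(path: str) -> str:
    """Send a WAV file to the model and return its text reply (hypothetical helper)."""
    # Imported here so build_audio_message can be used without the SDK installed.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.chuizi.ai/v1",
        api_key="ck-your-key-here",
    )
    with open(path, "rb") as audio_file:
        message = build_audio_message(audio_file.read(), "Transcribe this recording.")

    # Assumption: the gateway accepts `modalities`; ["text"] requests a text-only reply.
    response = client.chat.completions.create(
        model="openai/gpt-4o-audio-preview",
        modalities=["text"],
        messages=[message],
    )
    return response.choices[0].message.content
```

Calling `transcribe("sample.wav")` with a real recording and a valid key should return the model's text reply, provided the assumptions above hold.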

FAQ

How do I get an API Key?

How does billing work?

What payment methods are supported?

Are there rate limits?

How is this different from the official OpenAI API?

Does vision (image input) work?

Are o3/o4-mini reasoning models supported?

Related Models

GPT 4.1

GPT 4.1 Mini

GPT 4.1 Nano

GPT 4o

GPT 4o Mini

O3