GPT 4o Audio Preview

OpenAI
openai/gpt-4o-audio-preview

Speech-to-text model

Context Window

128K

128,000 tokens

Max Output

16K

16,384 tokens

About this model

Multimodal model supporting audio input and output

This model includes native vision understanding for analyzing images and documents.

Access it through Chuizi.AI with a single ck- API key β€” no separate OpenAI account needed.

Highlights

Native vision support
Multi-cloud failover
Unified billing via Chuizi.AI

Best For

Image analysisDocument OCRVisual Q&AMultimodal chat
2024-10-01

Capabilities

ChatVisionAudiotools

Aliases

gpt-4o-audio-preview

Pricing (per 1M tokens)

Pricing (per 1M tokens)/ 1M tokens
Input / 1M$2.63
Output / 1M$10.50

Final prices shown

Quick Start

main.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.chuizi.ai/v1",
    api_key="ck-your-key-here",
)

response = client.chat.completions.create(
    model="openai/gpt-4o-audio-preview",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

FAQ

Related Models

GPT 4o Audio Preview β€” Pricing, Context, Capabilities | Chuizi AI