GPT 4o Audio Preview
OpenAI
openai/gpt-4o-audio-preview
Speech-to-text model
Context Window
128K
128,000 tokens
Max Output
16K
16,384 tokens
About this model
Multimodal model supporting audio input and output
This model includes native vision understanding for analyzing images and documents.
Access it through Chuizi.AI with a single ck- API key β no separate OpenAI account needed.
Highlights
Native vision support
Multi-cloud failover
Unified billing via Chuizi.AI
Best For
Image analysisDocument OCRVisual Q&AMultimodal chat
2024-10-01
Capabilities
ChatVisionAudiotools
Aliases
gpt-4o-audio-previewPricing (per 1M tokens)
| Pricing (per 1M tokens) | / 1M tokens |
|---|---|
| Input / 1M | $2.63 |
| Output / 1M | $10.50 |
Final prices shown
Quick Start
main.py
from openai import OpenAI client = OpenAI( base_url="https://api.chuizi.ai/v1", api_key="ck-your-key-here", ) response = client.chat.completions.create( model="openai/gpt-4o-audio-preview", messages=[{"role": "user", "content": "Hello!"}], ) print(response.choices[0].message.content)