Model Pricing

Prices below are upstream provider costs. Your actual bill is the displayed price × 1.05, that is, 5% above the listed price. All token prices are in USD per 1 million tokens.
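As a rough illustration of the billing rule above, the following Python sketch estimates a bill from token counts. The names estimate_bill, BILLING_MULTIPLIER, and TOKENS_PER_UNIT are hypothetical and chosen for this example only; they are not part of any API. The prices used are the GPT-4.1 rates listed in the OpenAI table on this page.

```python
# Illustrative only: these names are hypothetical, not part of any SDK or API.
BILLING_MULTIPLIER = 1.05     # actual bill = displayed price x 1.05
TOKENS_PER_UNIT = 1_000_000   # displayed prices are per 1 million tokens


def estimate_bill(input_tokens, output_tokens, input_price, output_price):
    """Return the estimated bill in USD for one model.

    input_price and output_price are the displayed $/1M-token prices.
    """
    upstream_cost = (
        input_tokens / TOKENS_PER_UNIT * input_price
        + output_tokens / TOKENS_PER_UNIT * output_price
    )
    return upstream_cost * BILLING_MULTIPLIER


# GPT-4.1 at $2.00 input / $8.00 output: 1M input tokens and 500K output tokens.
bill = estimate_bill(1_000_000, 500_000, 2.00, 8.00)
print(f"${bill:.2f}")  # prints $6.30
```

Here the upstream cost is $2.00 + $4.00 = $6.00, and the multiplier brings the bill to $6.30.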

For the full model list and real-time pricing, visit chuizi.ai/models.

OpenAI

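| Model | Input ($/1M) | Output ($/1M) | Context Window |
|---|---|---|---|
| GPT-4.1 | $2.00 | $8.00 | 1M |
| GPT-4.1-mini | $0.40 | $1.60 | 1M |
| GPT-4.1-nano | $0.10 | $0.40 | 1M |
| GPT-4o | $2.50 | $10.00 | 128K |
| GPT-4o-mini | $0.15 | $0.60 | 128K |
| o3 | $2.00 | $8.00 | 200K |
| o4-mini | $1.10 | $4.40 | 200K |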

Anthropic

| Model | Input ($/1M) | Output ($/1M) | Context Window |
|---|---|---|---|
| Claude Opus 4-6 | $15.00 | $75.00 | 200K |
| Claude Sonnet 4-6 | $3.00 | $15.00 | 200K |
| Claude Haiku 4-5 | $1.00 | $5.00 | 200K |

Anthropic models support prompt caching. The cache_read price is approximately 10% of the input price. See Cache Discount Pricing for details.
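To make the cache discount concrete, the sketch below compares the upstream input cost of a request with and without cache reads. It is a minimal example under stated assumptions: the helper input_cost_with_cache and the constant CACHE_READ_RATIO are hypothetical names, the 10% ratio is the approximate figure quoted above, cache-write charges are not modeled, and the 1.05 billing multiplier is not applied.

```python
# Illustrative only: hypothetical names; 0.10 is the approximate ratio quoted above.
CACHE_READ_RATIO = 0.10  # cache_read price is roughly 10% of the input price


def input_cost_with_cache(uncached_tokens, cache_read_tokens, input_price):
    """Return the upstream input cost in USD.

    input_price is the displayed $/1M input price.
    Cache-write charges are not modeled here.
    """
    cache_read_price = input_price * CACHE_READ_RATIO
    total = uncached_tokens * input_price + cache_read_tokens * cache_read_price
    return total / 1_000_000


# Claude Sonnet 4-6 ($3.00 input): 200K fresh tokens plus 800K tokens read from cache.
with_cache = input_cost_with_cache(200_000, 800_000, 3.00)
without_cache = input_cost_with_cache(1_000_000, 0, 3.00)
print(f"with cache: ${with_cache:.2f}, without cache: ${without_cache:.2f}")
```

Under these assumptions the same 1M input tokens cost about $0.84 with the cache versus $3.00 without it.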

Google

| Model | Input ($/1M) | Output ($/1M) | Context Window |
|---|---|---|---|
| Gemini 2.5 Pro | $1.25 | $10.00 | 1M |
| Gemini 2.5 Flash | $0.15 | $0.60 | 1M |
| Gemini 2.0 Flash | $0.10 | $0.40 | 1M |

DeepSeek

| Model | Input ($/1M) | Output ($/1M) | Context Window |
|---|---|---|---|
| DeepSeek V3.2 | $0.28 | $0.42 | 128K |
| DeepSeek R1 | $0.55 | $2.19 | 128K |
| DeepSeek Chat | $0.28 | $0.42 | 128K |

DeepSeek models enable disk caching automatically. Input tokens billed as cache_read cost approximately 90% less than regular input tokens.
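Because the caching is automatic, the saving depends on how many input tokens end up being served from cache. The sketch below estimates a blended input price for an assumed cache hit ratio. The helper effective_input_price and the constant CACHE_READ_SAVING are hypothetical names, the 90% figure is the approximate saving quoted above, and the hit ratios are made-up inputs for illustration, not measured values.

```python
# Illustrative only: hypothetical helper; 0.90 is the approximate saving quoted above.
CACHE_READ_SAVING = 0.90


def effective_input_price(input_price, cache_hit_ratio):
    """Return the blended $/1M input price.

    cache_hit_ratio is the fraction (0.0 to 1.0) of input tokens billed as cache_read.
    """
    if not 0.0 <= cache_hit_ratio <= 1.0:
        raise ValueError("cache_hit_ratio must be between 0 and 1")
    return input_price * (1.0 - CACHE_READ_SAVING * cache_hit_ratio)


# DeepSeek V3.2 ($0.28 input) at a few assumed cache hit ratios.
for ratio in (0.0, 0.5, 1.0):
    price = effective_input_price(0.28, ratio)
    print(f"hit ratio {ratio:.0%}: ${price:.3f} per 1M input tokens")
```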

Next Steps