Billing Model
Chuizi.AI charges the upstream provider cost × 1.05 (a 5% markup) with no hidden fees. All billing calculations use 8-decimal precision to ensure financial accuracy.
Three Billing Types
| Billing Type | Use Case | Unit |
|---|---|---|
| Per-token | Chat, Embedding, and text models | Per 1M tokens |
| Per-request | Image generation, TTS | Per request |
| Per-second | Speech-to-text (Whisper) | Per second of audio |
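The three billing types in the table can be sketched as one small function. This is an illustration under stated assumptions, not Chuizi.AI's implementation: the names `BillingType` and `billed_cost` and the prices in the usage lines are hypothetical. Only the 1.05 multiplier and the billing units come from this page.

```python
from decimal import Decimal
from enum import Enum

MARKUP = Decimal("1.05")                    # multiplier stated on this page
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)  # per-token prices are quoted per 1M tokens


class BillingType(Enum):  # hypothetical names for the three billing types
    PER_TOKEN = "per_token"
    PER_REQUEST = "per_request"
    PER_SECOND = "per_second"


def billed_cost(billing_type: BillingType, quantity: Decimal, unit_price: Decimal) -> Decimal:
    """Return upstream cost x 1.05 for one usage quantity.

    quantity is tokens, requests, or seconds of audio, depending on the type.
    """
    if billing_type is BillingType.PER_TOKEN:
        # Convert a per-1M-token price into a cost for the given token count.
        upstream = quantity * unit_price / TOKENS_PER_PRICE_UNIT
    else:
        # Per-request and per-second prices are already quoted per single unit.
        upstream = quantity * unit_price
    return upstream * MARKUP


# Illustrative prices only; see Model Pricing for real values.
image_cost = billed_cost(BillingType.PER_REQUEST, Decimal(1), Decimal("0.04"))
audio_cost = billed_cost(BillingType.PER_SECOND, Decimal(90), Decimal("0.0001"))
```

For per-token models, input and output tokens are priced separately, so a sketch like this would be called once for each and the results summed.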
Per-Token Billing
Most models (Chat, Reasoning, Embedding) use per-token billing. Input tokens and output tokens are priced separately.
Formula:
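cost = (input_tokens × input_price + output_tokens × output_price) / 1,000,000 × 1.05
Here input_price and output_price are the prices per 1M tokens, which is why the sum is divided by 1,000,000.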
Example (Claude Sonnet 4-6):
| Item | Count | Price (per 1M tokens) | Subtotal |
|---|---|---|---|
| Input tokens | 2,000 | $3.00 | $0.006000 |
| Output tokens | 500 | $15.00 | $0.007500 |
| Subtotal | — | — | $0.013500 |
| Total (× 1.05 multiplier) | — | — | $0.014175 |
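The worked example above can be reproduced with Python's `decimal` module, which avoids binary floating-point error. This is a minimal sketch: the function name `per_token_cost` is hypothetical, and the `ROUND_HALF_UP` rounding mode is an assumption, since this page specifies 8-decimal precision but not how values are rounded.

```python
from decimal import Decimal, ROUND_HALF_UP

MARKUP = Decimal("1.05")               # multiplier stated on this page
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens
EIGHT_PLACES = Decimal("0.00000001")   # 8-decimal precision


def per_token_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: Decimal,
    output_price: Decimal,
) -> Decimal:
    """Per-token cost with prices given per 1M tokens, rounded to 8 decimals."""
    subtotal = (
        input_tokens * input_price + output_tokens * output_price
    ) / TOKENS_PER_PRICE_UNIT
    # Rounding mode is an assumption; the page only states the precision.
    return (subtotal * MARKUP).quantize(EIGHT_PLACES, rounding=ROUND_HALF_UP)


# Values from the example table: 2,000 input tokens at $3.00, 500 output at $15.00.
cost = per_token_cost(2000, 500, Decimal("3.00"), Decimal("15.00"))
print(cost)  # 0.01417500
```

Prices are passed as strings to `Decimal` so that values such as 3.00 and 15.00 are represented exactly. Cache tokens are not covered by this sketch.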
If the request uses prompt caching, the cost also includes cache_read_tokens and cache_write_tokens. See Cache Discount Pricing for details.
Next Steps
- Cache Discount Pricing — Save up to 90% on input costs with prompt caching
- Billing Flow — How post-pay billing and overdraft protection work
- Model Pricing — Complete price table for all supported models