Billing Model

Chuizi.AI charges upstream provider cost x 1.05 with no hidden fees. All billing calculations use 8-decimal precision to ensure financial accuracy.

Three Billing Types

Billing TypeUse CaseUnit
Per-tokenChat, Embedding, and text modelsPer 1M tokens
Per-requestImage generation, TTSPer request
Per-secondSpeech-to-text (Whisper)Per second of audio

Per-Token Billing

Most models (Chat, Reasoning, Embedding) use per-token billing. Input tokens and output tokens are priced separately.

Formula:

cost = (input_tokens x input_price + output_tokens x output_price) x 1.05

Example — Claude Sonnet 4-6:

ItemCountPrice (per 1M tokens)Subtotal
Input tokens2,000$3.00$0.006000
Output tokens500$15.00$0.007500
Subtotal$0.013500
x 1.05 multiplier$0.014175

If the request uses prompt caching, the cost also includes cache_read_tokens and cache_write_tokens. See Cache Discount Pricing for details.

Next Steps