Troubleshooting FAQ

How do I fix 429 Too Many Requests?

A 429 means you hit the rate limit. Chuizi.AI uses sliding window rate limiting with these defaults:

  • Global — 60 RPM (60 requests per minute)
  • Per model — 30 RPM (30 requests per minute)

How to fix it:

  1. Check the Retry-After header in the response and wait the specified number of seconds
  2. Implement an exponential backoff strategy
  3. If you need higher limits, adjust rpm_limit in the API Key settings in your Dashboard
  4. Spread requests across time windows to avoid bursts

What should I do about request timeouts?

Timeouts typically have a few causes:

  • Slow model processing — large models (o3, Opus) can take significant time on complex inputs. Set a longer timeout value, or use stream: true for incremental output
  • Network issues — check your connectivity to api.chuizi.ai
  • Upstream provider congestion — upstream responses may slow down during peak hours. Chuizi.AI automatically tries backup channels

Recommended settings:

timeout: 120000  // Set 2+ minutes for reasoning models
stream: true     // Streaming gives you the first token faster

Why am I getting empty responses?

Common causes of empty responses:

  1. max_tokens set too low — the model does not have enough space to generate content. Increase max_tokens (at least 1024 recommended)
  2. stop parameter triggers too early — check if your stop sequences conflict with expected output
  3. Content safety filter — the model determined the output might violate safety policies and chose not to generate. Modify your prompt
  4. Try stream: true — streaming mode lets you see if the model produced any partial output

Next Steps