Troubleshooting FAQ
How do I fix 429 Too Many Requests?
A 429 means you hit the rate limit. Chuizi.AI uses sliding window rate limiting with these defaults:
- Global — 60 RPM (60 requests per minute)
- Per model — 30 RPM (30 requests per minute)
How to fix it:
- Check the
Retry-Afterheader in the response and wait the specified number of seconds - Implement an exponential backoff strategy
- If you need higher limits, adjust
rpm_limitin the API Key settings in your Dashboard - Spread requests across time windows to avoid bursts
What should I do about request timeouts?
Timeouts typically have a few causes:
- Slow model processing — large models (o3, Opus) can take significant time on complex inputs. Set a longer
timeoutvalue, or usestream: truefor incremental output - Network issues — check your connectivity to
api.chuizi.ai - Upstream provider congestion — upstream responses may slow down during peak hours. Chuizi.AI automatically tries backup channels
Recommended settings:
timeout: 120000 // Set 2+ minutes for reasoning models stream: true // Streaming gives you the first token faster
Why am I getting empty responses?
Common causes of empty responses:
max_tokensset too low — the model does not have enough space to generate content. Increasemax_tokens(at least 1024 recommended)stopparameter triggers too early — check if your stop sequences conflict with expected output- Content safety filter — the model determined the output might violate safety policies and chose not to generate. Modify your prompt
- Try
stream: true— streaming mode lets you see if the model produced any partial output
Next Steps
- Error Codes — Full error code reference with detailed troubleshooting
- Rate Limits — Understand the three-tier rate limiting system
- Error Handling Guide — Implement retries and backoff in your code