Upstream Providers

Five infrastructure providers (Azure, Bedrock, Vertex, Bailian, Volcengine) serve the catalog of more than 200 models, giving it geographic diversity and cross-vendor redundancy.

Provider Overview

Provider                      Models Routed   Regions           Typical Latency
Azure (OpenAI + MaaS)         ~30             Global            80-120 ms
Amazon Bedrock                ~50             US multi-region   100-150 ms
Google Vertex AI              ~18             US                90-130 ms
Alibaba Bailian (DashScope)   ~45             Asia-Pacific      50-80 ms
Volcengine (ByteDance)        3               Asia-Pacific      50-80 ms
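
For readers who want to reason about the table programmatically, the sketch below encodes the same rows as plain data and shows two simple queries: filtering by region and ordering by typical latency. It is only an illustration. The names Provider, PROVIDERS, in_region, and by_typical_latency are hypothetical and are not part of the gateway's API, and the midpoint ordering is an assumption made for the example, not a description of how the gateway selects providers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    """One row of the provider overview table (hypothetical structure)."""
    name: str
    models_routed: int            # approximate count, as listed in the table
    region: str
    latency_ms: tuple[int, int]   # (low, high) typical latency in milliseconds

    @property
    def latency_midpoint(self) -> float:
        low, high = self.latency_ms
        return (low + high) / 2


# Values copied from the table above; nothing here is measured.
PROVIDERS = [
    Provider("Azure (OpenAI + MaaS)", 30, "Global", (80, 120)),
    Provider("Amazon Bedrock", 50, "US multi-region", (100, 150)),
    Provider("Google Vertex AI", 18, "US", (90, 130)),
    Provider("Alibaba Bailian (DashScope)", 45, "Asia-Pacific", (50, 80)),
    Provider("Volcengine (ByteDance)", 3, "Asia-Pacific", (50, 80)),
]


def in_region(providers, keyword):
    """Return providers whose region label contains the keyword (case-insensitive)."""
    return [p for p in providers if keyword.lower() in p.region.lower()]


def by_typical_latency(providers):
    """Order providers by the midpoint of their typical latency range (assumed metric)."""
    return sorted(providers, key=lambda p: p.latency_midpoint)


if __name__ == "__main__":
    for p in by_typical_latency(PROVIDERS):
        print(f"{p.name}: {p.latency_ms[0]}-{p.latency_ms[1]} ms ({p.region})")
```

Calling in_region(PROVIDERS, "US") returns the Bedrock and Vertex rows, and by_typical_latency(PROVIDERS) places the two Asia-Pacific providers first because they share the lowest listed range.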

Next Steps

  • Provider Routing — How the gateway selects providers for each request
  • Failover — Automatic rerouting when a provider is down
  • Circuit Breaker — Health monitoring and recovery for upstream providers