RouteHostsPer-token priceReal savings come from
Claude only
Published list price
Volume tiers, prompt caching API, batch API (50% off), latest features ship here first
Claude, Cohere, Mistral, AI21, Llama, Amazon Titan
Same as native list price for each model
AWS EDP credits, Provisioned Throughput, region availability, no Anthropic features that have not been ported
Claude, Gemini, PaLM, third-party
Same as native list for each model
GCP Committed Use Discounts, regional residency, Gemini-only features (Google Search grounding, etc.)
Azure (AI Foundry / Azure OpenAI)
source →OpenAI, Llama, Mistral, Cohere, Phi — no Anthropic, no Gemini
Same as OpenAI list; sometimes a small premium
Microsoft EA / MACC commitments, PTUs, enterprise procurement, US-Government regions
Anthropic, OpenAI, Google, open-source — many providers behind one API
Native list + ~5% markup
One-API convenience, no commitment, useful for spike traffic — worse for steady-state cost