๐๏ธ Router - Load Balancing
LiteLLM manages:
๐๏ธ [BETA] Request Prioritization
Beta feature. Use for testing only.
๐๏ธ Proxy - Load Balancing
Load balance multiple instances of the same model
๐๏ธ Fallbacks
If a call fails after num_retries, fallback to another model group.
๐๏ธ Timeouts
The timeout set in router is for the entire length of the call, and is passed down to the completion() call level as well.
๐๏ธ Tag Based Routing
Route requests based on tags.
๐๏ธ Budget Routing
LiteLLM Supports setting the following budgets:
๐๏ธ Provider specific Wildcard routing
Proxy all models from a provider