📄️ Quick Start
Quick start CLI, Config, Docker
📄️ Proxy Config.yaml
Set model list, api_base, api_key, temperature & proxy server settings (master-key) on the config.yaml.
📄️ Embeddings - /embeddings
See supported Embedding Providers & Models here
📄️ Load Balancing - Multiple Instances of 1 model
Load balance multiple instances of the same model
📄️ Key Management
Track Spend and create virtual keys for the proxy
📄️ Model Management
Add new models + Get model info without restarting proxy.
📄️ Fallbacks, Retries, Timeouts, Cooldowns
If a call fails after num_retries, fall back to another model group.
📄️ Health Checks
Use this to health check all LLMs defined in your config.yaml
📄️ Modify Incoming Data
Modify data just before making litellm completion calls on the proxy
📄️ Caching
Cache LLM Responses
📄️ Logging - Custom Callbacks, OpenTelemetry, Langfuse, Sentry
Log Proxy Input, Output, Exceptions using Custom Callbacks, Langfuse, OpenTelemetry, DynamoDB
📄️ CLI Arguments
CLI arguments: --host, --port, --num_workers
📄️ 🐳 Docker, Deploying LiteLLM Proxy
Dockerfile