Providers | liteLLM

📄️ Integrate as a Model Provider

Quick Start for OpenAI-Compatible Providers

📄️ Add OpenAI-Compatible Provider (JSON)

For simple OpenAI-compatible providers (like Hyperbolic, Nscale, etc.), you can add support by editing a single JSON file.

📄️ Add Model Pricing & Context Window

To add pricing or context window information for a model, simply make a PR to this file:

📄️ OpenAI (Text Completion)

LiteLLM supports OpenAI text completion models

📄️ OpenAI-Compatible Endpoints

Selecting openai as the provider routes your request to an OpenAI-compatible endpoint using the upstream

📄️ AWS Sagemaker

LiteLLM supports All Sagemaker Huggingface Jumpstart Models

📄️ LiteLLM Proxy (LLM Gateway)

| Property | Details |

📄️ AI21

LiteLLM supports the following AI21 models:

📄️ Aleph Alpha

LiteLLM supports all models from Aleph Alpha.

📄️ Baseten

LiteLLM supports both Baseten Model APIs and dedicated deployments with automatic routing.

📄️ Bytez

LiteLLM supports all chat models on Bytez!

📄️ Cerebras

https://inference-docs.cerebras.ai/api-reference/chat-completions

📄️ Clarifai

Anthropic, OpenAI, Qwen, xAI, Gemini and most of Open soured LLMs are Supported on Clarifai.

📄️ Cloudflare Workers AI

https://developers.cloudflare.com/workers-ai/models/text-generation/

📄️ Codestral API [Mistral AI]

Codestral is available in select code-completion plugins but can also be queried directly. See the documentation for more details.

LiteLLM supports all AI models from CometAPI. CometAPI provides access to 500+ AI models through a unified API interface, including cutting-edge models like GPT-5, Claude Opus 4.1, and various other state-of-the-art language models.

📄️ CompactifAI

https://docs.compactif.ai/

📄️ Custom API Server (Custom Format)

Call your custom torch-serve / internal LLM APIs via LiteLLM

📄️ Dashscope API (Qwen models)

https://dashscope.console.aliyun.com/

📄️ Databricks

LiteLLM supports all models on Databricks

📄️ DataRobot

LiteLLM supports all models from DataRobot. Select datarobot as the provider to route your request through the datarobot OpenAI-compatible endpoint using the upstream official OpenAI Python API library.

📄️ Deepgram

LiteLLM supports Deepgram's /listen endpoint.

📄️ DeepInfra

https://deepinfra.com/

📄️ Deepseek

https://deepseek.com/

📄️ Docker Model Runner

Overview

📄️ ElevenLabs

ElevenLabs provides high-quality AI voice technology, including speech-to-text capabilities through their transcription API.

📄️ Fal AI

Fal AI provides fast, scalable access to state-of-the-art image generation models including FLUX, Stable Diffusion, Imagen, and more.

📄️ Featherless AI

https://featherless.ai/

📄️ Fireworks AI

We support ALL Fireworks AI models, just set fireworks_ai/ as a prefix when sending completion requests

📄️ FriendliAI

We support ALL FriendliAI models, just set friendliai/ as a prefix when sending completion requests

📄️ Galadriel

https://docs.galadriel.com/api-reference/chat-completion-API

📄️ Github

https://github.com/marketplace/models

📄️ GitHub Copilot

https://docs.github.com/en/copilot

📄️ GMI Cloud

Overview

📄️ ChatGPT Subscription

Use ChatGPT Pro/Max subscription models through LiteLLM with OAuth device flow authentication.

📄️ GradientAI

https://digitalocean.com/products/gradientai

📄️ Groq

https://groq.com/

📄️ Helicone

Overview

📄️ Heroku

Provision a Model

🗃️ HuggingFace

2 items

📄️ Hyperbolic

Overview

📄️ Infinity

| Property | Details |

📄️ Jina AI

https://jina.ai/embeddings/

📄️ Lambda AI

Overview

📄️ LangGraph

Call LangGraph agents through LiteLLM using the OpenAI chat completions format.

📄️ Lemonade

Lemonade Server is an OpenAI-compatible local language model inference provider optimized for AMD GPUs and NPUs. The lemonade litellm provider supports standard chat completions with full OpenAI API compatibility.