LiteLLM Operator

A Kubernetes operator for deploying and managing production-ready LiteLLM AI Gateway instances. Built with Operator SDK for OLM integration, OperatorHub distribution, and first-class OpenShift support.

Replaces manual Helm-based deployments with a declarative, reconciliation-based approach that keeps CRD state and the LiteLLM API in sync.

Features

Declarative LiteLLM deployment — manage proxy instances, models, teams, users, and API keys as Kubernetes custom resources
Bidirectional config sync — reconciles CRD state with the LiteLLM REST API on every sync interval
Team member management — three modes: crd (CRD authoritative), sso (IdP authoritative), mixed (additive)
VirtualKey secret management — generated API keys are stored in Kubernetes Secrets with owner references for automatic cleanup
SSO/SCIM support — configure Azure Entra ID, Okta, Google, or generic OIDC providers declaratively
Production-ready — HPA, PDB, NetworkPolicy, health checks, resource limits, security contexts
Multiple install methods — OLM bundles (OperatorHub/OpenShift) or Helm chart

Custom Resource Definitions

CRD	Short Name	Description
`LiteLLMInstance`	`li`	Deploys a LiteLLM proxy with database, Redis, networking, and SSO
`LiteLLMModel`	`lm`	Registers a model (e.g., `openai/gpt-4o`) with the proxy
`LiteLLMTeam`	`lt`	Creates a team with budget limits and member management
`LiteLLMUser`	`lu`	Creates a user (service accounts, bot users, non-SSO environments)
`LiteLLMVirtualKey`	`lk`	Generates an API key scoped to a team/user with budget and rate limits

All secondary resources (LiteLLMModel, LiteLLMTeam, LiteLLMUser, LiteLLMVirtualKey) reference a LiteLLMInstance via spec.instanceRef.

Prerequisites

Go 1.22+
Docker 17.03+
kubectl v1.28+
Access to a Kubernetes v1.28+ cluster
A PostgreSQL database for LiteLLM state storage

Quick Start

1. Install CRDs

make install

2. Deploy the operator

make deploy IMG=ghcr.io/palenaai/litellm-operator:latest

3. Create a database secret

kubectl create secret generic litellm-db-credentials \
  --from-literal=DATABASE_URL='postgresql://user:pass@host:5432/litellm'

4. Deploy a LiteLLM instance

apiVersion: litellm.palena.ai/v1alpha1
kind: LiteLLMInstance
metadata:
  name: my-gateway
spec:
  replicas: 2
  masterKey:
    autoGenerate: true
  database:
    external:
      connectionSecretRef:
        name: litellm-db-credentials
        key: DATABASE_URL
  service:
    type: ClusterIP
    port: 4000

5. Register a model

apiVersion: litellm.palena.ai/v1alpha1
kind: LiteLLMModel
metadata:
  name: gpt4o
spec:
  instanceRef:
    name: my-gateway
  modelName: gpt-4o
  litellmParams:
    model: openai/gpt-4o
    apiKeySecretRef:
      name: openai-credentials
      key: OPENAI_API_KEY

6. Create a team and API key

apiVersion: litellm.palena.ai/v1alpha1
kind: LiteLLMTeam
metadata:
  name: engineering
spec:
  instanceRef:
    name: my-gateway
  teamAlias: engineering
  models: [gpt-4o]
  maxBudgetMonthly: 1000
  budgetDuration: "30d"
  members:
    - email: dev@example.com
      role: user
---
apiVersion: litellm.palena.ai/v1alpha1
kind: LiteLLMVirtualKey
metadata:
  name: eng-ci-key
spec:
  instanceRef:
    name: my-gateway
  keyAlias: eng-ci-key
  teamRef:
    name: engineering
  models: [gpt-4o]
  maxBudget: "100"

The generated API key is stored in a Secret (default name: {name}-key):

kubectl get secret eng-ci-key-key -o jsonpath='{.data.api-key}' | base64 -d

Installation Methods

Direct (Makefile)

make install       # Install CRDs
make deploy        # Deploy operator

OLM (OpenShift / clusters with OLM)

operator-sdk run bundle ghcr.io/palenaai/litellm-operator-bundle:v0.5.0

Helm

helm install litellm-operator deploy/charts/litellm-operator/

Development

Build

make build                    # Build operator binary
make docker-build IMG=...     # Build container image

Test

make test          # Unit + integration tests (envtest)
make test-e2e      # End-to-end tests (requires cluster)

Generate

make generate      # DeepCopy functions
make manifests     # CRD YAMLs, RBAC, webhooks

Run locally (against current kubeconfig cluster)

make install       # Install CRDs first
make run           # Run operator outside the cluster

Architecture

Key design points:

LiteLLMInstance controller manages Deployment, ConfigMap, Service, Secrets, Ingress, HPA, PDB, NetworkPolicy, and migration Jobs
Secondary controllers (Model, Team, User, VirtualKey) resolve their instanceRef to discover the LiteLLM API endpoint and master key, then sync state via the REST API
Finalizers ensure cleanup: deleting a CRD calls the corresponding LiteLLM API delete endpoint before removing the Kubernetes resource
Spec hash annotations (litellm.palena.ai/sync-hash) enable change detection to avoid unnecessary API calls

Project Structure

api/v1alpha1/          CRD type definitions
internal/controller/   Reconciliation controllers
internal/litellm/      LiteLLM REST API client
internal/resources/    Kubernetes resource generators
config/crd/bases/      Generated CRD manifests
config/samples/        Example custom resources
bundle/                OLM bundle manifests
deploy/charts/         Helm chart

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
api/v1alpha1		api/v1alpha1
cmd		cmd
config		config
docs		docs
hack		hack
internal		internal
test		test
.dockerignore		.dockerignore
.gitignore		.gitignore
.golangci.yml		.golangci.yml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
PROJECT		PROJECT
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LiteLLM Operator

Features

Custom Resource Definitions

Prerequisites

Quick Start

1. Install CRDs

2. Deploy the operator

3. Create a database secret

4. Deploy a LiteLLM instance

5. Register a model

6. Create a team and API key

Installation Methods

Direct (Makefile)

OLM (OpenShift / clusters with OLM)

Helm

Development

Build

Test

Generate

Run locally (against current kubeconfig cluster)

Architecture

Project Structure

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LiteLLM Operator

Features

Custom Resource Definitions

Prerequisites

Quick Start

1. Install CRDs

2. Deploy the operator

3. Create a database secret

4. Deploy a LiteLLM instance

5. Register a model

6. Create a team and API key

Installation Methods

Direct (Makefile)

OLM (OpenShift / clusters with OLM)

Helm

Development

Build

Test

Generate

Run locally (against current kubeconfig cluster)

Architecture

Project Structure

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages