LiteLLM with Advanced PII Detection Guardrails

A production-ready LiteLLM deployment featuring dual PII detection systems (regex-based + AI-powered) with automated CI/CD pipeline for containerized deployment to AWS ECR.

What This Repository Does

This repository provides a security-enhanced LiteLLM proxy that automatically detects and blocks Personally Identifiable Information (PII) in both user inputs and AI model responses, packaged as production-ready container images.

Key Capabilities

Dual PII Protection: Fast regex + comprehensive AI-based detection
Automated CI/CD: GitHub Actions → ECR → Infrastructure deployment
Production Ready: Multi-platform containers with security hardening
Local Development: Complete development environment with testing tools

Architecture Overview

┌─────────────────────────────────────────────────────────────────┐
│                    LiteLLM Proxy Container                      │
├─────────────────────────────────────────────────────────────────┤
│  ┌─────────────────┐    ┌─────────────────┐                    │
│  │   Regex-Based   │    │  Presidio AI    │                    │
│  │ PII Guardrails  │    │ PII Guardrails  │                    │
│  │                 │    │                 │                    │
│  │ • Email         │    │ • 50+ Entities  │                    │
│  │ • SSN           │    │ • ML-Powered    │                    │
│  │ • Phone         │    │ • Context-Aware │                    │
│  │ • Credit Cards  │    │ • Confidence    │                    │
│  └─────────────────┘    └─────────────────┘                    │
│                                                                 │
│  Pre-call & Post-call Protection for Complete Coverage         │
└─────────────────────────────────────────────────────────────────┘

PII Detection Systems

1. Regex-Based Guardrails (Fast & Reliable)

Email addresses: user@domain.com, user+tag@domain.org
Social Security Numbers: 123-45-6789, 123 45 6789, 123456789
Phone numbers: (555) 123-4567, 555-123-4567, +1 555 123 4567
Credit card numbers: Visa, MasterCard, Amex, Discover patterns
Performance: Sub-millisecond detection
Use Case: High-throughput scenarios requiring fast response

2. Microsoft Presidio AI Guardrails (Comprehensive & Intelligent)

50+ PII Entity Types: PERSON, ORGANIZATION, LOCATION, IP_ADDRESS, etc.
Context-Aware Detection: ML models understand context and nuance
Confidence Scoring: Configurable thresholds (default: 0.7)
Multi-language Support: Extensible language detection
Use Case: Comprehensive protection for sensitive environments

3. Dual Protection Strategy

Pre-call Guardrails: Block PII in user inputs before reaching AI models
Post-call Guardrails: Block PII in AI responses before reaching users
Configurable: Enable/disable individual guardrails as needed
Layered Security: Multiple detection systems provide comprehensive coverage

CI/CD & Deployment Pipeline

Automated Container Builds

Trigger: Push to main (when code files change)
├── Multi-platform build (amd64/arm64)
├── Security hardening applied
├── Push to ECR with dual tags:
│   ├── latest (development)
│   └── {commit-sha} (production)
└── Trigger infrastructure deployment

Path-Based Build Optimization

Builds only trigger when these files change:

Dockerfile (build instructions)
litellm-config.yaml (runtime configuration)
pii_*.py (guardrail implementations)
.github/workflows/build-and-push-ecr.yml (CI/CD pipeline)

Documentation and test changes don't trigger unnecessary builds.

Repository Dispatch Integration

Automatically triggers infrastructure deployment in litellm-infra repository with:

New container image URI
Commit SHA for traceability
Environment targeting (dev/staging/prod)

Published Container Images

Repository	Image URI	Description
litellm-guardrails	`{ECR_REGISTRY}/litellm-guardrails:latest`	Latest development build
litellm-guardrails	`{ECR_REGISTRY}/litellm-guardrails:{sha}`	Production-ready tagged builds

Local Development

Quick Start

# Clone and start development environment
git clone https://github.com/mrcloudchase/litellm-app.git
cd litellm-app

# Build and start all services
make build && make start

# Pull AI model for testing
make pull-model

# Test the deployment
make test

Development Stack

LiteLLM Proxy: http://localhost:4000
Ollama (AI Models): http://localhost:11434
PostgreSQL: localhost:5432
Master Key: sk-local-dev-key-12345

Testing Guardrails

# Test regex-based PII detection
make test-guardrails

# Use HTTP test collections
# tests/test_regex.http - Regex guardrail tests
# tests/test_presidio.http - Presidio guardrail tests

Configuration

Guardrail Configuration (`litellm-config.yaml`)

guardrails:
  # Fast regex-based detection
  - guardrail_name: "pii-regex-precall"
    litellm_params:
      guardrail: pii_regex_precall.PIIRegexPreCallGuardrail
      mode: "pre_call"
      
  # Comprehensive AI-based detection  
  - guardrail_name: "pii-presidio-precall"
    litellm_params:
      guardrail: pii_presidio_precall.PIIPresidioPreCallGuardrail
      mode: "pre_call"
      language: "en"
      threshold: 0.7

Environment Variables

LITELLM_MASTER_KEY=your-master-key    # Authentication
DATABASE_URL=postgresql://...         # Optional: Persistence
LITELLM_MODE=PRODUCTION              # Runtime mode

Production Usage

Docker Compose

services:
  litellm:
    image: {ECR_REGISTRY}/litellm-guardrails:latest
    ports:
      - "4000:4000"
    environment:
      - LITELLM_MASTER_KEY=${LITELLM_MASTER_KEY}
      - DATABASE_URL=${DATABASE_URL}

Kubernetes Deployment

apiVersion: apps/v1
kind: Deployment
metadata:
  name: litellm-guardrails
spec:
  replicas: 3
  selector:
    matchLabels:
      app: litellm-guardrails
  template:
    spec:
      containers:
      - name: litellm
        image: {ECR_REGISTRY}/litellm-guardrails:latest
        ports:
        - containerPort: 4000
        env:
        - name: LITELLM_MASTER_KEY
          valueFrom:
            secretKeyRef:
              name: litellm-secrets
              key: master-key

Security Features

Container Security

Minimal Attack Surface: Only essential runtime files included
Security Hardening: Non-root execution where possible
Comprehensive .dockerignore: Test files, docs, and dev tools excluded
Multi-platform Support: Consistent security across architectures

PII Protection

Zero Trust Model: All inputs and outputs scanned
Configurable Sensitivity: Adjustable confidence thresholds
Audit Trail: Comprehensive logging of PII detection events
Performance Optimized: Fast regex for high-throughput, AI for accuracy

Repository Structure

litellm-app/
├── Dockerfile                    # Container build instructions
├── litellm-config.yaml          # LiteLLM runtime configuration
├── docker-compose.yml           # Local development stack
├── Makefile                     # Development automation
├── 
├── pii_regex_detection.py       # Shared regex detection logic
├── pii_regex_precall.py         # Regex pre-call guardrail
├── pii_regex_postcall.py        # Regex post-call guardrail
├── 
├── pii_presidio_detection.py    # Shared Presidio AI logic
├── pii_presidio_precall.py      # Presidio pre-call guardrail  
├── pii_presidio_postcall.py     # Presidio post-call guardrail
├── 
├── tests/                       # Test collections and scripts
│   ├── test_regex.http         # Regex guardrail API tests
│   ├── test_presidio.http      # Presidio guardrail API tests
│   └── test_regex.py           # Python test automation
└── 
└── .github/workflows/           # CI/CD automation
    └── build-and-push-ecr.yml  # Container build pipeline

Getting Started

For Local Development: Use make build && make start
For Production Deployment: Pull from ECR and deploy with your infrastructure
For Testing: Use the HTTP test collections in tests/
For CI/CD: Configure GitHub secrets and let automation handle builds

Contributing

Fork the repository
Make your changes (ensure they follow the project structure)
Test locally with make test and make test-guardrails
Submit a pull request

Changes to core files (Dockerfile, litellm-config.yaml, pii_*.py) will trigger automatic container builds.

For a full walkthrough, please see the IMPLEMENTATION_GUIDE.md file.

Enterprise-grade PII protection for AI applications. Built for scale, security, and reliability.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LiteLLM with Advanced PII Detection Guardrails

What This Repository Does

Key Capabilities

Architecture Overview

PII Detection Systems

1. Regex-Based Guardrails (Fast & Reliable)

2. Microsoft Presidio AI Guardrails (Comprehensive & Intelligent)

3. Dual Protection Strategy

CI/CD & Deployment Pipeline

Automated Container Builds

Path-Based Build Optimization

Repository Dispatch Integration

Published Container Images

Local Development

Quick Start

Development Stack

Testing Guardrails

Configuration

Guardrail Configuration (`litellm-config.yaml`)

Environment Variables

Production Usage

Docker Compose

Kubernetes Deployment

Security Features

Container Security

PII Protection

Repository Structure

Getting Started

Contributing

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
Custom Guardrail Implementation Guide.pdf		Custom Guardrail Implementation Guide.pdf
Dockerfile		Dockerfile
IMPLEMENTATION_GUIDE.md		IMPLEMENTATION_GUIDE.md
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
litellm-config-local.yaml		litellm-config-local.yaml
litellm-config.yaml		litellm-config.yaml
pii_presidio_detection.py		pii_presidio_detection.py
pii_presidio_postcall.py		pii_presidio_postcall.py
pii_presidio_precall.py		pii_presidio_precall.py
pii_regex_detection.py		pii_regex_detection.py
pii_regex_postcall.py		pii_regex_postcall.py
pii_regex_precall.py		pii_regex_precall.py

mrcloudchase/litellm-app

Folders and files

Latest commit

History

Repository files navigation

LiteLLM with Advanced PII Detection Guardrails

What This Repository Does

Key Capabilities

Architecture Overview

PII Detection Systems

1. Regex-Based Guardrails (Fast & Reliable)

2. Microsoft Presidio AI Guardrails (Comprehensive & Intelligent)

3. Dual Protection Strategy

CI/CD & Deployment Pipeline

Automated Container Builds

Path-Based Build Optimization

Repository Dispatch Integration

Published Container Images

Local Development

Quick Start

Development Stack

Testing Guardrails

Configuration

Guardrail Configuration (litellm-config.yaml)

Environment Variables

Production Usage

Docker Compose

Kubernetes Deployment

Security Features

Container Security

PII Protection

Repository Structure

Getting Started

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Guardrail Configuration (`litellm-config.yaml`)

Packages