LLMProxy

Unified API proxy for multiple LLM providers. A Python library that provides a single interface for calling OpenAI, Anthropic, and other LLM APIs with automatic fallback and load balancing.

Architecture

graph TD
    A[Your Application] --> B[LLMProxy]
    B --> C{Router / Fallback}
    C --> D[OpenAI Provider]
    C --> E[Anthropic Provider]
    C --> F[Custom Provider]
    D --> G[OpenAI API]
    E --> H[Anthropic API]
    F --> I[Any LLM API]

    B --> J[Cost Estimator]
    B --> K[Usage Stats]
    B --> L[Health Checker]
    B --> M[Retry Engine]

    style B fill:#4a90d9,stroke:#333,color:#fff
    style C fill:#f5a623,stroke:#333,color:#fff

Features

Multi-provider support — OpenAI, Anthropic, and extensible to any LLM API
Automatic fallback — Define fallback chains so requests reroute on failure
Load balancing — Distribute requests across providers
Cost estimation — Estimate token costs before sending requests
Usage tracking — Monitor request counts, tokens, and costs per provider
Health checks — Verify provider availability in real time
Configurable retries — Exponential backoff with jitter out of the box
Pydantic models — Fully typed request/response objects
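
To make the "exponential backoff with jitter" item concrete, here is a minimal sketch of one common variant ("full jitter"). It is an illustration only, not LLMProxy's retry engine: the function name `backoff_delays` and the parameters `base`, `cap`, and `rng` are assumptions made for this example.

```python
import random


def backoff_delays(max_retries=5, base=0.5, cap=30.0, rng=None):
    """Yield one sleep duration per retry using "full jitter" backoff.

    The ceiling doubles on each attempt (base * 2**attempt), is clamped to
    `cap`, and the actual delay is drawn uniformly from [0, ceiling].
    """
    rng = rng or random.Random()
    for attempt in range(max_retries):
        ceiling = min(cap, base * (2 ** attempt))
        yield rng.uniform(0.0, ceiling)


# A seeded generator keeps the demo reproducible; real code would not seed it.
delays = list(backoff_delays(max_retries=4, base=0.5, cap=30.0, rng=random.Random(0)))
for attempt, delay in enumerate(delays):
    print(f"retry {attempt + 1}: sleep up to {delay:.3f}s")
```

Randomizing the delay spreads retries from many clients over time, so they are less likely to hit a recovering provider all at once.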

Quickstart

Installation

pip install llmproxy

Or install from source:

git clone https://github.com/officethree/LLMProxy.git
cd LLMProxy
pip install -e ".[dev]"

Basic Usage

import asyncio

from llmproxy import LLMProxy

proxy = LLMProxy()

# Add providers
proxy.add_provider("openai", {
    "api_key": "sk-...",
    "base_url": "https://api.openai.com/v1",
})
proxy.add_provider("anthropic", {
    "api_key": "sk-ant-...",
    "base_url": "https://api.anthropic.com/v1",
})

# Set fallback order: try OpenAI first, then Anthropic
proxy.set_fallback_chain(["openai", "anthropic"])


async def main():
    # complete() is a coroutine, so it has to be awaited inside an event loop
    response = await proxy.complete(
        prompt="Explain quantum computing in one paragraph.",
        model="gpt-4o",
        provider="openai",
    )
    print(response.content)


asyncio.run(main())
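
The idea behind a fallback chain can be sketched without the library: try each provider in order and return the first successful result. This is a simplified illustration, not LLMProxy's router. `ProviderError`, `complete_with_fallback`, and the two stub providers are hypothetical names invented for the example.

```python
import asyncio


class ProviderError(Exception):
    """Hypothetical error type raised by a provider call in this sketch."""


async def complete_with_fallback(prompt, chain, providers):
    """Try each provider name in `chain` in order; return the first success."""
    errors = {}
    for name in chain:
        try:
            return name, await providers[name](prompt)
        except ProviderError as exc:
            errors[name] = exc  # remember why this provider failed, then move on
    raise ProviderError(f"all providers failed: {errors}")


async def flaky_openai(prompt):
    # Stand-in for a provider that is currently down.
    raise ProviderError("simulated outage")


async def stub_anthropic(prompt):
    # Stand-in for a healthy provider.
    return f"echo: {prompt}"


providers = {"openai": flaky_openai, "anthropic": stub_anthropic}
used, text = asyncio.run(
    complete_with_fallback("hello", ["openai", "anthropic"], providers)
)
print(used, text)
```

Here the first provider fails, so the call is answered by the second one in the chain.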

Cost Estimation

estimate = proxy.estimate_cost("Write a haiku about Python.", model="gpt-4o")
print(f"Estimated cost: ${estimate['estimated_cost']:.6f}")
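
How such an estimate might be computed is not documented here. One rough approach, shown below as a sketch only, approximates the token count from the prompt length and multiplies by a per-token price. The helper `rough_cost_estimate`, the `estimated_tokens` key, the four-characters-per-token heuristic, and the price are all assumptions; a real estimate would use the model's tokenizer and current pricing.

```python
def rough_cost_estimate(prompt, price_per_1k_input_tokens, chars_per_token=4):
    """Rough pre-request estimate covering input tokens only."""
    # Ceiling division so a short, non-empty prompt counts as at least one token.
    estimated_tokens = -(-len(prompt) // chars_per_token)
    estimated_cost = estimated_tokens / 1000 * price_per_1k_input_tokens
    return {"estimated_tokens": estimated_tokens, "estimated_cost": estimated_cost}


# 0.0025 is a placeholder price, not a quoted rate for any real model.
estimate = rough_cost_estimate(
    "Write a haiku about Python.", price_per_1k_input_tokens=0.0025
)
print(f"Estimated tokens: {estimate['estimated_tokens']}")
print(f"Estimated cost: ${estimate['estimated_cost']:.6f}")
```

Output tokens are unknown before the request is sent, so a pre-request figure like this is a lower bound on the final cost.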

Health Checks

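import asyncio
# health_check() is a coroutine, so run it in an event loop (or await it from async code)
status = asyncio.run(proxy.health_check("openai"))
print(f"OpenAI healthy: {status['healthy']}")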
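
A health check usually boils down to running a cheap probe under a timeout. The sketch below shows that pattern with the standard library. It is not LLMProxy's health checker: `check_health`, the probe functions, and the `latency_s` and `error` keys are hypothetical, and only the `healthy` key mirrors the example above.

```python
import asyncio
import time


async def check_health(probe, timeout=5.0):
    """Run an async `probe`; report whether it finished in time without error."""
    started = time.perf_counter()
    try:
        await asyncio.wait_for(probe(), timeout=timeout)
        healthy, error = True, None
    except asyncio.TimeoutError:
        healthy, error = False, "timeout"
    except Exception as exc:  # any other failure also marks the provider unhealthy
        healthy, error = False, str(exc)
    latency = time.perf_counter() - started
    return {"healthy": healthy, "latency_s": latency, "error": error}


async def ok_probe():
    # Stand-in for a cheap request that succeeds immediately.
    await asyncio.sleep(0)


async def slow_probe():
    # Stand-in for a provider that does not answer in time.
    await asyncio.sleep(1)


fast = asyncio.run(check_health(ok_probe))
slow = asyncio.run(check_health(slow_probe, timeout=0.01))
print(fast["healthy"], slow["healthy"], slow["error"])
```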

Usage Stats

stats = proxy.get_usage_stats()
for provider, data in stats.items():
    print(f"{provider}: {data['request_count']} requests, ${data['total_cost']:.4f}")
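
For readers curious what sits behind numbers like these, a per-provider tracker can be as small as a dictionary of counters. The sketch below is an assumption about the general shape, not the library's code: `UsageTracker`, `record`, and the `total_tokens` key are invented here, while `request_count` and `total_cost` mirror the keys used above.

```python
from collections import defaultdict


class UsageTracker:
    """Accumulate per-provider counters in memory."""

    def __init__(self):
        self._stats = defaultdict(
            lambda: {"request_count": 0, "total_tokens": 0, "total_cost": 0.0}
        )

    def record(self, provider, tokens, cost):
        """Add one finished request to the provider's running totals."""
        entry = self._stats[provider]
        entry["request_count"] += 1
        entry["total_tokens"] += tokens
        entry["total_cost"] += cost

    def get_usage_stats(self):
        # Return plain copies so callers cannot mutate the tracker's state.
        return {name: dict(entry) for name, entry in self._stats.items()}


tracker = UsageTracker()
tracker.record("openai", tokens=120, cost=0.0006)
tracker.record("openai", tokens=80, cost=0.0004)
tracker.record("anthropic", tokens=200, cost=0.0030)

stats = tracker.get_usage_stats()
for provider, data in stats.items():
    print(f"{provider}: {data['request_count']} requests, ${data['total_cost']:.4f}")
```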

Configuration

Copy .env.example to .env and set your API keys:

cp .env.example .env
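
Once the keys are available as environment variables, they can be read at startup instead of being hardcoded. The sketch below assumes variable names of the form `OPENAI_API_KEY` and `OPENAI_BASE_URL`; check `.env.example` for the names this project actually uses. The helper `provider_config_from_env` is hypothetical, and it does not parse `.env` itself: the values must already be in the process environment (exported by the shell or loaded by a dotenv tool).

```python
import os


def provider_config_from_env(prefix, default_base_url, env=os.environ):
    """Build a provider config dict from <PREFIX>_API_KEY / <PREFIX>_BASE_URL."""
    api_key = env.get(f"{prefix}_API_KEY")
    if not api_key:
        raise RuntimeError(f"{prefix}_API_KEY is not set")
    return {
        "api_key": api_key,
        "base_url": env.get(f"{prefix}_BASE_URL", default_base_url),
    }


# Demo with an explicit mapping so the sketch runs without real credentials.
demo_env = {"OPENAI_API_KEY": "sk-demo"}
config = provider_config_from_env(
    "OPENAI", "https://api.openai.com/v1", env=demo_env
)
print(config["base_url"])
```

A dict built this way has the same shape as the second argument to `proxy.add_provider` in Basic Usage.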

See docs/ARCHITECTURE.md for detailed design documentation.

Development

make install    # Install with dev dependencies
make test       # Run tests
make lint       # Run linter
make format     # Format code

Inspired by

Inspired by LiteLLM and multi-provider LLM trends.

Built by Officethree Technologies | Made with love and AI
