-
Notifications
You must be signed in to change notification settings - Fork 3
Home
hugalafutro edited this page Jun 11, 2026
·
7 revisions
Multi-Provider AI Gateway
A single OpenAI-compatible endpoint that sits in front of all your LLM providers. Route requests across models using your own priority ordering, fail over automatically when a provider goes down, and see exactly where your tokens are going.

git clone https://github.com/hugalafutro/model-hotel.git
cd model-hotel
cp .env.example .env # Set MASTER_KEY and DATABASE_URL
docker compose up --buildSee Development for local setup details.
-
Unified API - One OpenAI-compatible surface for all providers:
/v1/chat/completionsplus multimodal endpoints (embeddings, image generation/edits/variations, text-to-speech, speech-to-text) -
Hotel Routing - Prefix models with
hotel/to route through failover groups (works on every endpoint) - Transparent Failover - Automatic retry on 5xx, 429, 401/403, 404, and timeouts
- Circuit Breaker - Per-provider circuit breaker prevents wasted requests
- Virtual Keys - Per-client API keys with rate limiting and usage tracking
- Model Discovery - Auto-sync 300+ models across 30+ providers, with a post-scan summary of what changed (added / re-enabled / disabled models, failover group updates)
- Request Logging - Full latency decomposition (TTFT, overhead, per-stage timing)
- Privacy by Design - Prompts are never logged, read, or stored
- Interactive Chat & Arena - Built-in UI for testing and comparing models
- Configuration - Environment variables, runtime settings, appearance
- Development - Local setup, project structure, contributing
- Virtual Keys - Per-client API key management, rate limiting, usage tracking
- API Reference - Proxy and admin API endpoints with examples
- Request Logging - Latency decomposition, log management, app logs
- Model Discovery - Automatic model synchronization with per-provider metadata
- Failover and Hotel Routing - Transparent failover, hotel routing, circuit breaker
- Security - AES-256-GCM encryption, Argon2id, SHA-256 hashing, URL validation
- Privacy - Data handling, what is and isn't captured, local deployment
Core packages: proxy/ (streaming, failover), provider/ (discovery, encryption), failover/ (circuit breaker, routing), virtualkey/ (auth, rate limiting), model/ (caching, CRUD). PostgreSQL backend with 43 migrations.
See the full architecture diagram in the README.
Last synced from hugalafutro/model-hotel@fe063b9 on 2026-06-21 10:25 UTC. Edit these pages under wiki/ (and images under docs/screenshots/) in the main repo, not here.
- π¨ Home
- βοΈ Configuration
- π Development
- π Virtual Keys
- π API Reference
- π Request Logging
- π Model Discovery
- π Failover and Hotel Routing
- π Alerting