-
-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Tiers
OmniRoute organizes the 207+ supported providers into 3 economic tiers. Each request travels through them in order until one returns successfully — you get the cheapest viable response without ever writing fallback code.
Providers you already pay for. OmniRoute uses every drop of quota before it expires.
| Provider | Why Tier 1 |
|---|---|
| Claude Code OAuth | Anthropic Pro/Team — flat-rate, often unused |
| OpenAI Codex (ChatGPT subscription) | Plus/Team includes Codex quota |
| GitHub Copilot | Per-seat — quota resets monthly |
| Cursor IDE | Pro plan quota |
| Antigravity / Windsurf | Built-in quotas |
Strategy: route here first for every request that fits the model's
strengths. Quota tracker monitors approaching reset; combo strategies
reset-aware and subscription prioritize accordingly.
Pay-per-token providers under $1/1M tokens. Reserved for high-volume work or after Tier 1 quotas hit limits.
| Provider | Price (input/output) | Strengths |
|---|---|---|
| DeepSeek V4 Pro | $0.27 / $1.10 per 1M | Code, reasoning |
| GLM-4.5 | $0.60 / $2.20 per 1M | Long context |
| MiniMax M1 | $0.20 / $1.10 per 1M | Speed |
| Qwen Coder | $0.30 / $1.20 per 1M | Code |
| OpenRouter (price-optimized) | varies | 100+ models, dynamic |
Strategy: combo cost-optimized picks lowest $/token model that meets
the task's capability filter (vision, JSON mode, tools, max-context).
Zero-cost providers — free tiers, credit programs, OAuth daily quotas.
| Provider | Free quota / credits |
|---|---|
| Kiro AI | Free Claude tier (generous fair-use) |
| OpenCode Free | No auth, generous rate limits |
| Qoder | Free OAuth |
| Gemini CLI OAuth | Generous daily quota |
| Google Vertex AI | $300 new-account credits |
| Amazon Q | Free tier for AWS users |
| Pollinations | Open public API |
| Cloudflare AI | Workers AI free tier |
Strategy: combo auto with budget cap routes here when Tier 1+2 fail
or when useFreeOnly=true is set. Free providers often have weaker
rate limits — circuit breaker recovers them on backoff.
Dashboard → Tiers → assign your providers. Defaults (from tierDefaults.json) are
sensible; edit when you have specific subscriptions to prioritize or providers to exclude.
Auto-Combo's 9-factor scoring also considers tier. See
docs/routing/AUTO-COMBO.md.
Dashboard → Usage shows tokens spent per tier per day. Use this to:
- Confirm Tier 1 is utilized fully (otherwise you're wasting subscription value)
- Identify which Tier 2 models are picked most (consolidate to 1-2)
- Verify Tier 3 saves money on test/exploration workloads
{
"strategy": "auto",
"config": { "auto": { "weights": { "costInv": 0.5, "tierPriority": 0.3 } } }
}Forces strongly towards Tier 3; only uses Tier 2 if Tier 3 is unavailable.
{
"strategy": "priority",
"targets": [
{ "provider": "claude-code-oauth", "weight": 1 },
{ "provider": "deepseek", "weight": 1 },
{ "provider": "kiro", "weight": 1 }
]
}Explicit ordered list matching Tier 1 → Tier 2 → Tier 3.
OmniRoute · Website · npm · Docker Hub
- Setup Guide
- User Guide
- Features
- Quick Start (Docker)
- Electron Desktop App
- Termux (Android)
- PWA Guide
- MCP Server
- A2A Server
- Agent Protocols
- OpenCode Plugin
- Webhooks
- Cloud Agents
- Skills
- Memory
- Evals
- Gamification
- Guardrails
- Compliance
- Error Sanitization
- Public Credentials
- Route Guard Tiers
- Stealth Guide
- CLI Token Auth