-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Threat Model
Status: Authoritative. Supersedes any inline threat notes in code comments.
Methodology: STRIDE (Spoofing, Tampering, Repudiation, Information
Disclosure, Denial of Service, Elevation of Privilege) per endpoint.
Scope: v1 client API + management surface + agent dispatch + relay.
Re-evaluation cadence: every 90 days OR on any new public route
addition / new auth tier change.
Owner: security-circle lead.
Related: SECURITY.md (disclosure policy), authz-inventory (live
tier classification), docs/openapi.yaml (API surface), docs/architecture/
(security boundaries).
┌────────────────────────────────────────────────────────────────────┐
│ Public Internet │
│ └── TLS termination (Caddy / cloud LB) │
│ │ │
│ ▼ ◀── Trust boundary 1: edge ↔ application │
│ ┌──────────────────────────────────────────────────────────────┐ │
│ │ Next.js App Router (3 replicas behind Caddy LB) │ │
│ │ ┌──────────────────────────────────────────────┐ │ │
│ │ │ per-route authz guards (routeGuard.ts) │ │ │
│ │ │ ┌────────┐ ┌────────┐ ┌────────┐ ┌──────┐ │ │ │
│ │ │ │ PUBLIC │ │CLIENT │ │MANAGE │ │ALWAYS│ │ │ │
│ │ │ │ /health│ │ /v1/* │ │/settn │ │PROT │ │ │ │
│ │ │ └────────┘ └────┬───┘ └────┬───┘ └──────┘ │ │ │
│ │ └──────────────────┼─────────┼──────────────┘ │ │
│ │ ▼ ▼ │ │
│ │ ┌─────────────────────────┐ ┌──────────────────────────┐ │ │
│ │ │ src/lib/sse/handlers │ │ src/lib/localDb │ │ │
│ │ │ (translator + chat) │ │ (sql.js encrypted store)│ │ │
│ │ └──────────┬──────────────┘ └────────────┬─────────────┘ │ │
│ │ │ │ │ │
│ │ ▼ ◀── Trust boundary 2: app ↔ upstream │ │
│ └───────────────┼──────────────────────────────┼─────────────────┘ │
│ ▼ ▼ │
│ ┌──────────────────────────┐ ┌────────────────────────────┐ │
│ │ Provider APIs │ │ Filesystem (DB + secrets) │ │
│ │ (OpenAI, Anthropic, …) │ │ + Redis (session/quota) │ │
│ └──────────────────────────┘ └────────────────────────────┘ │
└────────────────────────────────────────────────────────────────────┘
Boundary 1 (edge ↔ app): Mitigated by TLS at the edge + per-route auth gates
(src/server/authz/routeGuard.ts + extractApiKey/isValidApiKey in each
route.ts; there is no global Next.js middleware). Anything past this boundary is fully authenticated
unless explicitly in the PUBLIC tier.
Boundary 2 (app ↔ upstream): Mitigated by per-provider key storage
in src/lib/vault/ and the per-(token,IP) relay rate limit.
Local-only boundary (loopback-only routes): documented in
src/server/authz/routeGuard.ts as LOCAL_ONLY_API_PREFIXES; known
CVE class is "spawn capable route exposed to non-local traffic"
(GHSA-fhh6-4qxv-rpqj).
| Letter | Threat | Question it answers |
|---|---|---|
| S | Spoofing | Can an attacker pretend to be a legitimate principal? |
| T | Tampering | Can an attacker modify data in transit or at rest? |
| R | Repudiation | Can a principal deny an action they took? |
| I | Information disclosure | Can an attacker read data they shouldn't? |
| D | Denial of service | Can an attacker degrade or block service? |
| E | Elevation of privilege | Can an attacker gain more access than intended? |
Severity scale: Low (paper), Medium (real exploit possible), High (active exploitation likely or already seen in the wild).
The 50 v1 route handlers are scored below. Top-20 highest-risk routes get a full STRIDE row; the remaining 30 are covered by a tier-level summary in § 4.
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | M — Bearer key stolen → caller identity spoofed | Keys hashed (SHA-256) at rest in src/lib/db/apiKeys.ts; revocation list in src/lib/db/apiKeys.ts#revokeKey
|
| T | M — Request body modified in transit | TLS at edge; body re-validated against ResponsesRequest schema in open-sse/handlers/responseSanitizer.ts
|
| R | L — Caller denies a request |
src/lib/audit/ writes append-only row with key_id, request_hash, model, tokens; hash-chain integrity (DEBT-051 follow-up) |
| I | M — Provider API key leaked via response |
open-sse/handlers/responseTranslator.ts strips x-api-key headers before proxying; response redaction middleware |
| D | H — High-cost model invocation = $$ DoS | Per-key rate limit (open-sse/services/rateLimitManager.ts); per-tenant quota pool (src/lib/db/quotaPools.ts); DEBT-001 TPM/TPD reservoir is not yet enforced |
| E | L — Bearer scope escalation | API keys carry explicit scopes array; routes require specific scopes; requireManagementAuth enforces management tier |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | M — Relay token theft | Tokens hashed with SHA-256 + per-token ID column; per-(token,IP) rate limit in src/app/api/v1/relay/chat/completions/route.ts:38-58
|
| T | M — Token rotation bypass | Token records have revoked_at; check happens before each request |
| R | L — Forensic headers sanitized |
sanitizeForensicHeader strips CR/LF; length cap 256; IP/UA never echoed back to client |
| I | H — IP-bucket map leaks | In-memory only, per-instance, capped at 10,000 entries; not persisted; see DEBT-038 (in-memory state loss on restart) |
| D | M — Per-(token,IP) abuse |
RELAY_IP_PER_MINUTE=30 default; attacker rotates IPs to spread, hits per-token wall |
| E | M — Token scope creep | Tokens carry allowedModels[]; reject on model-not-in-list |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L — Same as 3.1 | Same auth + key model |
| T | L — Inputs are vectors; tampered embedding not useful | Re-encode on receive; reject dimension mismatch |
| R | L |
src/lib/audit/ row written |
| I | M — Embedding vectors can leak training data | Reject requests with PII patterns (regex) before forwarding; DEBT-052 follow-up |
| D | L — Embeddings cheaper than chat | Quota pool cap |
| E | L | Bearer scope |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | Same auth |
| T | L | |
| R | L | |
| I | M — Documents contain user content | Per-tenant key isolation; audit log captures doc hash, not content |
| D | L | |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | L | |
| I | H — Moderation inputs are sensitive user content | Encrypted in transit (TLS); not logged; flag results cached with TTL |
| D | L | |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | L | |
| I | L — Audio is synthesised from input | Same as 3.1 |
| D | M — Long TTS = bandwidth burn |
input cap 4096 chars; per-key rate limit |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | M — Multipart body tampered |
multipart-parser validates Content-Length + boundary; size cap 25 MB |
| R | L | |
| I | H — Audio contains user PII / secrets | TTL cache; never written to disk by default; redaction of credit-card-like sequences in open-sse/executors/chatgpt-web.ts (per-file TODO, see DEBT-019) |
| D | M — Large audio = bandwidth + provider cost | 25 MB cap; rate limit |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | L | |
| I | L — Generated images are public URLs | URLs are signed with TTL; provider-controlled CDN |
| D | M — Image gen is $expensive | Per-key USD spend limits enforced in the request pipeline (src/lib/db/tokenLimits.ts, surfaced via src/app/api/usage/budget/route.ts) |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | M — Long-running async job, hard to attribute | Job records include key_id, request_id, created_at; result includes signed URL with TTL |
| I | M — Generated video may contain PII | Same as 3.8 |
| D | M — Video gen is very $expensive | Cost-budget middleware; max 10-second durations |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | M — Uploaded file attributed to wrong key |
key_id captured at upload; immutable record |
| T | H — Malicious file (zip bomb, polyglot) | Size cap 25 MB; MIME sniff + magic-byte check; AV scan is not performed (DEBT-053 follow-up) |
| R | L | |
| I | H — Files contain user PII | Encrypted at rest in src/lib/vault/; per-tenant access control |
| D | M — Large uploads | Size cap + rate limit |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | H — IDOR (file belonging to other tenant) | Per-tenant scope check; requireTenantScope middleware (added in v3.8.25) |
| T | L | |
| R | L | |
| I | H — Tenant isolation breach = data leak | Per-tenant prefix in file storage; cross-tenant read attempt logged + 403 |
| D | L | |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | M — Batch input file modified after upload | Content hash stored at upload; mismatch on read = reject |
| R | L | |
| I | M — Input file contains PII | Same as 3.10 |
| D | M — Large batch | 50k row cap per batch |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | H — Mass delete | Requires manage-scope auth; requireManagementAuth enforced; audit log captures key_id
|
| R | L | |
| I | M — Output files may leak | Soft-delete with 7-day retention before GC |
| D | L | |
| E | M — A misconfigured client could trigger this | Auth gate prevents; no anonymous access |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | H — Prompt injection in task prompt
|
withInjectionGuard middleware (per src/app/api/v1/responses/route.ts:6); pattern blocklist + sandboxed tool execution |
| R | M — Long-running agent task, attribution | Task ID + key_id + created_at recorded; agent result signed by provider |
| I | H — Agent reads user repo / credentials | Provider-scoped credentials; least-privilege OAuth; agent result sandboxed before return |
| D | M — Long-running agent = $$ burn | 10-minute default timeout; configurable timeoutSeconds
|
| E | M — Agent tool calls could exceed scope | Tool allowlist per provider; rejection in open-sse/executors/
|
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | L | |
| I | H — Leaked credential metadata enables targeted phishing | Returns only provider, hasValue, updatedAt (NEVER the value); requires manage-scope auth |
| D | L | |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | L | |
| I | M — Reveals quota usage; could enable timing attacks | Rate-limited; per-key only; no cross-key reads |
| D | L | |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | L | |
| I | M — Reveals provider allowlist | Required for client UX; per-key scope |
| D | L | |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | L | |
| T | L | |
| R | L | |
| I | H — Fetched content could be exfiltrated via prompt injection | Content goes through withInjectionGuard; explicit extract modes; URL allowlist (default-deny, see DEBT-054) |
| D | M — Recursive fetch = bandwidth burn | Max redirects 3; max 5 MB response; timeout 10s |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | M — VSCode token theft = long-lived impersonation | Tokens are 256-bit, single-purpose, scoped to VSCode-CLI; revokable from /api/keys/{id}
|
| T | L | |
| R | L | |
| I | M — Token in URL = logged in access logs | Token treated as secret; logs hash + last-4 only |
| D | M — Per-token rate limit (lower than API key) |
VSCODECLI_RATE_LIMIT=10 req/min default |
| E | L |
| STRIDE | Risk | Mitigation |
|---|---|---|
| S | M — WS upgrade lacks standard Bearer header | Subprotocol-based auth: Sec-WebSocket-Protocol: bearer, <token>; per-connection scope check |
| T | M — WS message frames tampered | TLS-only (wss://); reject ws:// in prod |
| R | L | Connection ID + auth principal recorded at handshake |
| I | H — Long-lived connection = larger blast radius | 1-hour max connection; heartbeat every 30s; auto-disconnect on auth revoke |
| D | H — Many WS = connection table exhaustion | Per-IP WS cap 5; per-key cap 20; rate limit per message |
| E | L |
Routes in the same tier inherit the same baseline mitigations. Per-route
deltas are documented in docs/openapi.yaml (request schema constraints).
| Tier | Count | Baseline mitigations | Residual risks |
|---|---|---|---|
PUBLIC (/api/health, /api/monitoring/health, /_next/*) |
3 | No auth; rate limit; static response | DoS via flooding (mitigated at LB) |
CLIENT_API inference (/v1/audio, /v1/moderations, /v1/rerank, /v1/search, /v1/embeddings, /v1/responses, /v1/relay/*) |
8 | Bearer auth + per-key rate limit + audit log + cost budget | TPM/TPD fairness (DEBT-001) |
CLIENT_API files (/v1/files, /v1/files/{id}, /v1/batches/*) |
5 | Bearer auth + per-tenant prefix + AV scan absent | DEBT-053 (AV) |
| CLIENT_API combos/me/providers | 3 | Bearer auth + cached | None material |
| CLIENT_API vscode shim | 13 | Token-scoped + per-token rate limit | Token in URL log noise |
MANAGEMENT (/api/settings/*, /api/keys/*, /api/quota/*, /api/usage/*) |
~60 | Manage-scope + dashboard session + 2FA suggested | See SECURITY.md for P0 items |
| ALWAYS_PROTECTED (shutdown, DB settings) | ~5 | Auth + re-auth for security-impacting changes | None material |
| LOCAL_ONLY (loopback spawn routes) | ~8 | Loopback-only + bypass-constant gated by manage-scope | GHSA-fhh6-4qxv-rpqj class if exposed |
| ID | Mitigation | Pillar target | Effort |
|---|---|---|---|
| TM-01 | DEBT-001: TPM/TPD token-bucket fairness for relay + inference | D, E (cost) | M |
| TM-02 | DEBT-051: Hash-chain integrity for src/lib/audit/ rows |
R | M |
| TM-03 | DEBT-053: AV scan on /v1/files upload (ClamAV or external API) |
T, I | L |
| TM-04 | DEBT-054: URL allowlist for /v1/web/fetch
|
I, D | M |
| TM-05 | OIDC/SAML SSO for management tier (audit L46) | S | L |
| TM-06 | TOTP MFA for manage-scope users (audit L49) | S, E | M |
| TM-07 | Field-level encryption for PII columns (audit L47) | I | L |
| Date | Reviewer | Change |
|---|---|---|
| 2026-06-18 | @KooshaPari/core (L5-118) | Initial STRIDE per endpoint, top-20 highest-risk routes + tier summary |
| 2026-06-25 (planned) | security-circle | Re-score after DEBT-001, DEBT-051 close-outs |
| 2026-09-18 (planned) | security-circle | Quarterly full re-review; refresh mitigations list |
OmniRoute · Website · npm · Docker Hub
- Setup Guide
- User Guide
- Features
- Quick Start (Docker)
- Electron Desktop App
- Termux (Android)
- PWA Guide
- MCP Server
- A2A Server
- Agent Protocols
- OpenCode Plugin
- Webhooks
- Cloud Agents
- Skills
- Memory
- Evals
- Gamification
- Guardrails
- Compliance
- Error Sanitization
- Public Credentials
- Route Guard Tiers
- Stealth Guide
- CLI Token Auth