-
Notifications
You must be signed in to change notification settings - Fork 1k
Guardrails
Source of truth:
src/lib/guardrails/Last updated: 2026-05-13 — v3.8.0
Guardrails enforce safety, policy, and content transformations at the boundary
between OmniRoute and upstream providers. Each guardrail can inspect (and
optionally reject, transform, or annotate) request payloads (preCall) and
upstream responses (postCall).
The system is fail-open: if a guardrail throws while executing, the registry
records the error and continues with the next guardrail rather than failing the
request. Blocking is an explicit decision (block: true), never an accident.
The registry auto-loads three guardrails in priority order on import
(see registry.ts → registerDefaultGuardrails()):
| Priority | Name | Stage(s) | File |
|---|---|---|---|
5 |
vision-bridge |
preCall |
visionBridge.ts |
10 |
pii-masker |
pre + post
|
piiMasker.ts |
20 |
prompt-injection |
preCall |
promptInjection.ts |
Lower priority numbers run first.
Intercepts image-bearing requests aimed at non-vision models and replaces the image parts with text descriptions produced by a configurable vision model before the upstream call. This lets text-only providers transparently handle multimodal payloads.
Flow:
- Skip if the target model already supports vision (unless it appears in the
forced-bridge list
isVisionBridgeForcedModel). - Extract image parts via
extractImageParts(messages). Skip if none. - Load runtime config from
getSettings()(visionBridgeEnabled,visionBridgeModel,visionBridgePrompt,visionBridgeTimeout,visionBridgeMaxImages). - Cap images at
maxImages, call the vision model in parallel (Promise.allSettled), and inject[Image N]: <description>text parts in their place — failed images become[Image N]: (unavailable). - Return
modifiedPayload+ meta (imagesProcessed,processingTimeMs,visionModel).
Defaults live in src/shared/constants/visionBridgeDefaults.ts. The guardrail
exposes a deps constructor option so tests can inject fake getSettings and
callVisionModel implementations.
Runs on both stages.
-
preCallclones the payload, walkssystem,messages, andinputarrays, and appliesprocessPII()(from@/shared/utils/inputSanitizer) to stringcontent/textfields. WhenPII_REDACTION_ENABLED=trueandINPUT_SANITIZER_MODE=redact, detected PII is stripped/redacted in the outbound payload. Otherwise the call records detection counts without rewriting content. -
postCalldeep-clones the response, runssanitizePIIResponse()plus the Responses-API-shape masker (maskResponsesOutput— coversoutput_textandoutput[].content[].text). If any redaction occurs, the modified response replaces the original.
The guardrail never blocks; it only annotates (meta.detections,
meta.redacted) or rewrites.
Detects adversarial structures in user-supplied content and enforces the configured policy. Behavior is driven by environment variables and constructor options:
| Setting | Env var | Default | Effect |
|---|---|---|---|
| Enabled | INPUT_SANITIZER_ENABLED |
true |
When false, guardrail short-circuits. |
| Mode |
INJECTION_GUARD_MODE / INPUT_SANITIZER_MODE
|
warn |
block, warn, or log. |
| Block threshold |
blockThreshold option |
high |
Minimum severity required to block. |
Detection sources:
-
sanitizeRequest()from@/shared/utils/inputSanitizer(shared detector set used elsewhere in the pipeline). - Built-in
DEFAULT_GUARD_PATTERNS(currentlysystem_override_inlineandmarkdown_system_block, bothhighseverity). - Optional
customPatternspassed via constructor options (strings, regex, or{ name, pattern, severity }records).
When mode === "block" and at least one detection meets the severity
threshold, preCall returns { block: true, message: "Request rejected: suspicious content detected" }. In warn/log modes the guardrail logs but
allows the call. The shared helper evaluatePromptInjection() is also exported
for callers that need to evaluate prompts without going through the registry.
class BaseGuardrail {
enabled: boolean;
name: string;
priority: number;
constructor(name: string, options?: { enabled?: boolean; priority?: number });
async preCall(payload: unknown, context: GuardrailContext): Promise<GuardrailResult | void>;
async postCall(response: unknown, context: GuardrailContext): Promise<GuardrailResult | void>;
}
interface GuardrailResult<TValue = unknown> {
block?: boolean; // true short-circuits the chain
message?: string; // surfaced when blocking
meta?: Record<string, unknown> | null;
modifiedPayload?: TValue; // returned by preCall to rewrite the request
modifiedResponse?: TValue; // returned by postCall to rewrite the response
}
interface GuardrailContext {
apiKeyInfo?: Record<string, unknown> | null;
disabledGuardrails?: string[] | null;
endpoint?: string | null;
headers?: Headers | Record<string, unknown> | null;
log?: GuardrailLog | Console | null;
method?: string | null;
model?: string | null;
provider?: string | null;
sourceFormat?: string | null;
stream?: boolean;
targetFormat?: string | null;
}A guardrail signals "no change" by returning either void, {}, or
{ block: false }. Returning a modifiedPayload/modifiedResponse replaces
the value flowing through the chain for downstream guardrails.
The singleton guardrailRegistry exposes:
-
register(guardrail)— adds (or replaces by normalized name) a guardrail and re-sorts by ascendingpriority. -
clear()/list()— administrative helpers. -
runPreCallHooks(payload, context)— iterates active guardrails, threads the payload throughmodifiedPayload, and stops on the firstblock: true. -
runPostCallHooks(response, context)— same flow on the response side. -
resetGuardrailsForTests({ registerDefaults })— clears state and optionally re-registers the defaults for clean test isolation.
Both runners return { blocked, payload|response, results, guardrail?, message? }
where results is an array of GuardrailExecutionResult records that include
per-guardrail blocked, skipped, modified, error, and meta fields,
useful for tracing.
resolveDisabledGuardrails({ apiKeyInfo, body, headers }) aggregates a
de-duplicated list of guardrail names that should be skipped for the current
request. Sources (all optional, all merged):
apiKeyInfo.disabledGuardrails- Request body
disabledGuardrails(top-level) - Request body
metadata.disabledGuardrails - Header
x-omniroute-disabled-guardrails(or legacyx-disabled-guardrails)
Values may be arrays of strings or a comma-separated string; names are
normalized to lowercase kebab-case (pii_masker → pii-masker). The result
is passed through context.disabledGuardrails to the registry, which skips
matching guardrails (skipped: true in results).
For each request flowing through src/sse/handlers/chat.ts and
open-sse/handlers/chatCore.ts:
-
resolveDisabledGuardrails(...)builds the skip list from API key, body, and headers. -
guardrailRegistry.runPreCallHooks(body, ctx)runs guardrails in ascending priority order:- Disabled guardrails are recorded as
skipped. - Each guardrail's
preCallmay rewrite the payload viamodifiedPayload. - The first
block: trueshort-circuits the chain and the handler returns a guardrail rejection response.
- Disabled guardrails are recorded as
- The (potentially rewritten) payload flows into combo routing and upstream dispatch.
- After the response is assembled,
guardrailRegistry.runPostCallHooks(...)runs the same chain on the response.block: truehere drops the upstream response.
Guardrails that throw are recorded with error: <message> and logged via
logger.warn, but the chain continues — fail-open by design.
Environment variables read by the built-in guardrails:
| Variable | Used by | Effect |
|---|---|---|
INPUT_SANITIZER_ENABLED |
prompt-injection |
Set false to disable detection entirely. |
INPUT_SANITIZER_MODE |
prompt-injection, pii-masker
|
Shared mode: warn, block, log, or redact. |
INJECTION_GUARD_MODE |
prompt-injection |
Legacy alias for INPUT_SANITIZER_MODE. |
PII_REDACTION_ENABLED |
pii-masker |
When true + mode redact, request PII is stripped. |
PII_RESPONSE_SANITIZATION / _MODE
|
pii-masker (downstream) |
Controls response-side masker behavior. |
The Vision Bridge reads runtime config from the DB-backed settings store
(getSettings()), not env vars: visionBridgeEnabled, visionBridgeModel,
visionBridgePrompt, visionBridgeTimeout, visionBridgeMaxImages. Defaults
live in src/shared/constants/visionBridgeDefaults.ts.
import { BaseGuardrail, guardrailRegistry } from "@/lib/guardrails";
class BudgetGuardrail extends BaseGuardrail {
constructor() {
super("budget", { priority: 50 });
}
async preCall(payload, ctx) {
if (ctx.apiKeyInfo?.budgetExceeded) {
return { block: true, message: "Daily budget exceeded" };
}
return { block: false };
}
}
guardrailRegistry.register(new BudgetGuardrail());Steps:
- Create
src/lib/guardrails/myGuardrail.tsextendingBaseGuardrail. - Implement
preCalland/orpostCall. - Either register at import time (push from
registerDefaultGuardrails) or callguardrailRegistry.register(...)at runtime — the registry replaces any prior guardrail with the same normalized name. - Add tests under
tests/unit/(existing examples:tests/unit/guardrails-registry.test.ts,tests/unit/prompt-injection-guard.test.ts,tests/unit/guardrails/visionBridge.test.ts).
Use resetGuardrailsForTests() between tests to start from a known state.
Pass { registerDefaults: false } to start with an empty registry and
register only the guardrails under test. The Vision Bridge guardrail accepts
dependency injection (deps.getSettings, deps.callVisionModel) so tests can
exercise the full flow without DB or network access.
-
src/lib/guardrails/— implementation -
src/shared/utils/inputSanitizer.ts— shared detector that powers prompt-injection and PII masking -
src/shared/constants/visionBridgeDefaults.ts— Vision Bridge defaults and forced-bridge model list -
docs/architecture/RESILIENCE_GUIDE.md— orthogonal layer (circuit breaker, cooldowns) -
docs/reference/ENVIRONMENT.md— full env var reference
OmniRoute · Website · npm · Docker Hub
- Setup Guide
- User Guide
- Features
- Quick Start (Docker)
- Electron Desktop App
- Termux (Android)
- PWA Guide
- MCP Server
- A2A Server
- Agent Protocols
- OpenCode Plugin
- Webhooks
- Cloud Agents
- Skills
- Memory
- Evals
- Gamification
- Guardrails
- Compliance
- Error Sanitization
- Public Credentials
- Route Guard Tiers
- Stealth Guide
- CLI Token Auth