-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
2026 06 21 Compression Phase3 Request Header Plan
For agentic workers: REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (
- [ ]) syntax for tracking.
Goal: Add the x-omniroute-compression per-request header so a client can override the resolved compression plan for a single request, taking precedence over every operator-configured layer.
Architecture: A pure header interpreter (planFromHeader) is evaluated at the top of the compression resolver (resolveBasePlan in strategySelector.ts); chatCore parses the header off the wire, threads it through the existing selector signature (the same way Phase 2 threaded combos), captures the resolved { mode, source }, and emits an X-OmniRoute-Compression response header. The resolver stays pure (no DB import).
Tech Stack: TypeScript, Node test runner (node --import tsx/esm --test), better-sqlite3 (combos store, read via chatCore only), Next.js handler (open-sse/handlers/chatCore.ts).
Spec: docs/compression/2026-06-21-compression-phase3-request-header-design.md
- The resolver entry point is
resolveBasePlaninopen-sse/services/compression/strategySelector.ts. It already threads acombosmap (Phase 2). Public selectors:selectCompressionPlan/selectCompressionStrategy/getEffectiveMode, signature ending in..., combos = {}. -
DerivedPlanis defined inopen-sse/services/compression/deriveDefaultPlan.tsas{ mode: string; stackedPipeline: Array<{ engine: string; intensity?: string }> }. -
open-sse/services/compression/resolveCompressionPlan.tscarries a dormant header branch (ctx.header+ privateheaderToPlan). No caller ever passesheader(confirmed: the only.headerrefs in compression code are those lines; no test passes it). This plan removes that dormant branch so header interpretation has a single home. - Parsing precedent:
isNoMemoryRequestedinopen-sse/handlers/chatCore/headers.ts(usesgetHeaderValueCaseInsensitive). Tests for that module:tests/unit/chatcore-headers.test.ts. - A combo's
idis auuidv4()(or the seeded slugdefault-caveman);namehas no UNIQUE constraint (migration 042). Header<combo>matches name-first (Decision A). - chatCore reads request headers from
clientRawRequest?.headers. The compression block buildsnamedCombos(~line 1373) and callsselectCompressionStrategy(~line 1532) thenselectCompressionPlan(~line 1601). Response headers are built at two sites: the non-streaming JSON path (responseHeadersat ~line 4570, afterattachOmniRouteMetaHeaders) and the streaming path (responseHeadersat ~line 4697). -
Decision A:
<combo>matched name-first (lowercased), then exact id. Decision B: any valid header value bypasses auto-trigger. -
Boundary (implement as written): the master switch is the hard gate — when
config.enabledis false, the request is uncompressed regardless of the header (the header redirects among configured plans only when compression is enabled). A test asserts this.
| File | Responsibility | Change |
|---|---|---|
open-sse/services/compression/deriveDefaultPlan.ts |
DerivedPlan type + new CompressionSource union + optional source. |
Modify |
open-sse/services/compression/strategySelector.ts |
planFromHeader interpreter, header-first precedence in resolveBasePlan, source tagging, formatCompressionMeta, thread header through public selectors. |
Modify |
open-sse/services/compression/resolveCompressionPlan.ts |
Remove the dormant header branch + headerToPlan + ResolveCtx.header. |
Modify |
open-sse/handlers/chatCore/headers.ts |
resolveCompressionHeader parser. |
Modify |
src/shared/constants/headers.ts |
compression response-header name. |
Modify |
open-sse/handlers/chatCore.ts |
Parse header, add name keys to namedCombos, thread header, capture {mode,source}, emit response header at both sites. |
Modify |
tests/unit/compression/compression-header-dispatch.test.ts |
Resolver + planFromHeader + source + precedence + formatCompressionMeta. |
Create |
tests/unit/chatcore-headers.test.ts |
resolveCompressionHeader parsing. |
Modify |
docs/reference/API_REFERENCE.md, docs/compression/COMPRESSION_GUIDE.md
|
Document the header. | Modify |
config/quality/file-size-baseline.json |
Rebaseline chatCore if it crosses its pin. | Modify (if needed) |
Files:
-
Modify:
open-sse/services/compression/deriveDefaultPlan.ts -
Modify:
open-sse/services/compression/strategySelector.ts -
Modify:
open-sse/services/compression/resolveCompressionPlan.ts -
Test:
tests/unit/compression/compression-header-dispatch.test.ts(create) -
Step 1: Write the failing test
Create tests/unit/compression/compression-header-dispatch.test.ts:
import { describe, it } from "node:test";
import assert from "node:assert/strict";
import {
selectCompressionStrategy,
selectCompressionPlan,
planFromHeader,
formatCompressionMeta,
} from "../../../open-sse/services/compression/strategySelector.ts";
import {
DEFAULT_COMPRESSION_CONFIG,
type CompressionConfig,
} from "../../../open-sse/services/compression/types.ts";
const combos = {
c1: [{ engine: "rtk", intensity: "standard" }, { engine: "caveman", intensity: "full" }],
"fast combo": [{ engine: "lite" }],
};
function cfg(overrides: Partial<CompressionConfig> = {}): CompressionConfig {
return { ...DEFAULT_COMPRESSION_CONFIG, enabled: true, ...overrides };
}
// header is the 7th positional arg of selectCompressionPlan/selectCompressionStrategy.
function planWithHeader(config: CompressionConfig, header: string | null) {
return selectCompressionPlan(config, null, 0, undefined, undefined, combos, header);
}
describe("planFromHeader (Phase 3)", () => {
it("off => mode off, source request-header", () => {
const p = planFromHeader(cfg(), "off", combos);
assert.deepEqual(p, { mode: "off", stackedPipeline: [], source: "request-header" });
});
it("default => the panel-derived default, ignoring the active profile", () => {
const config = cfg({
activeComboId: "c1",
enginesExplicit: true,
engines: { rtk: { enabled: true } },
});
const p = planFromHeader(config, "default", combos);
assert.equal(p?.mode, "rtk"); // engines map default, NOT the c1 active profile
assert.equal(p?.source, "request-header");
});
it("engine:<id> => that single engine when enabled (case-insensitive)", () => {
const config = cfg({ enginesExplicit: true, engines: { rtk: { enabled: true } } });
assert.equal(planFromHeader(config, "engine:RTK", combos)?.mode, "rtk");
});
it("engine:<id> => null (fall-through) when the engine is disabled", () => {
const config = cfg({ engines: { rtk: { enabled: false } } });
assert.equal(planFromHeader(config, "engine:rtk", combos), null);
});
it("<combo> matches by name (case-insensitive) and by id", () => {
assert.deepEqual(planFromHeader(cfg(), "FAST COMBO", combos)?.stackedPipeline, combos["fast combo"]);
assert.deepEqual(planFromHeader(cfg(), "c1", combos)?.stackedPipeline, combos.c1);
});
it("unknown value => null (fall-through)", () => {
assert.equal(planFromHeader(cfg(), "nonsense", combos), null);
});
});
describe("header precedence in resolveBasePlan (Phase 3)", () => {
it("a valid header beats the active profile", () => {
const config = cfg({ activeComboId: "c1" });
assert.equal(planWithHeader(config, "off").mode, "off");
assert.equal(planWithHeader(config, "off").source, "request-header");
});
it("a valid header beats a routing-combo override", () => {
const config = cfg({ comboOverrides: { "route-x": "stacked" } });
const plan = selectCompressionPlan(config, "route-x", 0, undefined, undefined, combos, "off");
assert.equal(plan.mode, "off");
assert.equal(plan.source, "request-header");
});
it("a valid header bypasses auto-trigger (Decision B)", () => {
const config = cfg({
autoTriggerTokens: 1000,
autoTriggerMode: "aggressive",
enginesExplicit: true,
engines: { rtk: { enabled: true } },
});
// Large prompt would auto-escalate to aggressive; the header pins the panel default.
assert.equal(planWithHeader(config, "default").mode, "rtk");
});
it("an unknown header falls through to the normal resolution", () => {
const config = cfg({ activeComboId: "c1" });
const plan = planWithHeader(config, "bogus");
assert.equal(plan.mode, "stacked");
assert.equal(plan.source, "active-profile");
});
it("master-off beats the header (hard kill switch)", () => {
const config = cfg({ enabled: false, engines: { rtk: { enabled: true } } });
const plan = planWithHeader(config, "engine:rtk");
assert.equal(plan.mode, "off");
assert.equal(plan.source, "off");
});
});
describe("source on non-header paths", () => {
it("routing-override / active-profile / auto-trigger / default / off", () => {
assert.equal(
selectCompressionPlan(cfg({ comboOverrides: { r: "lite" } }), "r", 0, undefined, undefined, combos, null).source,
"routing-override"
);
assert.equal(planWithHeader(cfg({ activeComboId: "c1" }), null).source, "active-profile");
assert.equal(
planWithHeader(cfg({ autoTriggerTokens: 10, autoTriggerMode: "lite" }), null).source,
"auto-trigger"
);
assert.equal(
planWithHeader(cfg({ enginesExplicit: true, engines: { rtk: { enabled: true } } }), null).source,
"default"
);
assert.equal(planWithHeader(cfg({ enabled: false }), null).source, "off");
});
});
describe("formatCompressionMeta", () => {
it("renders '<mode>; source=<source>'", () => {
assert.equal(
formatCompressionMeta({ mode: "aggressive", stackedPipeline: [], source: "request-header" }),
"aggressive; source=request-header"
);
assert.equal(formatCompressionMeta({ mode: "off", stackedPipeline: [] }), "off; source=off");
});
});- Step 2: Run the test to verify it fails
Run: node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit tests/unit/compression/compression-header-dispatch.test.ts
Expected: FAIL — planFromHeader/formatCompressionMeta are not exported yet (import error / not a function).
- Step 3: Add
CompressionSource+sourcetoDerivedPlan
In open-sse/services/compression/deriveDefaultPlan.ts, replace the DerivedPlan interface block:
export type CompressionSource =
| "request-header"
| "routing-override"
| "active-profile"
| "auto-trigger"
| "default"
| "off";
export interface DerivedPlan {
mode: string;
stackedPipeline: Array<{ engine: string; intensity?: string }>;
/** Which precedence layer decided this plan (Phase 3 observability). Optional so
* Phase 1/2 callers and snapshots are unaffected. */
source?: CompressionSource;
}(Leave the deriveDefaultPlan function body unchanged — it does not set source; the resolver tags it.)
- Step 4: Remove the dormant header branch from
resolveCompressionPlan.ts
In open-sse/services/compression/resolveCompressionPlan.ts: delete the header field from ResolveCtx, delete the // 1. header block, and delete the private headerToPlan function. The file becomes:
import { deriveDefaultPlan, type DerivedPlan } from "./deriveDefaultPlan.ts";
export interface ResolveCtx {
comboId?: string | null;
combos?: Record<string, Array<{ engine: string; intensity?: string }>>; // named combo pipelines by id
}
export function resolveCompressionPlan(config: any, ctx: ResolveCtx): DerivedPlan {
if (config?.enabled === false) return { mode: "off", stackedPipeline: [] };
// routing-combo override
const ov = ctx.comboId ? config?.comboOverrides?.[ctx.comboId] : undefined;
if (ov) return modeToPlan(ov, config);
// active named combo
if (config?.activeComboId && ctx.combos?.[config.activeComboId]) {
return { mode: "stacked", stackedPipeline: ctx.combos[config.activeComboId] };
}
// derived default
return deriveDefaultPlan(config?.engines ?? {}, config?.enabled !== false);
}
function modeToPlan(mode: string, config: any): DerivedPlan {
return mode === "stacked"
? { mode: "stacked", stackedPipeline: config?.stackedPipeline ?? [] }
: { mode, stackedPipeline: [] };
}- Step 5: Implement
planFromHeader,withSource, header-first precedence, andformatCompressionMetainstrategySelector.ts
In open-sse/services/compression/strategySelector.ts:
(a) Update the DerivedPlan import to also bring CompressionSource:
import { deriveDefaultPlan, type DerivedPlan, type CompressionSource } from "./deriveDefaultPlan.ts";(b) Add these helpers (place them just above resolveBasePlan):
/** Tags a plan with the precedence layer that produced it (Phase 3 observability). */
function withSource(plan: DerivedPlan, source: CompressionSource): DerivedPlan {
return { ...plan, source };
}
/**
* Interprets the `x-omniroute-compression` request header into a plan, or null when the
* value is unrecognized (caller falls through to normal resolution). Pure.
* off -> no compression
* default -> the panel-derived Default (ignores active profile / routing / auto-trigger)
* engine:<id> -> that single engine, when enabled in config.engines
* <combo> -> a named combo, matched name-first (lowercased) then exact id (Decision A)
*/
export function planFromHeader(
config: CompressionConfig,
header: string,
combos: NamedCombos
): DerivedPlan | null {
const h = header.trim();
if (!h) return null;
const lower = h.toLowerCase();
if (lower === "off") return withSource({ mode: "off", stackedPipeline: [] }, "request-header");
if (lower === "default") {
// Empty combos + null comboId yields the pure panel default (no active-combo leak).
return withSource(deriveDefaultPlanFromConfig(config, null, {}), "request-header");
}
if (lower.startsWith("engine:")) {
const id = lower.slice("engine:".length);
const engine = config.engines?.[id];
return engine?.enabled
? withSource(deriveDefaultPlan({ [id]: engine }, true), "request-header")
: null;
}
const combo = combos[lower] ?? combos[h];
return combo ? withSource({ mode: "stacked", stackedPipeline: combo }, "request-header") : null;
}
/** Renders the X-OmniRoute-Compression response header value. */
export function formatCompressionMeta(plan: DerivedPlan): string {
return `${plan.mode}; source=${plan.source ?? "off"}`;
}(c) Add a header parameter to resolveBasePlan and tag every return with withSource:
function resolveBasePlan(
config: CompressionConfig,
comboId: string | null,
estimatedTokens: number,
combos: NamedCombos = {},
header: string | null = null
): DerivedPlan {
if (!config.enabled) return withSource({ mode: "off", stackedPipeline: [] }, "off");
// Phase 3: an explicit, recognized header wins over every operator layer (Decision B).
// The master switch above is the hard kill: a header cannot turn compression on.
if (header) {
const fromHeader = planFromHeader(config, header, combos);
if (fromHeader) return fromHeader; // already tagged "request-header"
}
const comboMode = checkComboOverride(config, comboId);
if (comboMode) {
return withSource(resolveCompressionPlan(config, { comboId, combos }), "routing-override");
}
if (config.activeComboId && combos[config.activeComboId]) {
return withSource(
{ mode: "stacked", stackedPipeline: combos[config.activeComboId] },
"active-profile"
);
}
if (shouldAutoTrigger(config, estimatedTokens)) {
const mode = config.autoTriggerMode ?? "lite";
return withSource(
mode === "stacked"
? { mode, stackedPipeline: config.stackedPipeline ?? [] }
: { mode, stackedPipeline: [] },
"auto-trigger"
);
}
const plan = deriveDefaultPlanFromConfig(config, comboId, combos);
return withSource(plan, plan.mode === "off" ? "off" : "default");
}(d) Thread header through the public selectors (append as the last positional arg, default null), preserving source through the caching-aware adjustment:
export function getEffectiveMode(
config: CompressionConfig,
comboId: string | null,
estimatedTokens: number,
combos: NamedCombos = {},
header: string | null = null
): CompressionMode {
return resolveBasePlan(config, comboId, estimatedTokens, combos, header).mode as CompressionMode;
}
export function selectCompressionPlan(
config: CompressionConfig,
comboId: string | null,
estimatedTokens: number,
body?: Record<string, unknown>,
context?: CachingDetectionContext,
combos: NamedCombos = {},
header: string | null = null
): DerivedPlan {
const plan = resolveBasePlan(config, comboId, estimatedTokens, combos, header);
if (body) {
const ctx = detectCachingContext(body, context);
const cacheAware = getCacheAwareStrategy(plan.mode as CompressionMode, ctx);
return { ...plan, mode: cacheAware.strategy as CompressionMode }; // ...plan preserves source
}
return plan;
}
export function selectCompressionStrategy(
config: CompressionConfig,
comboId: string | null,
estimatedTokens: number,
body?: Record<string, unknown>,
context?: CachingDetectionContext,
combos: NamedCombos = {},
header: string | null = null
): CompressionMode {
return selectCompressionPlan(config, comboId, estimatedTokens, body, context, combos, header)
.mode as CompressionMode;
}- Step 6: Run the test to verify it passes
Run: node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit tests/unit/compression/compression-header-dispatch.test.ts
Expected: PASS (all cases).
- Step 7: Run the full compression suite to confirm no regression
Run: node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit "tests/unit/compression/**/*.test.ts"
Expected: PASS — the existing 774+ compression tests stay green (resolveCompressionPlan / strategySelector / active-combo dispatch unchanged in behavior for header-less callers).
- Step 8: Commit
git add open-sse/services/compression/deriveDefaultPlan.ts \
open-sse/services/compression/strategySelector.ts \
open-sse/services/compression/resolveCompressionPlan.ts \
tests/unit/compression/compression-header-dispatch.test.ts
git commit -m "feat(compression): header-first resolver + plan source (Phase 3 core)"Files:
-
Modify:
open-sse/handlers/chatCore/headers.ts -
Test:
tests/unit/chatcore-headers.test.ts -
Step 1: Write the failing test
Append to tests/unit/chatcore-headers.test.ts (keep existing imports; add resolveCompressionHeader to the import from ../../open-sse/handlers/chatCore/headers.ts):
import { resolveCompressionHeader } from "../../open-sse/handlers/chatCore/headers.ts";
import { describe, it } from "node:test";
import assert from "node:assert/strict";
describe("resolveCompressionHeader", () => {
it("reads the raw value case-insensitively and trims it", () => {
assert.equal(resolveCompressionHeader({ "x-omniroute-compression": " engine:rtk " }), "engine:rtk");
assert.equal(resolveCompressionHeader(new Headers({ "X-OmniRoute-Compression": "off" })), "off");
});
it("returns null when absent or blank", () => {
assert.equal(resolveCompressionHeader({}), null);
assert.equal(resolveCompressionHeader({ "x-omniroute-compression": " " }), null);
assert.equal(resolveCompressionHeader(null), null);
});
});- Step 2: Run the test to verify it fails
Run: node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit tests/unit/chatcore-headers.test.ts
Expected: FAIL — resolveCompressionHeader is not exported.
- Step 3: Implement the parser
Append to open-sse/handlers/chatCore/headers.ts:
/**
* Per-request compression override via the `x-omniroute-compression` header. Mirrors the
* `x-omniroute-no-memory` convention (#4290). Returns the raw trimmed value, or null when
* absent/blank. The resolver (planFromHeader) owns interpretation and casing rules; this
* helper only reads the wire.
*/
export function resolveCompressionHeader(
headers: Record<string, unknown> | Headers | null | undefined
): string | null {
const value = (getHeaderValueCaseInsensitive(headers, "x-omniroute-compression") || "").trim();
return value || null;
}- Step 4: Run the test to verify it passes
Run: node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit tests/unit/chatcore-headers.test.ts
Expected: PASS.
- Step 5: Commit
git add open-sse/handlers/chatCore/headers.ts tests/unit/chatcore-headers.test.ts
git commit -m "feat(compression): resolveCompressionHeader parser (Phase 3)"Files:
-
Modify:
src/shared/constants/headers.ts -
Modify:
open-sse/handlers/chatCore.ts -
Step 1: Add the response-header name constant
In src/shared/constants/headers.ts, add the compression entry to OMNIROUTE_RESPONSE_HEADERS (alphabetical, after cacheHit):
cacheHit: "X-OmniRoute-Cache-Hit",
compression: "X-OmniRoute-Compression",
costSaved: "X-OmniRoute-Cost-Saved",- Step 2: Import the parser and declare the capture variable
In open-sse/handlers/chatCore.ts, line 7, add resolveCompressionHeader to the existing import from ./chatCore/headers.ts:
import { getHeaderValueCaseInsensitive, isNoMemoryRequested, resolveCompressionHeader } from "./chatCore/headers.ts";Near the other compression-scoped lets (~line 1321, beside let cavemanOutputModeApplied = false;), add:
let compressionResponseMeta: string | null = null;- Step 3: Parse the header and add name keys to the combos map
In the compression block, replace the namedCombos build (~line 1373) so it keys by both id and lowercased name, and parse the header right after:
let namedCombos: Record<string, CompressionPipelineStep[]> = {};
try {
const { listCompressionCombos } = await import("../../src/lib/db/compressionCombos.ts");
namedCombos = Object.fromEntries(
listCompressionCombos().flatMap((c) => [
[c.id, c.pipeline],
[c.name.toLowerCase(), c.pipeline],
])
);
} catch (err) {
log?.debug?.(
"COMPRESSION",
"Named combos load skipped: " + (err instanceof Error ? err.message : String(err))
);
}
// Phase 3: per-request override. Unknown values fall through in the resolver (never error).
const compressionHeader = resolveCompressionHeader(clientRawRequest?.headers ?? null);
if (compressionHeader) {
log?.debug?.("COMPRESSION", `x-omniroute-compression header: ${compressionHeader}`);
}(The active-profile lookup keys on config.activeComboId, always a UUID/slug id, so the added name keys are inert for it — one map serves both paths. A combo named off/default cannot be selected by name because the keyword branches run first; note this in the docs.)
- Step 4: Thread the header into both selector calls
Append compressionHeader as the last argument to selectCompressionStrategy (~line 1532) and selectCompressionPlan (~line 1601):
const modeBeforeOutputTransform = selectCompressionStrategy(
config,
compressionComboKey,
estimatedTokens,
body as Record<string, unknown>,
{ provider, targetFormat, model: effectiveModel },
namedCombos,
compressionHeader
); const compressionPlan = selectCompressionPlan(
config,
compressionComboKey,
estimatedTokens,
compressionInputBody,
{ provider, targetFormat, model: effectiveModel },
namedCombos,
compressionHeader
);- Step 5: Capture the resolved meta
Immediately after const mode = compressionPlan.mode as CompressionConfig["defaultMode"]; (~line 1609), add (importing formatCompressionMeta alongside the other strategySelector imports destructured at ~line 1349):
compressionResponseMeta = formatCompressionMeta(compressionPlan);And add formatCompressionMeta to the destructured import from ../services/compression/strategySelector.ts (~line 1349):
formatCompressionMeta,- Step 6: Emit the response header at both build sites
In the non-streaming JSON path, right after the attachOmniRouteMetaHeaders(responseHeaders, { ... }); call (~line 4582), add:
if (compressionResponseMeta) {
responseHeaders[OMNIROUTE_RESPONSE_HEADERS.compression] = compressionResponseMeta;
}In the streaming path, right after the responseHeaders object literal closes (the }; after "x-omniroute-request-id": pendingRequestId,, ~line 4708), add:
if (compressionResponseMeta) {
responseHeaders[OMNIROUTE_RESPONSE_HEADERS.compression] = compressionResponseMeta;
}- Step 7: Typecheck + cycle/source guard
Run: npm run typecheck:core
Expected: exit 0.
Run: npm run check:cycles
Expected: exit 0 — the resolver still has no src/lib/db import (header logic lives in strategySelector.ts, name-key map built in chatCore).
- Step 8: Run the compression suite again (no regression from wiring)
Run: node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit "tests/unit/compression/**/*.test.ts"
Expected: PASS.
- Step 9: Commit
git add src/shared/constants/headers.ts open-sse/handlers/chatCore.ts
git commit -m "feat(compression): wire x-omniroute-compression header + response header (Phase 3)"Files:
-
Modify:
docs/reference/API_REFERENCE.md -
Modify:
docs/compression/COMPRESSION_GUIDE.md -
Modify (if needed):
config/quality/file-size-baseline.json -
Step 1: Document the header (API reference)
In docs/reference/API_REFERENCE.md, near the x-omniroute-no-memory documentation, add a section:
### `x-omniroute-compression`
Per-request override of the compression plan. Highest precedence — beats the routing-combo
override, the active profile, auto-trigger, and the panel Default. Values:
| Value | Effect |
|-------|--------|
| `off` | No compression for this request. |
| `default` | The panel-derived Default profile (ignores the active profile). |
| `engine:<id>` | A single engine when enabled, e.g. `engine:rtk`. |
| `<combo>` | A named combo, matched by name (case-insensitive) first, then by id. |
Unknown values are ignored (the request is never rejected). The applied plan is echoed in the
`X-OmniRoute-Compression: <mode>; source=<source>` response header. The master compression
switch is a hard gate: when compression is disabled globally, this header cannot enable it.- Step 2: Document the header (compression guide)
In docs/compression/COMPRESSION_GUIDE.md, add a short "Per-request override" subsection mirroring the table above (one paragraph + the value table). Keep all angle-bracket tokens (engine:<id>, <combo>, <mode>, <source>) inside backticks or fenced blocks (the docs are MDX-compiled).
- Step 3: Validate MDX
Run: npx fumadocs-mdx
Expected: [MDX] generated files in ...ms with no error.
- Step 4: Reconcile the file-size baseline if chatCore crossed its pin
Run: npm run check:file-size
Expected: PASS. If it reports chatCore.ts: <actual> > <frozen>, update config/quality/file-size-baseline.json: set the open-sse/handlers/chatCore.ts frozen value to the reported <actual> and add a justification key:
"_rebaseline_2026_06_21_phase3_request_header": "Compression Phase 3 (x-omniroute-compression per-request header) own growth: chatCore.ts <frozen>-><actual> at the existing compression-dispatch chokepoint — parse the header (resolveCompressionHeader), add lowercased-name keys to the namedCombos map, thread it as the new last arg to selectCompressionStrategy + selectCompressionPlan, capture formatCompressionMeta(compressionPlan) into compressionResponseMeta, and emit X-OmniRoute-Compression at the two response-header build sites. The resolver stays pure (planFromHeader + source live in open-sse/services/compression/strategySelector.ts, <cap). Cohesive wiring at the existing chokepoint, mirroring the Phase 2 rebaseline; not extractable without hiding the dispatch boundary. Structural shrink of chatCore.ts tracked in #3501. Covered by tests/unit/compression/compression-header-dispatch.test.ts + tests/unit/chatcore-headers.test.ts.",- Step 5: Run the full validation suite
Run each and confirm exit 0 / green:
npm run typecheck:corenpm run lintnpm run check:cyclesnpm run check:file-size-
node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit "tests/unit/compression/**/*.test.ts"(compression subset) andtests/unit/chatcore-headers.test.ts -
npm run test:unit(full unit suite — confirms no cross-suite regression)
Expected: all green. Fix any red before proceeding (do not weaken assertions to pass).
- Step 6: Commit
git add docs/reference/API_REFERENCE.md docs/compression/COMPRESSION_GUIDE.md config/quality/file-size-baseline.json
git commit -m "docs(compression): document x-omniroute-compression header + file-size reconcile (Phase 3)"1. Spec coverage
- §2 contract (off/default/engine/combo, unknown→ignore) → Task 1
planFromHeader+ tests. ✓ - Decision A (name-first combo) → Task 1
planFromHeadercombos[lower] ?? combos[h]+ Task 3 name-key map + test. ✓ - Decision B (bypass auto-trigger) → Task 1 header-first in
resolveBasePlan+ test. ✓ - §3.1 precedence (header at top) → Task 1 + precedence tests (beats active profile, routing override). ✓
- §3.2
sourceonDerivedPlan→ Task 1CompressionSource+withSource+ tests. ✓ - §3.3 parser → Task 2. ✓
- §3.4 threading + id+name map → Task 1 (signatures) + Task 3 (chatCore). ✓
- §4 response header → Task 3 (constant + two sites) +
formatCompressionMetatest. ✓ - §5 error handling (unknown→fall-through; disabled engine→fall-through; unconditional gating; master-off boundary) → Task 1 tests. ✓
- §7 testing (parser, resolver, precedence, source guard) → Tasks 1, 2; source guard via
check:cyclesin Task 3/4. ✓ - §8 scope (no bare modes, no UI, chat-only) → respected;
planFromHeaderrejects bare mode names (they fall through), no UI/non-chat files touched. ✓
2. Placeholder scan: none — every code step shows full code; commands have expected output.
3. Type consistency: DerivedPlan/CompressionSource defined in Task 1 Step 3, imported in Step 5; planFromHeader(config, header, combos) / formatCompressionMeta(plan) signatures identical across Task 1 tests, implementation, and Task 3 usage; resolveCompressionHeader(headers) identical in Task 2 and Task 3; header appended as the same final positional arg in getEffectiveMode/selectCompressionPlan/selectCompressionStrategy. ✓
Note for the implementer: typecheck:core, check:cycles, check:file-size, and lint are existing package.json scripts. There is no test:compression script — run the compression subset directly with the node test runner (node --import tsx --import ./open-sse/utils/setupPolyfill.ts --import ./tests/_setup/isolateDataDir.ts --test --test-force-exit "tests/unit/compression/**/*.test.ts"), or run the whole unit suite with npm run test:unit. Confirm any other script name in package.json before running.
OmniRoute · Website · npm · Docker Hub
- Setup Guide
- User Guide
- Features
- Quick Start (Docker)
- Electron Desktop App
- Termux (Android)
- PWA Guide
- MCP Server
- A2A Server
- Agent Protocols
- OpenCode Plugin
- Webhooks
- Cloud Agents
- Skills
- Memory
- Evals
- Gamification
- Guardrails
- Compliance
- Error Sanitization
- Public Credentials
- Route Guard Tiers
- Stealth Guide
- CLI Token Auth