Keep long AI coding sessions sharp under quota pressure.
Now also works with Cursor · Windsurf · Cline · Codex CLI · Gemini CLI
Quick Start · Multi-Agent · Why Governor · Benchmarks · Commands · Install
Governor started as a Claude Code plugin for context hygiene, tool-output filtering, memory compression, telemetry, and drift guardrails.
As of v0.2.3, Governor's core behavior — content-aware tool-output filtering and context hygiene — works with any AI coding agent via prompt-based skills. No hooks or MCP required.
Keep long coding sessions efficient without making the model dumber.
The Claude Code command namespace is /governor:*.
bash install.sh --forceRestart Claude Code, then run:
/governor:status
/governor:audit
/governor:compress CLAUDE.md
bash install.sh --project /path/to/project --agents allThis copies a Governor rules file into your project for each agent. The rules teach the agent to self-filter noisy tool output, preserve unique data, and maintain context hygiene — automatically, every session.
Governor works at two levels:
| Agent | Integration | What You Get |
|---|---|---|
| Claude Code | Plugin with hooks | Full: auto-filtering, telemetry, compression, drift guard, /governor:* commands |
| Cursor | .cursor/rules/governor.mdc |
Self-filtering, compact mode, context hygiene |
| Windsurf | .windsurf/rules/governor.md |
Self-filtering, compact mode, context hygiene |
| Cline | .clinerules/governor.md |
Self-filtering, compact mode, context hygiene |
| Codex CLI | AGENTS.md |
Self-filtering, compact mode, context hygiene |
| Gemini CLI | GEMINI.md |
Self-filtering, compact mode, context hygiene |
How it works for non-Claude agents: Governor's rules file teaches the agent to apply content-aware filtering itself. When tool output has >40% duplicate lines (test failures, log spam, build warnings), the agent compresses it — keeping the first error, file:line, and exit code. When output is unique (API responses, JSON, code), it passes through intact. No external dependencies.
Claude Code gets the deepest integration because it supports plugin hooks. Other agents get the core behavior via prompt engineering.
Long coding sessions usually do not fail because the AI writes one extra paragraph.
They fail because context gets polluted:
- verbose test, build, and MCP output floods the transcript
- bloated recurring files like
CLAUDE.md, notes, and rules tax every session - broad prompts trigger repo-wide scans and retries
- scope drift compounds over time
Governor is designed for that failure mode — in Claude Code and beyond.
| Capability | What it does | Where it works |
|---|---|---|
| Tool-output filtering | Compacts noisy output when content is repetitive; preserves unique data | All agents |
| Compact mode | Keeps responses concise and professional | All agents |
| Context hygiene | Avoids re-reads, broad scans, and context waste | All agents |
| Memory compression | Rewrites bloated prompt files into denser, safer forms | Claude Code |
| Telemetry | Reports measured savings, failures, compactions, and waste heat | Claude Code |
| Drift guardrails | Adds planning and scope checks for broad tasks | Claude Code |
Most token-saving tools optimize one layer:
- shorter replies
- shorter command output
Governor is built for the broader session problem:
- tool spam
- recurring context tax
- MCP-heavy workflows
- long-task drift
- wasted retries
That is why Governor's benchmark story starts with valid-context loss and decision preservation, not only token counts.
Recent measured Sonnet run with Claude decision grading:
| Condition | Avg token savings | Avg VCLR | Decision preserved | Wrong decision |
|---|---|---|---|---|
| Caveman | 69.1% | 0.14 | 87.5% | 12.5% |
| Governor | 45.5% | 0.00 | 100.0% | 0.0% |
What this means:
- Caveman still wins on raw compression.
- Governor preserved more valid context in this run.
- Governor won on decision quality in the latest Sonnet pass.
Artifacts:
benchmarks/v2-fixture-results.mdbenchmarks/v2-fixture-results.jsonbenchmarks/sonnet-v2-report.md
Same machine, fresh Claude CLI Sonnet sessions, same multi-turn task, same starting repo snapshot.
| Condition | Output Tokens | Cost | Turns | Intent Preserved | Obvious Regression Found |
|---|---|---|---|---|---|
| Control | 10,997 | $0.5169 | 21 | Yes | No |
| Governor | 10,113 | $0.4933 | 22 | Yes | No |
| Delta | -8.0% | -4.6% | +4.8% | Tie | Tie |
This was a narrow pilot, not a universal claim. It matters because Governor kept the implementation contract intact while shaving cost on a real multi-turn coding task.
Structured/local cases focused on the criticism that compaction can miss the real clue.
| Case | Filtered? | Blocked | Signal Preserved |
|---|---|---|---|
| Noisy pytest failure buried in long log | Yes | 64.0% | Yes |
| Burp-style MCP payload with large history + one critical finding | Yes | 90.9% | Yes |
Large Read output containing source code |
No | 0.0% | Yes |
RTK is excellent at shrinking shell output.
Governor is aimed at the wider coding session:
- tool-output filtering across all agents
- recurring prompt-file hygiene
- MCP and structured payload handling
- drift-sensitive long sessions
- measured savings and waste heat
Caveman is excellent when the main goal is making Claude talk in fewer tokens.
Governor is built for the broader session problem:
- less tool spam
- less recurring context tax
- less drift
- more decision preservation under pressure
Short version:
RTK compresses commands. Caveman compresses style. Governor protects the session.
Governor is best for:
- Claude Code Max users who hit long-session limits
- Cursor / Windsurf / Cline users who want context hygiene without switching tools
- MCP-heavy workflows (Burp, Playwright, structured tool output)
- prompt-heavy repos with large rules or command docs
- teams using multiple AI agents on the same codebase
Governor is less useful for:
- tiny chats
- already-clean prompt files
- users who only want meme-simple answer shortening
- Content-aware tool filtering: large outputs are only compacted when content is repetitive noise (>40% duplicate lines). Unique data — API responses, Burp proxy history, curl output, structured JSON — passes through unfiltered. Works in all supported agents.
- Always-on compact mode: keeps responses concise and professional. In Claude Code via hooks; in other agents via rules files.
- Professional memory compression:
/governor:compress CLAUDE.mdrewrites verbose memory files into dense prose. (Claude Code) - Protected-span safety: code blocks, inline code, paths, URLs, commands, env vars, versions, headings, tables, and warnings are preserved.
- Quality guard: low-savings compression is rejected and the backup is restored instead of pretending success.
- Inline full-output bypass: set
GOVERNOR_FULL=1as an env var in the same Bash call to skip compaction without a separate command. Immune to parallel-call cancellation. - Telemetry ledger:
/governor:statusreports blocked tokens, failures, compactions, and statusline snapshots when available. (Claude Code) - Prompt guidance: vague broad prompts get soft, non-blocking suggestions.
- Plan and drift guard: explicit contracts for broad builds, then scope
checks with
/governor:guard. (Claude Code)
Claude Code plugin commands:
| Command | Purpose |
|---|---|
/governor:on |
Enable compact professional response mode |
/governor:off |
Disable response compression |
/governor:status |
Show usage dashboard and waste heat map |
/governor:audit |
Find bloated memory/rule files and context waste |
/governor:compress CLAUDE.md |
Compress memory files with protected-span validation |
/governor:full |
Let the next diagnostic command return full output |
/governor:plan "task" |
Produce an implementation contract before broad work |
/governor:guard |
Check current changes against the approved plan |
/governor:benchmark |
Run or explain the V2 benchmark suite |
/governor:install-rules |
Copy Governor skills into other-agent projects |
gh repo clone 0xhimanshu/governor
cd governor
bash install.sh --forcebash install.sh --project /path/to/project --agents allOr install for specific agents:
bash install.sh --project . --agents cursor,windsurf
bash install.sh --project . --agents cline,codex,geminiclaude --plugin-dir ./governor:compress is automatic from the user's point of view:
- Create a timestamped backup.
- Mark protected spans.
- Rewrite the file with dense professional prose.
- Strip markers.
- Validate protected content.
- Attempt protected-span recovery if needed.
- Reject low-savings output and restore the backup if the quality guard fails.
- Report exact before/after token estimates and backup location.
Compression levels:
| Level | Target |
|---|---|
light |
Remove filler and repetition; preserve most rationale |
medium |
Collapse narrative into decision bullets |
aggressive |
Keep only rules, facts, commands, risks, and decisions |
Set GOVERNOR_ALLOW_LOW_SAVINGS=1 only if you intentionally want to keep a
low-savings compression result.
Governor stores a local JSONL ledger. The status command automatically discovers
and merges ledger files from all known locations, including the plugin data
directory set by Claude Code (CLAUDE_PLUGIN_DATA) and the manual fallback
path. Deduplication uses resolved paths so symlinks and non-canonical paths
do not cause double-counting.
It tracks:
- tool-output tokens blocked
- full-output overrides
- prompt-risk suggestions
- Bash failures
- compactions
- statusline snapshots
- memory compression savings
Use benchmarks/ for measured comparisons.
Recommended conditions:
control: no token/style plugincaveman: Caveman enabled as normalgovernor-hooks: Governor hooks enabled, memory unchangedgovernor-compressed: Governor after/governor:compress CLAUDE.mdgovernor-strict: optional strict-mode run for broad tasks
Run:
python3 scripts/run_benchmark.py \
--write-json benchmarks/v2-fixture-results.json \
--write-md benchmarks/v2-fixture-results.md- Helpful by default, strict only when invoked.
- Professional dense prose, never novelty dialect.
- Measure exact savings where possible.
- Treat 1M context as a ceiling, not a target.
- Keep broad planning and drift checks opt-in.
- Restore backups instead of accepting unsafe or low-value compression.
- If hooks do not fire,
/governor:statuswill show little or no telemetry. - Existing custom statuslines are not overwritten by the installer.
- Compression sends file content through the active Claude Code/model workflow. Do not compress secrets or sensitive private files.
- Use
/governor:fullbefore a diagnostic command when you need unfiltered logs, or prefix withGOVERNOR_FULL=1to bypass inline. - For installed-but-inactive behavior, launch Claude Code with
GOVERNOR_DEFAULT_MODE=off.
Contributions are welcome when they make Governor more useful, safer, or easier to trust. Please read CONTRIBUTING.md before opening a pull request, especially for compression, hook, telemetry, agent rules, or benchmark changes.
MIT. See LICENSE.