mulder

Mulder is an MCP server for digital forensics. It gives an AI agent the ability to create investigation cases, run forensic tools (Volatility 3, Sleuthkit, Plaso, Hayabusa, YARA, and more), index evidence into a searchable SQLite database, submit provenance-tracked findings, and generate investigation reports.

Example reports: NIST Data Leakage · National Gallery DC

Features

🔌 MCP protocol for connecting to AI clients (tested with Claude Code; should work with any MCP-compatible client)
🧰 80+ forensic tools exposed as MCP tool calls covering memory, disk, timeline, Windows event logs, YARA, network capture, mobile, and more
🗄️ Per-case SQLite database with FTS5 full-text search across all indexed evidence
📜 Append-only audit log that records every tool invocation; findings must cite real tool call IDs to prevent hallucinated evidence citations
🔗 Cross-source correlation to join evidence from different artifact types within a time range
📊 Report generation producing both Markdown and styled HTML reports with IOC tables, MITRE ATT&CK coverage, and full audit trails
⚙️ Resource throttling with configurable memory and CPU limits so extractions do not overwhelm the host
⚡ Parallel extraction with a configurable worker pool and a run_parallel meta-tool for batch dispatch

Example Output

From the agent's live terminal during a NIST insider threat investigation:

● BOMBSHELL: Informant's Downloads folder contains:
  - googledrivesync.exe + Zone.Identifier (downloaded from internet!)
  - icloudsetup.exe + Zone.Identifier (also downloaded from internet!)

  Multi-vector exfiltration: USB drives (×2), CD-R burn, Google Drive cloud
  sync, and possibly iCloud!

● SMOKING GUN — Browser Search Queries Show Premeditation:
  search?q=anti-forensic+tools      (n=85)
  search?q=ccleaner                 (n=65)
  search?q=cd+burning+method        (n=64)
  search?q=external+device+forensics (n=65)
  search?q=DLP+DRM                  (n=90)
  search?q=e-mail+investigation     (n=88)

  The informant researched how to cover their tracks AND how forensic
  investigations work. This is deliberate, premeditated data theft.

● EXPLOSIVE FIND: LNK shows network share accessed:
  \\10.11.11.128\secured_drive\Secret Project Data\final
  on 2015-03-22T14:52:21Z (drive V:).

  This is the server where the secret project files were stored!

From a single Sonnet run: 14 findings, 9 critical, 34 minutes. Report with narrative, IOCs, and MITRE ATT&CK mappings generated automatically.

See examples/ for reports from multiple forensic datasets with ground truth comparisons, including runs on both Opus and Sonnet.

🚀 Getting Started

Docker/Podman

The container image comes with all forensic tools, dependencies, and Claude Code pre-installed. Mulder is already registered as an MCP server in the container, so Claude Code can use it immediately.

docker pull ghcr.io/calebevans/mulder:1.0

Running the Container

The container expects three volume mounts:

Mount	Purpose
`/evidence`	Your evidence directory (mount read-only with `:ro`)
`/root/.mulder/cases`	Case databases, audit logs, and generated reports (persisted to host)
`/root/.claude`	Claude Code configuration and session data

With an Anthropic API key:

mkdir -p ~/mulder-cases

docker run -it --privileged \
  -v /path/to/evidence:/evidence:ro            `# evidence directory (read-only)` \
  -v ~/mulder-cases:/root/.mulder/cases        `# case DBs, audit logs, reports` \
  -v ~/.claude:/root/.claude                   `# Claude Code config and sessions` \
  -e ANTHROPIC_API_KEY=$ANTHROPIC_API_KEY \
  ghcr.io/calebevans/mulder:1.0

With Google Cloud Vertex AI:

mkdir -p ~/mulder-cases

docker run -it --privileged \
  -v /path/to/evidence:/evidence:ro            `# evidence directory (read-only)` \
  -v ~/mulder-cases:/root/.mulder/cases        `# case DBs, audit logs, reports` \
  -v ~/.claude:/root/.claude                   `# Claude Code config and sessions` \
  -e CLAUDE_CODE_USE_VERTEX=1 \
  -e CLOUD_ML_REGION=us-east5 \
  -e ANTHROPIC_VERTEX_PROJECT_ID=your-gcp-project-id \
  -e GOOGLE_APPLICATION_CREDENTIALS=/tmp/gcloud-creds.json \
  -v ~/.config/gcloud/application_default_credentials.json:/tmp/gcloud-creds.json:ro `# GCP credentials` \
  ghcr.io/calebevans/mulder:1.0

With Amazon Bedrock:

mkdir -p ~/mulder-cases

docker run -it --privileged \
  -v /path/to/evidence:/evidence:ro            `# evidence directory (read-only)` \
  -v ~/mulder-cases:/root/.mulder/cases        `# case DBs, audit logs, reports` \
  -v ~/.claude:/root/.claude                   `# Claude Code config and sessions` \
  -e CLAUDE_CODE_USE_BEDROCK=1 \
  -e AWS_REGION=us-east-1 \
  -e AWS_ACCESS_KEY_ID=$AWS_ACCESS_KEY_ID \
  -e AWS_SECRET_ACCESS_KEY=$AWS_SECRET_ACCESS_KEY \
  ghcr.io/calebevans/mulder:1.0

The container starts Claude Code directly. Once inside, use the /investigate slash command to begin:

/investigate /evidence/case-2025-001

Point it at the directory where your evidence is mounted. The directory can contain archives (zip, 7z, gz, tar, tar.gz, etc.) — the agent will automatically extract them into a temporary directory and read from there.

Case databases and reports are written to the mounted ~/mulder-cases directory on the host.

CLI Reference

`mulder serve`

Starts the MCP server. Normally you do not need to run this manually; the MCP client configuration handles it.

Option	Default	Description
`--case-id`	None	Pre-load an existing case on startup
`--db-dir`	`~/.mulder/cases`	Directory for per-case databases and audit logs
`--transport`	`stdio`	MCP transport (`stdio` or `streamable-http`)
`--workers`	`8`	Number of parallel extraction workers
`--mem-limit`	`90`	Memory usage % threshold; tools wait when exceeded (0 to disable)
`--cpu-limit`	`90`	CPU usage % threshold; tools wait when exceeded (0 to disable)

`mulder report <case_id>`

Generates reports offline without starting the MCP server.

Option	Default	Description
`--db-dir`	`~/.mulder/cases`	Directory containing case databases

Reads {case_id}.db and {case_id}.audit.jsonl from the database directory and writes {case_id}.report.md and {case_id}.report.html alongside them.

🔬 Supported Forensic Tools

Tool	Description
Volatility 3	Memory forensics framework for analyzing RAM dumps
Sleuthkit	Disk image analysis, filesystem listing, file extraction, and MAC timelines
Plaso	Super-timeline generation from disk images and log artifacts
Hayabusa	Windows event log threat hunting with Sigma rules
YARA	Pattern matching across files, memory dumps, and Volatility output
bulk_extractor	Carves emails, URLs, credit card numbers, and other IOCs from raw data
Eric Zimmerman tools	Windows artifact parsers (Prefetch, Amcache, ShimCache, Jump Lists, LNK, Shellbags, SRUM, MFT, USN Journal)
RegRipper	Windows registry hive parsing
python-evtx	Windows EVTX event log parsing and indexing
foremost	File carving from disk images
Scalpel	File carving and recovery
PhotoRec	File recovery from disk images
Binwalk	Firmware and embedded file analysis
ClamAV	Malware scanning
ExifTool	File metadata extraction
ssdeep	Fuzzy hashing for file similarity
hashdeep	Recursive cryptographic hashing
tshark	Network capture (PCAP) analysis
chkrootkit	Rootkit detection
steghide / stegdetect	Steganography detection and extraction
strings	Extract printable strings from binary files
pasco	Internet Explorer history parsing
Hindsight	Chrome/Chromium browser forensics (history, cookies, downloads, cache)
MVT	Mobile Verification Toolkit for spyware detection (Pegasus, Predator)
radare2	Binary analysis and reverse engineering for malware triage
dislocker / libbde	BitLocker volume decryption and metadata extraction
libfvde	Apple FileVault encryption metadata extraction
tcpflow / tcpxtract	TCP stream reconstruction and file extraction from PCAPs

📄 Report Generation

Mulder generates two report formats from the case database and audit log:

Markdown ({case_id}.report.md) for plain-text review and version control
HTML ({case_id}.report.html) a self-contained styled page with dark/light theme, sidebar navigation, and interactive layout

Both formats include an executive summary, severity overview, evidence integrity hashes, attack timeline, detailed findings with MITRE ATT&CK mappings, IOC tables (network, file, email), audit metrics, and a sources appendix.

Reports can be generated in two ways:

MCP tool: call finalize_report while a case is loaded in the server
CLI: run mulder report <case_id> offline without starting the server

🏗️ Architecture

See docs/architecture.md for a detailed technical overview of the server internals, data model, tool execution model, and evidence pipeline.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
.claude		.claude
.github		.github
docs		docs
examples		examples
scripts		scripts
src/mulder		src/mulder
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.mcp.json		.mcp.json
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mulder

Features

Example Output

🚀 Getting Started

Docker/Podman

Running the Container

CLI Reference

`mulder serve`

`mulder report <case_id>`

🔬 Supported Forensic Tools

📄 Report Generation

🏗️ Architecture

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

mulder

Features

Example Output

🚀 Getting Started

Docker/Podman

Running the Container

CLI Reference

mulder serve

mulder report <case_id>

🔬 Supported Forensic Tools

📄 Report Generation

🏗️ Architecture

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

`mulder serve`

`mulder report <case_id>`

Packages