DontFeedTheAI

A transparent proxy that strips IPs, credentials, hostnames, and PII from every request before it reaches the AI — and restores them on the way back.

flowchart TD
    shell["🖥️ Your Shell\nnmap -sV dc01.acmecorp.local"]
    proxy["🛡️ DontFeedTheAI\ndc01.acmecorp.local → srv-0042.pentest.local\n10.20.0.10 → 203.0.113.47\nAdmin@Acme2024! → [CRED_XK9A2B3C]"]
    api["☁️ LLM API\nsees only\nsrv-0042.pentest.local\n203.0.113.47"]

    shell -- "① real data" --> proxy
    proxy -- "② surrogates only" --> api
    api -- "③ response + surrogates" --> proxy
    proxy -- "④ real data restored" --> shell

Layer	Detects
🧠 Ollama (local LLM)	hostnames, org names, credentials in prose
🔍 Regex	IPs, hashes, tokens, API keys

Both run on your machine. Nothing sensitive crosses the boundary.

Who	How it helps
Pentesters	Run nmap, mimikatz, bloodhound output through Claude without exposing client infrastructure
Developers & SREs	Debug with production data or internal configs in regulated environments
Legal & consulting	Anonymize client contracts, case files, or proprietary IP in AI-assisted reviews
Finance & compliance	Analyze reports or audit scripts without exposing account details
Researchers	Query LLMs on confidential datasets

Why not just send data directly?

❌ Cloud anonymization API + LLM — two bills, two third parties. Your sensitive data still leaves the machine, just through more hands.

flowchart LR
    s0["🖥️ Your Shell\nreal data"] --> a0["☁️ Anonymization API\nsees everything\nbill #1"]
    a0 --> c0["☁️ LLM API\nbill #2"]

❌ Ollama alone — your data never leaves the machine, but Ollama has no awareness of what's sensitive. It reasons on whatever you paste: real IPs, real credentials, real hostnames.

flowchart LR
    s1["🖥️ Your Shell\nreal data"] --> o1["🧠 Ollama\nno interception\nreasons on real data"]

❌ Claude / OpenAI directly — best reasoning quality, but everything lands in their infrastructure. Real client IPs, credentials, org names in API logs — one policy change or breach away from a problem.

flowchart LR
    s2["🖥️ Your Shell\nreal data"] --> c1["☁️ LLM API\nsees everything\nlogs your real data"]

✅ DontFeedTheAI — cloud reasoning quality, local detection, nothing sensitive crosses the boundary. Works with Claude Code, OpenAI SDK, OpenRouter, or any OpenAI-compatible client.

flowchart LR
    s3["🖥️ Your Shell\nreal data"] --> p["🛡️ DontFeedTheAI"]
    o2["🧠 Ollama\nlocal detector\nnever leaves machine"] --> p
    p --> c2["☁️ LLM API\nsees only surrogates"]

→ See docs/architecture.md for the full technical breakdown. For supported LLM clients and upstream configuration, see docs/providers.md.

Quick Start

With a VPS (recommended for team use or persistent engagements):

git clone https://github.com/zeroc00I/DontFeedTheAI
cd DontFeedTheAI
python3 wizard.py

The wizard asks everything — engagement name, VPS address, model — then deploys, opens the SSH tunnel, and launches Claude with the proxy active.

Locally without a VPS:

python3 wizard.py setup       # create venv + install dependencies
python3 wizard.py docker up   # start proxy + Ollama in Docker
export ANTHROPIC_BASE_URL=http://localhost:8080
export ENGAGEMENT_ID=my-engagement
claude                        # or any OpenAI-compatible client

Works on Windows, macOS, and Linux.

python3 wizard.py --help   # all available commands

Docs

Doc	About
Architecture	Two-layer pipeline, what gets anonymized and what doesn't, config reference
Providers	Supported LLM clients: Claude Code, OpenAI SDK, OpenRouter
Contributing	How to add fixtures, run the improvement loop, open areas
Threat Model	What this protects against, what it doesn't, limitations, roadmap

Verifying coverage & contributing improvements

Two tools ship with DontFeedTheAI to help you validate coverage and extend it.

Visual audit — open in browser while the proxy is running:

python3 wizard.py tunnel --audit

Shows every ORIGINAL → SURROGATE mapping logged during the session, filterable by entity type (DOMAIN, CREDENTIAL, TOKEN, HASH…) with per-request timing breakdown. Use it to spot leaks at a glance instead of grepping logs.

The audit page is a debug tool. It exposes the full surrogate → original lookup table, which is why it only runs behind the SSH tunnel. Making this write-only (no reverse lookup over HTTP) is on the roadmap — see Threat Model.

Testing the full pipeline — requires Ollama running:

python3 wizard.py test --integration

Runs all 53 fixtures through the complete pipeline (LLM + regex) and asserts zero leaks. Without --integration, the LLM is mocked and only the regex layer is validated — useful for fast iteration but not a substitute for the full run.

Auto-improvement loop — regex layer only, no Ollama required:

python3 wizard.py improve --cycles 3

Runs all fixtures through the regex layer, reports leaks and false positives, and tells you exactly which strings slipped through. The contribution cycle is: add a fixture for a real tool you use → run the loop → add a regex pattern for each leak → repeat. See Contributing.

The two commands complement each other: improve tightens the regex floor fast; test --integration confirms the full pipeline holds.

A note from the author

I'm a pentester, not a software architect.

This wasn't built to be innovative — there are already cloud APIs that do LLM-based anonymization. But that means sending your data to yet another third party, and I refuse. If you work in security, you already know why.

I built this so the architecture would be available to everyone, and so the community could help expand its effectiveness for free. You're paying for context processing — the AI doesn't need your real data for that.

— zeroc00I

Star History

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github		.github
docs		docs
scripts/dev		scripts/dev
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.native-ollama.yml		docker-compose.native-ollama.yml
docker-compose.vps.yml		docker-compose.vps.yml
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
wizard.py		wizard.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DontFeedTheAI

Why not just send data directly?

Quick Start

Docs

Verifying coverage & contributing improvements

A note from the author

Star History

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DontFeedTheAI

Why not just send data directly?

Quick Start

Docs

Verifying coverage & contributing improvements

A note from the author

Star History

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages