Pentest Crew

Multi-agent web pentest pipeline built with CrewAI, Burp Suite MCP, and the Autorize extension.

This project is designed for post-browsing analysis. You test the target manually first, let Burp capture HTTP history, then run this crew to triage requests, validate selected candidates, review evidence, and generate a report.

What It Does

The pipeline is sequential:

http_analyst Reads Burp HTTP/WebSocket history, runs regex searches, reviews scanner issues, maps candidates to WSTG-aligned categories, and routes them to an executable validation action.
validation_executor Replays requests with Burp MCP tools, performs targeted request mutation, Collaborator checks, and Autorize-style session swap checks.
lead_pentester Reviews the evidence, rejects weak claims, assigns CVSS, and writes technical impact and remediation content.
report_generator Produces the final Markdown pentest report.

Current Burp Tooling Model

This repository is aligned to the Burp MCP capabilities available in the connected Burp environment:

Proxy and review:
- get_proxy_http_history
- get_proxy_http_history_regex
- get_proxy_websocket_history
- get_proxy_websocket_history_regex
- get_scanner_issues
- output_project_options
- output_user_options
- set_proxy_intercept_state
Request execution:
- send_http1_request
- send_http2_request
- create_repeater_tab
- send_to_intruder (payloads are forwarded to the MCP server)
- get_active_editor_contents
- set_active_editor_contents
Collaborator and helpers:
- generate_collaborator_payload
- get_collaborator_interactions
- poll_collaborator_with_wait (wait duration configurable via COLLABORATOR_WAIT_SECS env var)
- generate_random_string
- base64_encode / base64_decode
- url_encode / url_decode
Autorize-style wrappers:
- autorize_check
- autorize_multi_role_check

Important limitations:

send_to_intruder should be treated as a handoff/setup action for manual Intruder review, not as a full automated fuzzing engine with result harvesting.
Findings are only as good as the Burp history and scope you prepared beforehand.
If Burp scope is empty or history is empty, the analyst reports that cleanly instead of inventing findings.
The Autorize wrapper tools perform session-swap testing via send_http1_request; they require you to capture and supply the relevant session tokens yourself.

Architecture

[Burp HTTP History + Scanner + Scope]
               |
               v
  [Agent 1: http_analyst]
      - scope confirmation
      - history triage
      - regex search
      - scanner cross-reference
      - action routing
               |
               v
  [Agent 2: validation_executor]
      - HTTP/1.1 replay
      - HTTP/2 replay
      - repeater setup
      - intruder handoff
      - collaborator checks
      - autorize session-swap checks
               |
               v
  [Agent 3: lead_pentester]
      - QA gate
      - evidence review
      - CVSS scoring
      - remediation writing
               |
               v
  [Agent 4: report_generator]
      - final Markdown report

Project Structure

pentest_crew/
├── .env.example
├── .gitignore
├── Guideline.md
├── README.md
├── pyproject.toml
├── src/
│   └── pentest_crew/
│       ├── main.py
│       ├── crew.py
│       ├── config/
│       │   ├── agents.yaml
│       │   └── tasks.yaml
│       └── tools/
│           ├── __init__.py
│           ├── autorize_tools.py
│           ├── burp_collaborator_tools.py
│           ├── burp_mcp_client.py
│           ├── burp_proxy_tools.py
│           └── burp_request_tools.py
└── tests/
    ├── __init__.py
    ├── test_autorize_tools.py
    ├── test_burp_request_tools.py
    └── test_main.py

Requirements

1. Python

Python >=3.10,<3.14

Install dependencies:

python -m venv .venv
source .venv/bin/activate  # Linux/macOS
# .venv\Scripts\activate    # Windows
pip install -e .

2. Burp Suite

Recommended setup:

Burp Suite Professional or Community
Burp MCP extension loaded
Autorize extension loaded
Proxy listener running
Project scope configured before analysis

The connected Burp instance used during development had:

MCP Server extension loaded
Autorize extension loaded
Proxy listener on 127.0.0.1:8080
HTTP/2 enabled

3. Environment Variables

cp .env.example .env
# then edit .env with your real API keys and engagement settings

# LLM API Keys
# Set at least one key. One key runs single-agent mode.
# Two or three keys run multi-agent mode with fallback for missing role-preferred providers.
GOOGLE_API_KEY=your_gemini_key_here
OPENAI_API_KEY=your_openai_key_here
ANTHROPIC_API_KEY=your_anthropic_key_here

# Burp MCP
BURP_MCP_HOST=127.0.0.1
BURP_MCP_PORT=9876

# Engagement
ENGAGEMENT_ID=ENG-2026-001
TARGET_URL=https://target.example.com
CLIENT_NAME=Example Corp
TEST_TYPE=greybox
TESTER_NAME=Security Team
REPORT_OUTPUT_DIR=./reports

# Optional tuning
COLLABORATOR_WAIT_SECS=30

4. Running the Crew

# Via main.py (recommended — handles report path dynamically)
python src/pentest_crew/main.py

# With inline overrides
ENGAGEMENT_ID=ENG-001 TARGET_URL=https://app.target.com python src/pentest_crew/main.py

# Via CrewAI CLI
crewai run

Expected Outputs

reports/pentest_report_<engagement_id>.md — final client-ready report
logs/pentest_crew_log.txt — crew execution audit log

Convert to PDF or DOCX if needed:

pandoc reports/pentest_report_ENG-001.md -o reports/pentest_report_ENG-001.pdf
pandoc reports/pentest_report_ENG-001.md -o reports/pentest_report_ENG-001.docx

Agent-to-Tool Mapping

`http_analyst`

output_project_options — confirm scope
get_proxy_http_history — ingest traffic
get_proxy_http_history_regex — pattern search
get_proxy_websocket_history / get_proxy_websocket_history_regex — WS traffic
get_scanner_issues — scanner cross-reference
base64_decode / url_decode — decode encoded params/tokens for analysis

`validation_executor`

send_http1_request / send_http2_request — replay with mutations
create_repeater_tab — organize tests by finding ID
send_to_intruder — handoff to Intruder for manual follow-up (payloads forwarded)
get_active_editor_contents / set_active_editor_contents — editor manipulation
generate_collaborator_payload / get_collaborator_interactions / poll_collaborator_with_wait — OOB testing
generate_random_string / base64_encode / base64_decode / url_encode / url_decode — encoding
autorize_check / autorize_multi_role_check — session-swap authorization testing
set_proxy_intercept_state — disable intercept during automated testing

`lead_pentester`

get_scanner_issues — cross-reference automated findings
get_proxy_http_history_regex — independent re-examination
get_collaborator_interactions — re-verify OOB callbacks
get_active_editor_contents — spot-check specific requests
output_project_options — verify scope compliance
base64_decode / url_decode — decode evidence tokens

`report_generator`

No Burp tools — consumes structured JSON from previous agents only.

Recommended Workflow

Configure Burp scope for the engagement.
Browse the target manually through Burp — populate HTTP history deeply.
Optionally run Burp Scanner on approved scope.
If you want access control testing, prepare Autorize with at least two sessions (victim + attacker account).
Ensure Burp intercept is disabled before running the crew.
Run the crew.
Review the generated report — validate all findings manually before delivery.

Testing

Run the test suite:

.venv/bin/python -m pytest tests/ -v

Current coverage:

Session token swap logic (cookie / bearer / custom header)
Auth header stripping and CRLF preservation
Autorize body normalization (dynamic ID/timestamp stripping)
HTTP request parsing (_split_raw_request)
HTTP/2 pseudo-header construction
Intruder payload routing
Environment variable validation and input building

Prompt and Task Design Notes

The configuration is intentionally conservative:

no forced minimum finding count
explicit handling for empty Burp scope or empty history
findings require observable evidence — no theoretical flagging
Intruder is treated as review/handoff, not automated fuzzing
unsupported cases route to MANUAL_REVIEW or NEEDS_ESCALATION
Autorize bypass detection uses relative body delta (< 2%) + structural content matching to minimize false negatives

References

Legal Notice

Use this project only for systems you are explicitly authorized to test. Unauthorized testing is illegal and unethical.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pentest Crew

What It Does

Current Burp Tooling Model

Architecture

Project Structure

Requirements

1. Python

2. Burp Suite

3. Environment Variables

4. Running the Crew

Expected Outputs

Agent-to-Tool Mapping

`http_analyst`

`validation_executor`

`lead_pentester`

`report_generator`

Recommended Workflow

Testing

Prompt and Task Design Notes

References

Legal Notice

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
src/pentest_crew		src/pentest_crew
tests		tests
.codex		.codex
.env.example		.env.example
.gitignore		.gitignore
Guideline.md		Guideline.md
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

Pentest Crew

What It Does

Current Burp Tooling Model

Architecture

Project Structure

Requirements

1. Python

2. Burp Suite

3. Environment Variables

4. Running the Crew

Expected Outputs

Agent-to-Tool Mapping

http_analyst

validation_executor

lead_pentester

report_generator

Recommended Workflow

Testing

Prompt and Task Design Notes

References

Legal Notice

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`http_analyst`

`validation_executor`

`lead_pentester`

`report_generator`

Packages