This is an HTTP REST API and command line program to gather and analyze data using web-crawling via TOR. The program is meant to be used in tandem with TorBot, but the API and CLI can be run separately.
-h
SOCKS5 proxy host, defaults to localhost-p
SOCKS5 proxy port, defaults to 9050
-server
Starts HTTP server that provides a REST API to the crawling mechanisms- Current crawling mechanisms include:
- Building relationship tree of links where children nodes represents links that can be found on a website
- Getting the IP of the current Tor client
- Retrieving phone numbers found on websites
- Retrieving emails found on websites
-d
Searching for children nodes of links, defaults to 1-o
Output destination, supported formats include:terminal
(tree is printed directly to terminal)excel
results are saved to.xlsx
file in current directoryjson
results are saved to.json
file in current directory