Skip to content

This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.

Notifications You must be signed in to change notification settings

null-null-programming/gotor

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gotor

Status/Social Links

CircleCI Open Source Helpers

This is an HTTP REST API and command line program to gather and analyze data using web-crawling via TOR. The program is meant to be used in tandem with TorBot, but the API and CLI can be run separately.

Flags

Configuration of Tor client

  • -h SOCKS5 proxy host, defaults to localhost
  • -p SOCKS5 proxy port, defaults to 9050

REST

  • -server Starts HTTP server that provides a REST API to the crawling mechanisms
  • Current crawling mechanisms include:
    • Building relationship tree of links where children nodes represents links that can be found on a website
    • Getting the IP of the current Tor client
    • Retrieving phone numbers found on websites
    • Retrieving emails found on websites

CLI

  • -d Searching for children nodes of links, defaults to 1
  • -o Output destination, supported formats include:
    • terminal (tree is printed directly to terminal)
    • excel results are saved to .xlsx file in current directory
    • json results are saved to .json file in current directory

How it works

Crawling drawio

About

This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 100.0%