
Small python tool to harvest onion domains and store them in a SQLite DB. Onion domains are scraped from the hiddenwiki and several clear web webpages, which list onion-domains.


jgru/onion-domain-harvester


onion-domain-harvester

The onion-domain-harvester is a Python tool that harvests onion domains and stores them in a SQLite database. Onion domains are scraped from:

  • the hiddenwiki
  • the market list of the uncensored hidden wiki
  • the market lists of deepdotweb
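As a rough illustration of the harvesting step, the sketch below pulls onion hostnames out of a fetched page with a regular expression. It is not the tool's actual parser (the project lists beautifulsoup for that). The function name `extract_onion_domains` and the pattern are assumptions made for this example. The pattern covers 16-character (v2) and 56-character (v3) base32 hostnames.

```python
import re

# Assumed pattern: v3 (56 chars) or v2 (16 chars) base32 label followed by ".onion".
ONION_RE = re.compile(r"\b(?:[a-z2-7]{56}|[a-z2-7]{16})\.onion\b", re.IGNORECASE)


def extract_onion_domains(text):
    """Return the unique onion hostnames found in text, lowercased and sorted."""
    return sorted({match.group(0).lower() for match in ONION_RE.finditer(text)})
```

Calling `extract_onion_domains(html)` on the raw HTML of a listing page returns a de-duplicated list of hostnames; titles and descriptions would still have to come from the surrounding markup.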

The resulting data consists of the URL, the corresponding title, a short description, and first-seen and last-seen timestamps. It is stored in a lightweight SQLite database. The tool is intended to run daily as a cron job.
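The record layout described above could be kept as in the following minimal sketch. It uses the standard-library `sqlite3` module rather than sqlalchemy to stay self-contained. The table name `onion_domains`, the column names, and the helper `record_sighting` are hypothetical and not taken from the project's code.

```python
import sqlite3
from datetime import datetime, timezone

# Hypothetical schema mirroring the fields named in this README.
SCHEMA = """
CREATE TABLE IF NOT EXISTS onion_domains (
    url         TEXT PRIMARY KEY,
    title       TEXT,
    description TEXT,
    first_seen  TEXT NOT NULL,
    last_seen   TEXT NOT NULL
)
"""


def record_sighting(conn, url, title, description, seen_at=None):
    """Insert a new domain, or refresh last_seen if the URL is already known."""
    seen_at = seen_at or datetime.now(timezone.utc).isoformat()
    with conn:  # commits on success, rolls back on error
        # First sighting: creates the row; ignored if the URL already exists.
        conn.execute(
            "INSERT OR IGNORE INTO onion_domains "
            "(url, title, description, first_seen, last_seen) "
            "VALUES (?, ?, ?, ?, ?)",
            (url, title, description, seen_at, seen_at),
        )
        # Every sighting: update metadata and last_seen, leave first_seen untouched.
        conn.execute(
            "UPDATE onion_domains SET title = ?, description = ?, last_seen = ? "
            "WHERE url = ?",
            (title, description, seen_at, url),
        )
```

Because `first_seen` is written only once, a daily run leaves a record of when each domain first appeared and when it was last listed, even after the service itself goes offline.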

The resulting database may be useful when assessing digital exhibits: darknet markets and other artifacts can be identified and classified even when they are no longer accessible.

Dependencies

See ./requirements.txt

Python packages

  • urllib
  • cfscrape
  • beautifulsoup
  • sqlalchemy

Other dependencies
