darkWebBot

Dark Web Crawler for crawling the hidden onion sites and indexing them in Solr

Requirements -

Make sure all the above are installed before running the project.

Before Proceeding -

Solr dependencies -

Create a new core
Make sure no data in core - http://localhost:8983/solr/{core_name}/update?stream.body=%3Cdelete%3E%3Cquery%3E*:*%3C/query%3E%3C/delete%3E&commit=true
cd {solr_dir}/server/solr/{core_name}/conf/
mv managed-schema schema_backup.xml
mv solrconfig.xml solrconfig_backup.xml
cp darkWebBot/managed-schema .
cp darkWebBot/solrconfig.xml .
cp darkWebBot/stopwords_en.txt .
restart solr

Polipo dependencies -

To run -

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
darkWebCrawler		darkWebCrawler
.gitignore		.gitignore
LICENCE.rtf		LICENCE.rtf
README.md		README.md
config		config
managed-schema		managed-schema
solrconfig.xml		solrconfig.xml
stopwords_en.txt		stopwords_en.txt

Provide feedback