A web crawler to crawl and index interesting pages on the WWW. The primary aim of this project is to build an index tree for a particular website, containing internal as well as external links, for better visualization.

deydebaditya/zulu

ZULU is a web crawler under development. Its main goal is to crawl through pages and generate an index graph of the WWW. To run ZULU, you need Python 3.4 or later.

Commands to run ZULU:

-> First, install 'requests':
pip install requests

-> Second, install 'BeautifulSoup' (the 'bs4' package):
pip install bs4

-> Finally, run the crawler:

python webCrawler.py
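
As a rough illustration of the kind of index described above, the sketch below splits the links found on a single page into internal and external ones. It is not taken from webCrawler.py: the names `LinkCollector` and `classify_links` are hypothetical, and it uses only the Python standard library rather than requests and BeautifulSoup so that it runs on its own. Fetching pages and following links recursively are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def classify_links(html, page_url):
    """Split the links on one page into internal and external lists.

    Hypothetical helper for illustration only; ZULU's own logic may differ.
    """
    collector = LinkCollector()
    collector.feed(html)

    site = urlparse(page_url).netloc
    internal, external = set(), set()
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        parts = urlparse(absolute)
        if parts.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, and similar links
        if parts.netloc == site:
            internal.add(absolute)
        else:
            external.add(absolute)
    return {"internal": sorted(internal), "external": sorted(external)}


if __name__ == "__main__":
    sample = '<a href="/about">About</a> <a href="https://example.org/x">X</a>'
    print(classify_links(sample, "https://example.com/index.html"))
```

A crawler could call such a function on each fetched page and queue the internal links for further crawling, which is one way to build up a per-site tree of links.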

Before submitting an issue, please read the Issue Guidelines in meta/ISSUE_TEMPLATE.md
