Skip to content

nmichalov/DistIndexer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DistCrawler

Distributed web crawling and indexing with python and Pyro4.

DistCrawler is divided into three parts.

CrawlDirector Which handles URL updating and delegation

DistCrawler the actual web crawler meant to be run on some set of remote servers

DataReduce currently inaccurately named, as it actually stores the data remotely on the servers running instances of DistCrawler

About

Python distributed web crawling and indexing.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages