Application that crawls news websites and identifies articles that pertain to the same central story.
Python
spiders
.gitattributes
.gitignore
__init__.py
archiveSites.txt
automateSpider.py
finalClusters.txt
greatClusters.txt
gui.py
hac_cluster.py
items.py
pipelines.py
queryMatcher.py
scrapy.cfg
settings.py