test
urlgrab @ 61cc967
.gitignore
.gitmodules
README.rst
direct.py
generate_next.py
split.py
test.py
trawl.py


Wikipedia Trawler

This started off as a tool to play Get to Philosophy, in response to http://xkcd.com/903/, but it has become something more: it no longer stops at Philosophy, and instead keeps following links until it hits a loop (which is usually at "Philosophy").
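The "keep going until it hits a loop" behaviour can be sketched roughly as follows. This is a minimal illustration, not the code in trawl.py: the function name `follow_chain`, the `first_link` callable and the toy link table are all assumptions made for the example, and no real Wikipedia pages are fetched.

```python
def follow_chain(start, first_link):
    """Follow links from `start` until a page repeats or the chain dead-ends.

    `first_link` is a hypothetical callable mapping a page title to the
    title of the next page, or None if there is no next page.
    Returns (chain, loop): the pages visited in order, and the part of the
    chain that forms the loop (empty if the chain dead-ends).
    """
    chain = []
    position = {}  # title -> index in chain, used to detect a repeat
    page = start
    while page is not None and page not in position:
        position[page] = len(chain)
        chain.append(page)
        page = first_link(page)
    # If we stopped on a repeated page, the loop starts where it first appeared.
    loop = chain[position[page]:] if page is not None else []
    return chain, loop


# Toy link table standing in for real articles (made up for illustration).
toy_links = {
    "Kitten": "Cat",
    "Cat": "Animal",
    "Animal": "Science",
    "Science": "Knowledge",
    "Knowledge": "Philosophy",
    "Philosophy": "Knowledge",
}

chain, loop = follow_chain("Kitten", toy_links.get)
print(" -> ".join(chain))
print("loop:", " -> ".join(loop))
```

Recording the index of each visited title makes it cheap to report exactly which pages form the loop, rather than only the fact that one was found.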

TODO

Import a full copy of a Wikipedia dump and find out what percentage of all articles get to Philosophy and what the largest loop is, and generally fix the "Chains" list on the main page.
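One possible shape for that analysis, assuming the dump has already been reduced to a dictionary mapping each article title to the title its chain follows next. That preprocessing, the name `chain_stats` and the toy data are all hypothetical; this is a sketch of the idea, not a planned implementation.

```python
def chain_stats(first_links, target="Philosophy"):
    """Return (percent of articles reaching `target`, largest loop found).

    `first_links` is an assumed precomputed mapping: title -> next title.
    Titles that never appear as keys are treated as dead ends.
    """
    reaches = {}        # title -> True if its chain passes through target
    largest_loop = []
    for start in first_links:
        path, seen = [], {}
        page = start
        # Walk until a dead end, a repeat, or a page we already resolved.
        while page is not None and page not in seen and page not in reaches:
            seen[page] = len(path)
            path.append(page)
            page = first_links.get(page)
        if page is None:
            hit = False                      # dead end
        elif page in reaches:
            hit = reaches[page]              # reuse an earlier result
        else:
            loop = path[seen[page]:]         # new loop discovered
            if len(loop) > len(largest_loop):
                largest_loop = loop
            hit = target in loop
        # Propagate the result backwards along the path just walked.
        for title in reversed(path):
            hit = hit or title == target
            reaches[title] = hit
    total = len(first_links)
    reached = sum(1 for title in first_links if reaches[title])
    percent = 100.0 * reached / total if total else 0.0
    return percent, largest_loop


# Toy data standing in for a real dump (made up for illustration).
toy_dump = {
    "Kitten": "Cat", "Cat": "Animal", "Animal": "Knowledge",
    "Knowledge": "Philosophy", "Philosophy": "Knowledge",
    "A": "B", "B": "C", "C": "A", "D": "A",
}
percent, largest_loop = chain_stats(toy_dump)
print(f"{percent:.1f}% of articles reach Philosophy")
print("largest loop:", " -> ".join(largest_loop))
```

Because each article has exactly one outgoing link in this model, caching the result for every title already walked means each article is visited only once, which matters at the scale of a full dump.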