Scraping the Mathematical Genealogy Project for a scholar's ancestry
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
mgpspider
.gitignore
README.md
plot_ancestors.R
scrapy.cfg

README.md

mgpancestry

Scraping Mathematical Genealogy Project for a scholar's ancestry. This project is inspired from mgptree, which scraps the Mathematical Genealogy Project for a scholar's descendence.

Uses scrapy and python (2.7.10, in case a bug is found).

To run, clone this directory and run

scrapy crawl ancestryspider -a root=202169 -o output.json

where output.json is a stand-in for the output file and 202169 is a stand-in for the id of the mathematician at the root of the tree to crawl from.

Please respect the robots.txt file for the Mathematical Genealogy Project when using this web scraper.