dhoboy/scraping_retrosheet

scraping_retrosheet

A first pass at Python scripts that scrape player data from Retrosheet.

scrape_retrosheet.py

The file 'scrape_retrosheet.py' downloads every player's page from http://www.retrosheet.org/ and saves each page in either the 'pitchers' or the 'position_players' folder.

Some Retrosheet player pages have extra information at the bottom. When present, this information is saved in the 'misc_data' folder.
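
As a rough illustration of the folder layout described above, here is a minimal sketch and not the actual contents of 'scrape_retrosheet.py'. The `PITCHING_MARKER` test, the function names, the file extensions, and the player ID used in the usage note are all assumptions made for this example. The download step is left out so the sketch runs without network access.

```python
from pathlib import Path

# Assumption: a page is treated as a pitcher's page if it contains this text.
# The real script may decide this differently.
PITCHING_MARKER = "Pitching Record"


def choose_folder(page_text: str) -> str:
    """Return the name of the folder a player page should be saved in."""
    return "pitchers" if PITCHING_MARKER in page_text else "position_players"


def save_player_page(root: Path, player_id: str, page_text: str,
                     extra_text: str = "") -> Path:
    """Save a page under root/<folder>/ and any extra info under root/misc_data/."""
    folder = root / choose_folder(page_text)
    folder.mkdir(parents=True, exist_ok=True)
    page_path = folder / f"{player_id}.html"
    page_path.write_text(page_text, encoding="utf-8")

    # Only create a misc_data entry when the page actually had extra info.
    if extra_text.strip():
        misc = root / "misc_data"
        misc.mkdir(parents=True, exist_ok=True)
        (misc / f"{player_id}.txt").write_text(extra_text, encoding="utf-8")
    return page_path
```

For example, `save_player_page(Path("."), "abc001", page_text, extra_text)` would write the page into 'pitchers' or 'position_players' under the current directory, plus a 'misc_data' file if `extra_text` is non-empty ("abc001" is a made-up ID).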

form_scraped.py

The file 'form_scraped.py' creates a new directory called 'pitching_data' that contains only the regular-season pitching records, for those players who have them.
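
One way to pull a single section out of a saved page is sketched below. This is an illustration only: the marker strings and sample lines are hypothetical, and the real page layout and the logic in 'form_scraped.py' may differ.

```python
def extract_section(lines, start_marker, end_markers):
    """Return the lines from the first line containing start_marker up to,
    but not including, the next line containing any of end_markers.

    Returns an empty list when start_marker never appears, which covers
    players with no pitching record.
    """
    section = []
    inside = False
    for line in lines:
        if not inside:
            if start_marker in line:
                inside = True
                section.append(line)
        elif any(marker in line for marker in end_markers):
            break
        else:
            section.append(line)
    return section


# Made-up sample page content, for illustration only.
sample = [
    "Batting Record",
    "1990 BOS 10",
    "Pitching Record",
    "Year Team G",
    "1990 BOS 10",
    "Post-Season Pitching Record",
    "1990 BOS 1",
]
regular_season = extract_section(sample, "Pitching Record", ["Post-Season"])
```

Note that a plain substring test would also match a post-season header if it appeared first, so a real implementation would need a stricter check on the start line.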

form_scraped_further.py

The file 'form_scraped_further.py' converts the pitching files in the 'pitching_data' directory into .csv files for consumption by JavaScript.
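
The conversion step might look roughly like the sketch below, assuming each pitching record is a whitespace-separated line under a whitespace-separated header. That layout, the function name `rows_to_csv`, and the sample column names and values are assumptions, not taken from 'form_scraped_further.py'.

```python
import csv
import io


def rows_to_csv(header: str, rows) -> str:
    """Convert a whitespace-separated header and data rows into CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header.split())
    for row in rows:
        fields = row.split()
        if fields:  # skip blank lines
            writer.writerow(fields)
    return buffer.getvalue()


# Made-up sample rows, for illustration only.
csv_text = rows_to_csv("Year Team G", ["1990 BOS 10", "", "1991 NYA 12"])
```

Using the `csv` module rather than joining on commas means any field that happens to contain a comma or quote is escaped correctly, which keeps the output safe for a JavaScript CSV parser.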

JS Usage

The first JS example using one of these pitching csv files is here
