Leagify/colly-draft-prospects

colly-draft-prospects

Source code for web scraping football draft prospects from the DraftTek NFL Big Board.

The scraper is written in Go and uses the Colly scraping framework. The binary in the repo is compiled for Linux, but the source can be recompiled for a different operating system if needed.

Once the ranks have been scraped, I use csvkit to merge the scraped files and join them with information about the locations of the schools. The csvkit commands are in the csvkitcommands.txt file.

Once the ranks have been assembled, I use OpenRefine to clean the data for consistency. The data cleaning steps are contained in openRefineDataMerge.json.