Premier_League_Lineup_Data

Jupyter Notebook containing a spider for betstudy.com that scrapes lineup data for each game in the Premier League.

Python Libraries Needed:

requests -- Used to get html code from the betstudy.com website

bs4 -- Used to parse the html

Python Libraries Recommended:

numpy -- Used for null values

pandas -- Used to format data into a .csv file

tqdm -- Used for crawling interface

The spider iterates over a table containing every game outcome in the 2018/19 season. For every row it iterates over it travels along a hyperlink that leads it to the lineup of each team for that specific game, aswell as the referees. The spider then iterates over the tables containg the lineup data and follows another hyperlink to the players profile page. From their the spider scrapes the players full name and position, it does this for the referees aswell but doesnt take their position.

Once the data for an iteration is collected it is appended to two dictionaries one containing the player names the other the positions. After the iterations are completed (all 380 games are collected) the dictionaries are formatted into two pandas dataframes which are then saved to .csv files. A players name in the name .csv has a position in the position .csv at the same index and column as their name.

The spider can easily scrape different seasons by changing the initial betstudy_url variable to another season.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
BetStudy_Scrape.ipynb		BetStudy_Scrape.ipynb
README.md		README.md
lineup_names.csv		lineup_names.csv
lineup_positions.csv		lineup_positions.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Premier_League_Lineup_Data

Python Libraries Needed:

Python Libraries Recommended:

About

Releases

Packages

Languages

linneytom/Premier_League_Lineup_Data

Folders and files

Latest commit

History

Repository files navigation

Premier_League_Lineup_Data

Python Libraries Needed:

Python Libraries Recommended:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages