- Introduction
- Data Wrangling
- Exploratory Data Analysis
- Conclusions
This database contains a vast amount of soccer data from 2007 to 2015 in a .sqllite file. I used DB Browser to export the needed dataset as a .csv file. There are 8 tables in this database but the scope of my analysis will be limited to two, that is, player and player attributes. With this table we are exploring the questions our scouting team asked us, using the 2016 season.
- How many of the players were left footed?, what is the proportion of left footed players comapred to the total population.
- Does height have an effect on a players jumping ability?
- The scouts are intrested in recommending Ronaldo because of his ariel threat but they want to know; Has time has affected his jumping ability?, also, determine abilities he improved on the most and area(s) he drastically declined as an attacker using the first season(2007) and last season (2015).
- Is there a correlation between weight, height and acceleration?
Table 1: Player
There are 6 columns in this dataframe.
- Player api id (serves as a primary key)
- Player name
- Player fifa api id (serves as a primary key)
- Birthday: Player's birthday
- Height: Player's height
- Weight: Player's weight
Table 2: Player Attributes
There are 41 columns in this columns. To mention a few.
- Player fifa api id (foreign key)
- Player api id (foreign key)
- Date: Year, month and day
- Overall rating: Player's rating
- Potential: Player's potential
- Preferred foot: Player's preferred foot (left or right)
- Finishing: Player's finishing ratings
- Vision: Player's vision ratings
- Penalties: Player's penalty ratings
- Marking: Player's marking ratings
1) Looking at the player dataframe, birthday column will be dropped as it is of no use to this analysis.
After data wrangling, visuals were used to make our result meaningful and easier to understand. Here, the dataframe statistics and visualization was explored with an aim to address the research questions that was posed in the Introduction section.
- How many players were left footed?, what is the proportion of left footed players comapred to the total population ?
- Does height have an effect on a players jumping ability?
- The scouting department is intrested in recommending Ronaldo because of his ariel threat but they want to know if time has affected his jumping ability?, determine this using the first season(2007) and last season (2015)
- Is there a correlation between weight, height and acceleration?
- The units of weight and height were not included in the documentation, infact there was no documentation.
- Due to missen documentation i was limited to some data sets because i couldn't understand some abbreviations in other data sets.
- The dataframe was quite large so i couldn't explore some aspects without my system hanging. This delayed the speed of this report drastically.
- The scope of this data analysis was limited to player and player attributes only.