<h1> Weightlifting Dataset </h1>
Based on data from Summer and Winter Olympics, 1896-2022

### Data Quirks:
1) Weight classes have changed many times over Olympic history, so event titles are not uniform across Games.
2) The 2022 Beijing Games had no weightlifting events: weightlifting is a summer sport, and the most recent Games in the dataset were Winter Games.
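
One way to see the first quirk concretely is to group event titles by Games year and compare the resulting sets. The sketch below uses only the standard library and a few (year, event title) pairs copied from the medal table in this notebook. `events_by_year` is a hypothetical helper name, not part of the original analysis.

```python
from collections import defaultdict

# (year, event_title) pairs copied from rows of the medal table in this notebook
sample_rows = [
    (2020, "Men's 61kg"),
    (2020, "Women's 55kg"),
    (1896, "heavyweight - one hand lift men"),
    (1896, "heavyweight - two hand lift men"),
]


def events_by_year(rows):
    """Map each year to the set of event titles contested that year."""
    grouped = defaultdict(set)
    for year, title in rows:
        grouped[year].add(title)
    return dict(grouped)


by_year = events_by_year(sample_rows)

# Titles present in both years; an empty set means the two Games share no event names
shared = by_year[1896] & by_year[2020]

print(sorted(by_year[1896]))
print(sorted(by_year[2020]))
print(shared)
```

On the full data, something like `weightDF.groupby("year")["event_title"].unique()` should give the same view, assuming the `year` column added during cleaning.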

# Import modules
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt


# Import data & clean
# From: https://www.kaggle.com/datasets/piterfm/olympic-games-medals-19862018/
filePath = './Raw Data/Summer-Winter Olympics/'
fileName = 'olympic_medals.csv'
colHeads = ["discipline_title", "oly_game", "event_title", "event_gender", "medal_type", "participant_type",
            "participant_title", "athlete_url", "athlete_full_name", "country_name", "country_code", "country_3_letter_code"]  # column names
allSports = pd.read_csv(filePath+fileName, skiprows=1, names=colHeads)  # read in all data (every sport)

# Focus exclusively on weightlifting
weightDF = allSports[allSports['discipline_title'] == 'Weightlifting']  # all weightlifting medalists, by game (1896-2020; no weightlifting in 2022)

# Trim data of labels that are redundant for our purposes
weightDF = weightDF.drop(columns=["discipline_title", "participant_type", "participant_title", "athlete_url",
                                  "country_code", "country_3_letter_code"])

# Add explicit year column (to help in data analysis later)
weightDF = weightDF.assign(year=weightDF["oly_game"].str[-4:].astype(int))  # year = last four characters of 'oly_game'
display(weightDF)

| | oly_game | event_title | event_gender | medal_type | athlete_full_name | country_name | year |
|---|---|---|---|---|---|---|---|
| 1049 | tokyo-2020 | Men's 61kg | Men | SILVER | Eko Yuli IRAWAN | Indonesia | 2020 |
| 1050 | tokyo-2020 | Men's 61kg | Men | GOLD | Fabin LI | People's Republic of China | 2020 |
| 1051 | tokyo-2020 | Men's 61kg | Men | BRONZE | Igor SON | Kazakhstan | 2020 |
| 1052 | tokyo-2020 | Women's 55kg | Women | SILVER | Qiuyun LIAO | People's Republic of China | 2020 |
| 1053 | tokyo-2020 | Women's 55kg | Women | BRONZE | Zulfiya CHINSHANLO | Kazakhstan | 2020 |
| ... | ... | ... | ... | ... | ... | ... | ... |
| 21692 | athens-1896 | heavyweight - one hand lift men | Men | SILVER | Viggo JENSEN | Denmark | 1896 |
| 21693 | athens-1896 | heavyweight - one hand lift men | Men | BRONZE | Alexandros Nikolopoulos | Greece | 1896 |
| 21694 | athens-1896 | heavyweight - two hand lift men | Men | GOLD | Viggo JENSEN | Denmark | 1896 |
| 21695 | athens-1896 | heavyweight - two hand lift men | Men | SILVER | Launceston ELLIOT | Great Britain | 1896 |
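
The year column relies on every `oly_game` value ending in a four-digit year. A slightly more defensive sketch, assuming slugs of the form `city-YYYY` as in the rows above, checks that pattern explicitly and raises an error otherwise. `slug_to_year` is a hypothetical helper, not part of the original notebook.

```python
import re

# A hyphen followed by exactly four digits at the end of the slug
_YEAR_AT_END = re.compile(r"-(\d{4})$")


def slug_to_year(slug: str) -> int:
    """Return the trailing four-digit year of a Games slug such as 'tokyo-2020'."""
    match = _YEAR_AT_END.search(slug)
    if match is None:
        raise ValueError(f"no trailing four-digit year in {slug!r}")
    return int(match.group(1))


print(slug_to_year("tokyo-2020"))
print(slug_to_year("athens-1896"))
```

Applied to the frame, this could look like `weightDF["oly_game"].map(slug_to_year)`, which would surface any malformed slug as an error instead of silently producing a wrong year.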
