This is the repository for my blog "Structured thinking for story-telling with advanced EDA" published on my personal web: http://mingjiezhao.com
This repository includes the raw data, and Jupyter Notebooks for the analysis mentioned in this blog.
Raw data: race_Females.txt, race_Males.txt
Jupyter Notebooks:
- part1_data_cleaning: data cleaning and pre-processing, cleaned_df.csv is the output cleaned data
- part2_in_depth_EDA: scripts for analysis in the blog