Sample taken from students of SMU through stratified sampling (with the 6 schools being the 6 strata).
Technologies used:
- Pandas
- StatsModels
- Matplotlib
- Seaborn
cleanData.py
used for data cleaning to ensure GPAs collected were legitimate by removing GPAs that were impossible.
visualise.ipynb
used to visualise the data collected.