U.S. College Majors Outcomes - Exploratory Data Analysis and Visualization
Exploring the impact of college majors on graduates' earnings and unemployment, as well as the gender gap in bachelor's degrees in the U.S.
The first dataset covers outcomes for students who graduated from college between 2010 and 2012. The original data was released by the American Community Survey, and FiveThirtyEight published a cleaned version in their GitHub repository.
The second dataset contains the percentage of bachelor's degrees granted to women in each major category from 1970 to 2011, collected by the National Center for Education Statistics. A cleaned version compiled by Dr. Randal Olson can be downloaded from his website.
- Python 3;
- Jupyter Notebook: an open-source web application to share documents that contain live code, visualizations and explanatory text;
- NumPy: a Python library for scientific computing;
- pandas: a Python library for data manipulation and analysis;
- matplotlib: a Python plotting library for data visualization;
- seaborn: a Python data visualization library based on matplotlib for attractive statistical data visualization.
To be able to run the notebook, you'll first need to install the Python packages listed in requirements.txt:
pip install -r requirements.txt
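Optionally, you can check that the installation worked before going further. The snippet below is a minimal sketch using only the Python standard library. The package names in `PACKAGES` are an assumption based on the tools listed above and may not match the exact contents of requirements.txt; `installed_versions` is a hypothetical helper, not part of this project.

```python
from importlib.metadata import PackageNotFoundError, version

# Assumed distribution names for the tools listed above; the actual
# contents of requirements.txt may differ.
PACKAGES = ["notebook", "numpy", "pandas", "matplotlib", "seaborn"]


def installed_versions(packages):
    """Map each package name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in the current environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions(PACKAGES).items():
        print(f"{name}: {ver or 'not installed'}")
```

Any package reported as "not installed" suggests that the pip command above did not complete, or that it ran in a different Python environment.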
Then, run Jupyter Notebook:
jupyter notebook
This project is licensed under the MIT License - see the LICENSE.md file for details.