Intro to Data Science: Final Project
Project Title: Predicting Median Income in the United States
Authors: Sahana Srinivasan, Jake Kochmansky, Jae Gnazzo, Nathaniel Spilka
Motivating Question: Using a small number of continuous variables, how accurately can our specified model predict median household income in United States Census Tracts?
Status: Complete. Github page can be found here. Github repository can be found here.
Data: The primary source of data is the U.S. Census Bureau's American Community Survey 5-year (ACS5) from 2013 thru 2017. An API Census Key is needed for access. Additionally, the Childhood Opportunity Index Dataset provided additional variables. More information on the 2017 ACS5 can be found here. More information on the Childhood Opportunity Index Dataset can be found here.
Citations: The following citiations acknowledge outside resources (excluding data) used in the construction on this project (in order of appearance in the project):
- Huq, S. (2022, March 23). 3.4 Million More Children in Poverty in February 2022 than December 2021. Columbia University Center on Poverty and Social Policy. https://www.povertycenter.columbia.edu/news-internal/monthly-poverty-february-2022
- Snider, S., & Kerr, E. (2021, December 16). Where Do I Fall in the American Economic Class System? U.S. News & World Report. https://money.usnews.com/money/personal-finance/family-finance/articles/where-do-i-fall-in-the-american-economic-class-system
- Scatter Plot Matrices - R Base Graphs - Easy Guides - Wiki - STHDA. (n.d.). Statistical Tools for High-Throughput Data Analysis. http://www.sthda.com/english/wiki/scatter-plot-matrices-r-base-graphs