Skip to content

sahanasrin/ds_finalproject

Repository files navigation

Intro to Data Science: Final Project

Project Title: Predicting Median Income in the United States

Authors: Sahana Srinivasan, Jake Kochmansky, Jae Gnazzo, Nathaniel Spilka

Motivating Question: Using a small number of continuous variables, how accurately can our specified model predict median household income in United States Census Tracts?

Status: Complete. Github page can be found here. Github repository can be found here.

Data: The primary source of data is the U.S. Census Bureau's American Community Survey 5-year (ACS5) from 2013 thru 2017. An API Census Key is needed for access. Additionally, the Childhood Opportunity Index Dataset provided additional variables. More information on the 2017 ACS5 can be found here. More information on the Childhood Opportunity Index Dataset can be found here.

Citations: The following citiations acknowledge outside resources (excluding data) used in the construction on this project (in order of appearance in the project):

  1. Huq, S. (2022, March 23). 3.4 Million More Children in Poverty in February 2022 than December 2021. Columbia University Center on Poverty and Social Policy. https://www.povertycenter.columbia.edu/news-internal/monthly-poverty-february-2022
  2. Snider, S., & Kerr, E. (2021, December 16). Where Do I Fall in the American Economic Class System? U.S. News & World Report. https://money.usnews.com/money/personal-finance/family-finance/articles/where-do-i-fall-in-the-american-economic-class-system
  3. Scatter Plot Matrices - R Base Graphs - Easy Guides - Wiki - STHDA. (n.d.). Statistical Tools for High-Throughput Data Analysis. http://www.sthda.com/english/wiki/scatter-plot-matrices-r-base-graphs

About

Introduction to Data Science Final Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages