
Final project for Messy Data and Machine Learning at NYU Steinhardt, Spring 2020. Project partners: Hope Muller and Heidi Choi


kenny-mai/nyc-homeless-students


The purpose of the project was to build prediction models for high school graduation among homeless students in NYC public schools. Using longitudinal data, random forest models were used to identify the key variables (features) that best predict whether an individual student is at risk of not graduating high school on time. A multi-level LASSO model fit to school-level data was then used to cross-validate the features identified by the random forest models.
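The two-step idea above (rank features with a random forest, then check them against an L1-penalized model) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic, the column names are hypothetical, scikit-learn is not confirmed to be the tooling used in this repository, and a plain L1-penalized logistic regression stands in for the multi-level LASSO model.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data; none of these columns come from the project.
rng = np.random.default_rng(0)
n = 2000
feature_names = ["attendance_rate", "credits_by_grade_9",
                 "noise_a", "noise_b", "noise_c"]
X = rng.normal(size=(n, len(feature_names)))

# By construction, only the first two columns drive the binary outcome
# (1 = graduated on time in this toy setup).
signal = 2.0 * X[:, 0] + 1.5 * X[:, 1]
y = (signal + rng.logistic(size=n) > 0).astype(int)

# Step 1: fit a random forest and rank features by impurity-based importance.
forest = RandomForestClassifier(n_estimators=200, random_state=0)
forest.fit(X, y)
rf_ranking = sorted(zip(feature_names, forest.feature_importances_),
                    key=lambda pair: pair[1], reverse=True)
rf_top = {name for name, _ in rf_ranking[:2]}

# Step 2: fit an L1-penalized logistic regression on standardized features.
# The L1 penalty shrinks weak coefficients to exactly zero, so the surviving
# features can be compared with the forest's ranking.
X_scaled = StandardScaler().fit_transform(X)
lasso = LogisticRegression(penalty="l1", solver="liblinear", C=0.01)
lasso.fit(X_scaled, y)
lasso_kept = {name for name, coef in zip(feature_names, lasso.coef_[0])
              if coef != 0.0}

# Features that both models agree on.
agreed = rf_top & lasso_kept
print("Random forest top features:", sorted(rf_top))
print("Features kept by L1 model:", sorted(lasso_kept))
print("Agreed features:", sorted(agreed))
```

The penalty strength `C=0.01` is an arbitrary choice for this toy data; in practice it would be tuned, and a multi-level model would additionally account for students being nested within schools.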

Student-level data used for this project were provided by the Research Alliance for NYC Schools (RA). The data were housed on an RA server, and the scripts were run over an SSHFS drive mapping so that the data never left the server, as per RA's agreement with the NYC Department of Education.
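One way a script could respect this constraint is to resolve every data path against the SSHFS mount and refuse anything that falls outside it. The sketch below is an assumption for illustration only: the mount point `/mnt/ra_data` and the helper `path_on_mount` are hypothetical names and are not taken from this repository's scripts.

```python
import os
from pathlib import Path

# Hypothetical local mount point for the remote data directory.
MOUNT_POINT = Path("/mnt/ra_data")


def path_on_mount(relative_path, mount_point=MOUNT_POINT, require_mounted=True):
    """Return an absolute path under mount_point, or raise if it would escape it."""
    root = Path(mount_point).resolve()
    # Fail early if the remote drive is not mounted, so nothing is
    # silently read from or written to a local directory instead.
    if require_mounted and not os.path.ismount(root):
        raise RuntimeError(f"{root} is not a mounted filesystem")
    candidate = (root / relative_path).resolve()
    # Reject paths such as "../local_copy.csv" or absolute paths that
    # would land outside the mounted directory.
    if candidate != root and root not in candidate.parents:
        raise ValueError(f"{candidate} is outside {root}")
    return candidate
```

A script would then build every input and output path through this helper, for example `path_on_mount("cohort/students.csv")`, where the file name is again hypothetical.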
