Python implementation of a case study in Module 2 of the MITProfessionalX course "Data Science: Data to insights".
The case study is: "Module 2 Case Study - Regression and prediction". This case study is about using ensemble methods in R on wages data.
Use different prediction methods (linear model, lasso, cross-validated lasso, random forest, Ridge, cross-validated Ridge, Elastic net, cross-validated Elastic net) and combine them using Ensemble method. Refer to sklearn.
Our goals are:
-
Predict wages using various characteristics of workers.
-
Assess the predictive performance of different methods (linear model, lasso, cross-validated lasso, random forest, Ridge, cross-validated Ridge, Elastic net, cross-validated Elastic net) and combine them using Ensemble method.
Data is from the March Supplement of the U.S. Current Population Survey, year 2012.