Analytics Vidhya's Black Friday Data Hackathon
The challenge was to predict the purchase prices of products bought by customers, based on their historical purchase patterns. The data contained features such as age, gender, marital status, categories of products purchased, and city demographics. http://datahack.analyticsvidhya.com/contest/black-friday-data-hack
- Looked into levels of data and converted all variables into factors.
- Imputed missing values with '999' and converted those variables into factors.
- Ran a basic multiple linear regression and submitted its predictions as a benchmark model.
- Ran basic random forest, XGBoost, GLM, GBM, and deep learning models, excluding USER_ID and PRODUCT_ID.
- Got an RMSE of 2888 and a public leaderboard rank of 66/162.
- Used H2O packages.
- Feature engineering needs to be improved
- Reviewed code of top rankers. Hoping these learnings will improve my next LB ranking to top 10% :)
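
A minimal Python sketch of two of the steps above: recoding missing values as a '999' level so the column can be treated as a factor, and computing the RMSE metric reported above. This is not the original code (the models were run with H2O packages); the function names and the toy values are illustrative assumptions, and only the standard library is used.

```python
import math

# Sentinel level for missing values, as described in the notes above.
MISSING_LEVEL = "999"


def impute_as_category(values, missing_level=MISSING_LEVEL):
    """Replace missing entries with a sentinel and return every value
    as a string, so the column can be treated as a categorical factor."""
    levels = []
    for value in values:
        is_nan = isinstance(value, float) and math.isnan(value)
        if value is None or value == "" or is_nan:
            levels.append(missing_level)
        else:
            levels.append(str(value))
    return levels


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


if __name__ == "__main__":
    # Hypothetical product-category column with one missing entry.
    category = [8, None, 14]
    print(impute_as_category(category))

    # Hypothetical purchase values and model predictions.
    print(rmse([10, 20], [13, 16]))
```

Recoding missing values as their own level keeps those rows in the training data and lets tree-based models split on "missing" directly, at the cost of treating the column as unordered.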