Please find Published report of this project here in Rpubs.
Aim:
This Mini Project involves data preparation of dataset cencus_income.csv in order to make it fit for futher analysis and model building.
Description:
-
Creating dummy variable for character variables.
-
Grouping similar category variables and making dummies.
-
Dealing with flag variables.(for numeric variables)
-
Converting the target Variable.(Y)
Data Information:
census_income.csv is a csv file containing 32561 obs and 15 variables.
It describes the income range of people with their characteristic attributes.The income range of people is >50k and <=50k which is stored in target variable Y.
We need to prepare data for the remaining (14) variables which can be further usefull in building models.