This repository stores the R scripts and data for my Kaggle kernels. To download the files together with this description, click the green "Clone or download" button above.
The R code and data file are in the 01_IncomePrediction folder. You can find the kernel on Kaggle.
In this kernel, I use R to demonstrate data cleansing and preparation for modeling: removing unnecessary variables, identifying outliers, and reclassifying categorical variables. I then build prediction models with several techniques: a neural network, the k-nearest neighbors algorithm, the CART algorithm, and the C4.5 algorithm. I also demonstrate how to use misclassification costs to adjust the models. Finally, I evaluate the modeling results using a cost-benefit analysis approach.
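
To give a rough idea of what a cost-based evaluation can look like, here is a minimal sketch in base R. It is not the code from the kernel: the labels, the class names `low` and `high`, and the cost values are all invented for illustration, and the real kernel may compute costs differently.

```r
# Toy labels, invented for illustration (not taken from the kernel's data).
actual <- factor(
  c("high", "low", "low", "high", "low", "low", "high", "low"),
  levels = c("low", "high")
)
predicted <- factor(
  c("high", "low", "high", "low", "low", "low", "high", "low"),
  levels = c("low", "high")
)

# Confusion matrix: rows are actual classes, columns are predicted classes.
confusion <- table(actual = actual, predicted = predicted)

# Hypothetical cost matrix in the same layout as the confusion matrix.
# Assumption: predicting "low" for an actual "high" record costs 4 units,
# predicting "high" for an actual "low" record costs 1 unit,
# and correct predictions cost nothing.
costs <- matrix(
  c(0, 1,
    4, 0),
  nrow = 2, byrow = TRUE,
  dimnames = list(actual = c("low", "high"), predicted = c("low", "high"))
)

# Plain error rate treats every mistake the same.
error_rate <- 1 - sum(diag(confusion)) / sum(confusion)

# Total and average cost weight each kind of mistake by its assumed cost.
total_cost <- sum(confusion * costs)
average_cost <- total_cost / sum(confusion)

print(confusion)
cat("Error rate:", error_rate, "\n")
cat("Total cost:", total_cost, "\n")
cat("Average cost per record:", average_cost, "\n")
```

Two models with the same error rate can have quite different total costs under a matrix like this, which is why comparing models by cost rather than by accuracy alone can change which one looks best.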