preprocessing

Here are 47 public repositories matching this topic...

This project is an entry in a Kaggle competition on detecting fraudulent users. We developed a Random Forest classification model to predict whether a user downloads a specific app through an advertisement. The data set contains 200 million observations, which can be considered big data. We implemented many feature engineering and data preprocessing techniques…

  • Updated May 8, 2018
  • R
