Skip to content

KarthikMurugadoss1804/Prediction-of-customer-churn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Prediction of customer churn on bank data

Abstract

In this project we would be predicting customer churn from bank data. We would be adopting 8 different classification models and apply stratifiedKfold cross validation to check the performance of the models. Stratified K Fold is used because it is best for classification problems. The main challenge would be the imbalanced data which would be handled using SMOTE before training the model. After cross validation of the models, three top performing models are taken and finally hyperparameter tuning is done using grid search to identify ideal hyperparameters for best performance

Files

  1. churn modeling.csv

    The dataset used for this project. Data Source: Kaggle.

  2. main.ipynb

    Implementation of the project - Jupyter notebook file.

  3. Research.pdf

    Detailed information about all preprocessing, implementation and the research work is available in this document.