Skip to content

Churn Prediction using Machine Learning (PySpark and Scikit-learn)

Notifications You must be signed in to change notification settings

pradeepsinngh/Churn-Prediction

Repository files navigation

Churn Prediction using Machine Learning (PySpark and Scikit-learn)

Project Description:

Predicting Customer churn rate on Telco Customer churn dataset, taken from kaggale. We'll take a look at what types of customer data we have, do some preliminary analysis, and develop churn prediction models - all with Python/PySpark and different machine learning frameworks, like, ML Package and Scikit-learn.

The broad idea of this project is to develop machine learning models that could predict churn rate from given data. We will use different tools and methods like, numpy, pandas, seaborn, correlation, etc. to explore, extract and trasnform data.

Dataset:

Telco Customer Churn Dataset: This dataset consist information of telco's customers. Each row represents a customer, each column contains customer’s attributes described on the column Metadata. There are roughly 7000 customer data with 21 attribute for each customers.

Dataset link: https://www.kaggle.com/blastchar/telco-customer-churn

About

Churn Prediction using Machine Learning (PySpark and Scikit-learn)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published