Skip to content

Supervised Prediction of churn rate of a Telecom Company. Depending on his usage patterns and plans bought, classifying users if they would churn in future or not

Notifications You must be signed in to change notification settings

hasanali28/Telecom-Churn-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Telecom-Churn-Analysis

Customer churn refers to when a customer (player, subscriber, user, etc.) ceases his or her relationship with a company. Businesses typically treat a customer as churned once a particular amount of time has elapsed since the customer’s last interaction with the site or service. The full cost of customer churn includes both lost revenue and the marketing costs involved with replacing those customers with new ones. Reduction customer churn is important because cost of acquiring a new customer is higher than retaining an existing one. Reducing customer churn is a key business goal of every business. This case is related to telecom industry where particular organizations want to know that for given certain parameters whether a person will churn or not.

Exploratory Data Analysis

Our target variable has two categories which include True and False values. True = Customer will move or churn out. False = Customer won’t move We can clearly see that our data is highly imbalanced. The occurrence of false is higher than True. There are 2850 (85.51% ) customers who churn out and 483 (14.49%) customers retained. customer churn

State Wise Churn Analysis

statewise

Area Wise Churn Analysis

Most of the churned customers are from 415 area. area_wise

Churn according to International Plan

Churn rate is more with customer using international plan. As only 323 customer using International plan and 137 churning out of them. planvise

Churn according to Voicemail Plan

922 customer using voice mail plan and 80 out of them are churning voice mail plan vise

Churn according to Customer Care Calls

Churn rate for Customer neither having voicemail plan nor international plan is 9.06%. Churning rate for customer having International plan but don’t have voicemail plan is 3.03% out of 6.93% customers. Churning of customer having both voicemail plan & international plan is 1.08% out of 2.76% customer care calls vise

Collinear Plot

‘Total day minutes’ and ‘total day charges’ are highly correlated ‘Total eve minutes’ and ‘total eve charges’ are highly correlated ‘Total night minutes’ and ‘total night charges’ are highly correlated ‘Total intl minutes’ and ‘total intl charges’ are highly correlated correlation plot_py

Distribution of variables

Most of thevariables are normally distributed. nr1

SMOTE Oversampling in Python

SMOTE synthesize new minority instances between existing real minority instances. Imagine that SMOTE Draw lines between existing minority instances. SMOTE then imagine new synthetic minority instance somewhere on that lines. Like it will generate the synthetics of two real minority cases or data points. Applying synthetic minority oversampling technique to overcome the challenge of imbalance dataset as having an imbalance dataset will have negative impact over machine learning model predictions. In python we use SMOTE

Before :-

False = 1895 True = 338

After Smote

False = 1895 True = 1895

ROSE Oversampling in R:-

IN R we have used ROSE sampling technique. Which is similar to SMOTE, It also generating the synthetic data points and also it will under sample some random points from majority class.

Before :-

False = 1881 True = 319

After Smote

False = 1101 True = 1019

Model Evaluation

  1. Random Forest
  2. Logistic Regression
  3. K- Nearest Neighbors
  4. Naïve Bayes

Model Selection

Random forest has the best results for our problem. Random Forest has the best accuracy and lowest false negative rate and also lowest false positive rate.Hence we’ll choose Random Forest.

ROC Curve

roc_curve

About

Supervised Prediction of churn rate of a Telecom Company. Depending on his usage patterns and plans bought, classifying users if they would churn in future or not

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published