Waze

User Churn Project

9 June to 1 September 2023

Waze is a community driven navigation app that helps millions of users get to where they’re going through real-time road alerts and an up-to-the-moment map. Typically, high user retention rates indicate satisfied users who repeatedly use the Waze app over time. This project develops a churn prediction model to help prevent churn, improve user retention, and grow Waze’s business. The questions answered in this project were Who are the users most likely to churn? Why do users churn? and, When do users churn?

This project was part of my Google Advanced Data Analytics Certificate.

FINDINGS

It was established that the data is insufficient for reliably predicting user churn and that further granular data is needed on app usage and geography. Given the data, it could be determined that users who are professional drivers and who use the app more in a month are the biggest predictors of whether a user will churn or be retained.

PROJECT OVERVIEW

This was a three-stage project, in which I was involved after the first stage. Jupyter Notebooks of code I wrote are found in this repository for stages 2 through 5.

Stage 1: Project proposal (not involved)

Data was imported and explored for useful user churn information
A project proposal was accepted by Waze for an in-depth EDA (stage 2), statistical testing (stage 3), and predictive modelling (stages 4 & 5)

Stage 2: EDA (9-12 June 2023)

Churn rate is highest for users who didn’t drive using the app much in the last month
Device types had similar churn rates
Key conclusion: Statistical tests need to be run on variable classes (e.g., device used) to determine significant relationships with churn

Stage 3: Two-sample hypothesis test (24-28 June 2023)

Calculations show that iPhone users have a higher average use of the app compared to Android users
However, this difference is not statistically significant
Key conclusion: More marketing-relevant data is needed for statistically examining churn by device use and other variables.

Stage 4:Logistic regression analysis (17-20 July 2023)

Ran a binomial logistic regression with slightly better than benchmark precision but very low recall
Contrary to what was expected from EDA findings, the amount of driving was the second-least-important variable for predicting churn

Stage 5: Predictive classification models (28 August to 1 September 2023)

Features of interest were extracted, and a random forest model and a GBM model on predicting user churn were developed and performances compared
The GBM outperformed the random forest model, and it had similar levels of precision and accuracy to the logistic regression, with a much better (though still unsatisfactory) recall score
The models confirmed the insufficiency of the data and the need for driver-level data collection (e.g., drive times and geographic information) and user interaction with the app (e.g., input a road hazard).

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
Waze_project_Stage_2_EDA.ipynb		Waze_project_Stage_2_EDA.ipynb
Waze_project_Stage_3_HypothesisTest.ipynb		Waze_project_Stage_3_HypothesisTest.ipynb
Waze_project_Stage_4_LogisticRegression.ipynb		Waze_project_Stage_4_LogisticRegression.ipynb
Waze_project_Stage_5_ML.ipynb		Waze_project_Stage_5_ML.ipynb
waze_dataset.csv		waze_dataset.csv
waze_gbm_cm.png		waze_gbm_cm.png
waze_gbm_feature_importance.png		waze_gbm_feature_importance.png
waze_logistic_regression_confusion_matrix.png		waze_logistic_regression_confusion_matrix.png
waze_logistic_regression_logit_activitydays.png		waze_logistic_regression_logit_activitydays.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Waze

User Churn Project

Stage 1: Project proposal (not involved)

Stage 2: EDA (9-12 June 2023)

Stage 3: Two-sample hypothesis test (24-28 June 2023)

Stage 4:Logistic regression analysis (17-20 July 2023)

Stage 5: Predictive classification models (28 August to 1 September 2023)

About

Releases

Packages

Languages

DStrix66/waze-user-churn

Folders and files

Latest commit

History

Repository files navigation

Waze

User Churn Project

Stage 1: Project proposal (not involved)

Stage 2: EDA (9-12 June 2023)

Stage 3: Two-sample hypothesis test (24-28 June 2023)

Stage 4:Logistic regression analysis (17-20 July 2023)

Stage 5: Predictive classification models (28 August to 1 September 2023)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages