AWS-Machine-Learning-Engineer-Capstone

Project developed for AWS Machine Learning Engineer Scholarship offered by Udacity (2023)

Churn Prediction

Customer churn, also referred to as customer attrition, poses a significant challenge for businesses. It arises when customers discontinue the use of a company's products or services, and high churn rates can have adverse effects on a company's revenue and profitability. To tackle this issue, machine learning algorithms can be leveraged to identify the factors that contribute to churn. Churn models are designed to identify early warning signs and recognize customers who are more likely to voluntarily leave. As part of this project, I will be delving into three different algorithms: logistic regression, decision tree, and random forest. Through the application of these three powerful tools, I aim to develop a highly accurate classifier that can predict which customers are likely to churn and which are not.

Full text

Project overview

Package Version

polars                          0.17.0
folium                          0.14.0
imblearn                        0.0
sagemaker                       2.144.0
kaggle                          1.5.13

Notebook enviroment

Loading kaggle dataset

Setup kaggle API

1 - Create a new account in kaggle.com if you do not have one

2 - Access account settings

3 - Click on 'Create New API Token'

4- Upload the kaggle.json to SageMaker

4- Run the setup_kaggle_api.sh script on terminal or run in a notebook cell !bash setup_kaggle_api.sh

5- Run the load_dataset.sh script on terminal or run in a notebook cell !bash load_dataset.sh

References

SageMaker

SageMaker Hyperparameter Tuning

SagaMaker Training Jobs

SageMaker Batch Transfom

SageMaker Inference Pipelines

AWS S3

N. Forhad, M. S. Hussain, and R. M. Rahman, "Churn analysis: Predicting churners," in Proceedings of the Ninth International Conference on Digital Information Management (ICDIM 2014), Phitsanulok, Thailand, 2014, pp. 237-241, doi: 10.1109/ICDIM.2014.6991433.

Qureshi, Saad, Ammar Rehman, Ali Qamar, Aatif Kamal, and Ahsan Rehman. "Telecommunication Subscribers' Churn Prediction Model Using Machine Learning." In Proceedings of the 8th International Conference on Digital Information Management (ICDIM 2013), 2013, pp. 133-137, doi: 10.1109/ICDIM.2013.6693977.

Ullah, Irfan, Basit Raza, Ahmad Malik, Muhammad Imran, Saif Islam, and Sung Won Kim. "A Churn Prediction Model Using Random Forest: Analysis of Machine Learning Techniques for Churn Prediction and Factor Identification in Telecom Sector." IEEE Access, vol. 7, pp. 104634-104647, 2019, doi: 10.1109/ACCESS.2019.2914999.

Khan, Muhammad, Johua Manoj, Anikate Singh, and Joshua Blumenstock. "Behavioral Modeling for Churn Prediction: Early Indicators and Accurate Predictors of Custom Defection and Loyalty." In Proceedings of the IEEE International Congress on Big Data (BigData Congress), 2015, pp. 7-14, doi: 10.1109/BigDataCongress.2015.107.

G. Menardi and N. Torelli, "Training and assessing classification rules with imbalanced data," Data Mining and Knowledge Discovery, vol. 28, no. 1, pp. 92-122, 2014, https://doi.org/10.1007/s10618-012-0295-5.

V. S. Spelmen and R. Porkodi, "A Review on Handling Imbalanced Data," in Proceedings of the 2018 International Conference on Current Trends towards Converging Technologies (ICCTCT), Coimbatore, India, 2018, pp. 1-11, doi: 10.1109/ICCTCT.2018.8551020.

N. V. Chawla, K. W. Bowyer, and W. P. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," Journal of Artificial Intelligence Research, vol. 16, pp. 321-357, 2002.

R. Kohavi, "A study of cross-validation and bootstrap for accuracy estimation and model selection," in Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI'95), San Francisco, CA, USA, 1995, pp. 1137-1143.

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
figures		figures
scripts		scripts
.gitignore		.gitignore
Capstone proposal - AWS Machine Learning Engineer Nanodegree.pdf		Capstone proposal - AWS Machine Learning Engineer Nanodegree.pdf
Churn Prediction - AWS Machine Learning Engineer Nanodegree.pdf		Churn Prediction - AWS Machine Learning Engineer Nanodegree.pdf
LICENSE		LICENSE
README.md		README.md
load_dataset.sh		load_dataset.sh
requirements.txt		requirements.txt
sagemaker_churn_notebook.ipynb		sagemaker_churn_notebook.ipynb
setup_kaggle_api.sh		setup_kaggle_api.sh
test_notebook.ipynb		test_notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AWS-Machine-Learning-Engineer-Capstone

Churn Prediction

Project overview

Package Version

Notebook enviroment

Loading kaggle dataset

Setup kaggle API

References

About

Releases

Packages

Languages

License

mathewsrc/AWS-Machine-Learning-Engineer-Capstone

Folders and files

Latest commit

History

Repository files navigation

AWS-Machine-Learning-Engineer-Capstone

Churn Prediction

Project overview

Package Version

Notebook enviroment

Loading kaggle dataset

Setup kaggle API

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages