- Python
- Pandas, NumPy
- Matplotlib, Seaborn
- Scikit-learn
This project analyzes customer churn data to identify key factors influencing customer attrition and builds a machine learning model to predict churn.
- Telecom customer churn dataset with customer demographics, services, and billing information.
- Data cleaning and preprocessing (type conversion, missing value handling)
- Exploratory Data Analysis (EDA) using visualizations
- Feature encoding and scaling
- Built a Logistic Regression model for churn prediction
- Evaluated model performance using accuracy, precision, recall, and F1-score
- Logistic Regression Accuracy: ~78%
- Strong performance in identifying non-churn customers
Identified important churn drivers such as contract type, tenure, and monthly charges, providing actionable insights for customer retention strategies.
Customer_Churn_Analysis.ipynbβ Jupyter NotebookWA_Fn-UseC_-Telco-Customer-Churn.csvβ Dataset
Note This project is part of my Data Analytics portfolio, showcasing Python-based data analysis and machine learning skills.