This project focuses on predicting customer churn using a synthetic dataset and a logistic regression model. The goal is to identify clients who are likely to leave a service based on their usage patterns and profile data.
The dataset includes the following features:
tenure: Time with the company (in months)monthly_charges: Monthly billing amounttotal_charges: Total amount billed to datenum_support_calls: Number of customer support callsuses_internet: Binary flag indicating internet usage (0 = No, 1 = Yes)contract_type: Type of contract- 0 = Month-to-month
- 1 = One-year
- 2 = Two-year
The script trains a Logistic Regression model and evaluates its performance using:
- Accuracy
- Confusion matrix
The trained model is saved as churn_model.pkl.
Ensure you have the required dependencies installed (see below), then run the script:
python churn_model.py