# 1. Business Understanding

## 1.1 Background

SyriaTel is one of the largest telecommunications providers in Syria, offering mobile services to a broad range of customers.  
In the highly competitive telecom industry, retaining existing customers is critical for sustaining profitability and market share.  

However, customer churn — when subscribers discontinue using SyriaTel’s services — poses a significant business challenge.  
Losing customers not only decreases revenue but also raises costs, because replacing them requires additional spending on marketing and promotions.

---

## 1.2 Business Objective

The key objective of this project is to develop a **predictive model** that accurately identifies customers likely to churn.  
With such a model, SyriaTel will be able to:

- Take **proactive retention measures** for high-risk customers  
- Design **targeted marketing campaigns**  
- Improve **customer satisfaction** by addressing key pain points  

Ultimately, this will help **reduce churn rates**, save on customer acquisition costs, and maximize long-term profitability.

---

## 1.3 Stakeholders

| Stakeholder | Interest |
|---|---|
| **Executive Management** | Strategic decisions on customer retention initiatives |
| **Marketing Team** | Design targeted offers and promotions |
| **Customer Service** | Improve customer engagement and support |
| **Data Science Team** | Develop and deploy predictive churn models |

---

## 1.4 Business Problem Statement

- Churn impacts both revenue and brand reputation.
- The cost of acquiring a new customer is significantly higher than retaining an existing one.
- The current challenge is the **lack of an early warning system** for customer churn.

By predicting churn probability, SyriaTel can intervene before losing the customer.

---

## 1.5 Project Goals

- **Build a classification model** using customer data to predict churn risk.
- **Identify key drivers** (features) contributing to churn.
- **Translate data insights into actionable business recommendations.**

---

## 1.6 Success Metrics

- **Model Performance:** Achieve an acceptable balance between precision and recall (using classification metrics like F1-score).
- **Business Impact:** Provide insights that SyriaTel’s marketing and customer success teams can implement.
- **Actionable Insights:** Highlight features that are strong predictors of churn to inform business decisions.
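
To make the precision/recall trade-off behind the F1-score concrete, here is a minimal sketch that computes all three metrics by hand for the churn (positive) class. The function name `precision_recall_f1` and the label lists are illustrative assumptions, not project results.

```python
def precision_recall_f1(y_true, y_pred):
    """Compute precision, recall, and F1 for the positive (churn = 1) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # churners correctly flagged
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # loyal customers flagged by mistake
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # churners the model missed

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Made-up labels for illustration only (1 = churned, 0 = stayed).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Low precision means retention offers are wasted on customers who would have stayed; low recall means churners slip through unnoticed. F1 penalizes a model that does well on only one of the two.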

---

## 1.7 Project Scope and Limitations

- The model will be based on historical customer data available at the time of this project.
- Predictions will be probabilistic — they suggest likelihood, not certainty.
- Results and recommendations are constrained by the quality and depth of the dataset provided.
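
Because the predictions are probabilities rather than certainties, turning them into retention actions requires a decision threshold. The sketch below shows one way this could look; the helper `flag_high_risk`, the customer IDs, the scores, and the thresholds are all hypothetical.

```python
def flag_high_risk(churn_probabilities, threshold=0.5):
    """Return customer IDs whose churn probability meets the threshold,
    ordered from highest to lowest risk."""
    ranked = sorted(churn_probabilities.items(),
                    key=lambda item: item[1], reverse=True)
    return [customer_id for customer_id, prob in ranked if prob >= threshold]


# Hypothetical model scores for four customers.
scores = {"A": 0.82, "B": 0.15, "C": 0.55, "D": 0.49}

flagged = flag_high_risk(scores, threshold=0.5)
flagged_lenient = flag_high_risk(scores, threshold=0.4)
print(flagged)          # customers contacted at the default threshold
print(flagged_lenient)  # a lower threshold reaches more customers
```

Lowering the threshold catches more potential churners at the cost of contacting more customers who would have stayed, so the choice depends on how expensive an intervention is relative to a lost customer.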

---


# 2. Data Understanding

## Dataset Overview  
The dataset contains historical customer information from SyriaTel, including location (state and area code), account information, plan subscriptions, usage patterns, customer service calls, and churn status.  
Our goal in this phase is to understand the structure of the data, inspect column details, assess data quality, and identify potential issues for cleaning and preparation.

## 2.1 Importing Libraries

In [None]:
# Importing Libraries 
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns



## 2.2 Loading and Previewing the Dataset

In [6]:
df = pd.read_csv('../data/bigml_59c28831336c6604c800002a.csv')
df.head()

## 2.3 Check Dataset Shape & Columns

In [None]:

print(f"Dataset Shape: {df.shape}")
print("\nColumns:\n", df.columns.tolist())


Dataset Shape: (3333, 21)

Columns:
 ['state', 'account length', 'area code', 'phone number', 'international plan', 'voice mail plan', 'number vmail messages', 'total day minutes', 'total day calls', 'total day charge', 'total eve minutes', 'total eve calls', 'total eve charge', 'total night minutes', 'total night calls', 'total night charge', 'total intl minutes', 'total intl calls', 'total intl charge', 'customer service calls', 'churn']
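
Since `churn` is the target column and the success metric in section 1.6 balances precision and recall, it may be worth checking early how imbalanced the target is. The sketch below uses a small made-up Series rather than the real data; on the actual dataset the same calls could be applied to `df['churn']`, assuming that column holds boolean values.

```python
import pandas as pd

# Made-up stand-in for df['churn']; these values are NOT from the SyriaTel data.
toy_churn = pd.Series([False, False, True, False, False], name="churn")

# Share of each class, useful for spotting imbalance at a glance.
class_share = toy_churn.value_counts(normalize=True)
print(class_share)

# The mean of a boolean Series is the fraction of True values, i.e. the churn rate.
churn_rate = float(toy_churn.mean())
print(f"Churn rate: {churn_rate:.1%}")
```

If churners turn out to be a small minority, plain accuracy becomes misleading, which is one reason to prefer F1-style metrics.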


## 2.4 Checking Data Types and Null Values

In [None]:

df.info()
df.isnull().sum()
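
The two checks above can also be combined into a single per-column table, which is often easier to scan. This is a minimal sketch on a toy frame: the helper `summarize_quality` is hypothetical, and the values (including the deliberately missing entry) are invented for illustration.

```python
import pandas as pd


def summarize_quality(frame):
    """Return one row per column with its dtype, missing count, and unique count."""
    return pd.DataFrame({
        "dtype": frame.dtypes.astype(str),
        "n_missing": frame.isnull().sum(),
        "n_unique": frame.nunique(),  # nunique() ignores missing values by default
    })


# Toy frame reusing three column names from the dataset; values are made up.
toy = pd.DataFrame({
    "international plan": ["no", "yes", None],
    "total day minutes": [265.1, 161.6, 243.4],
    "churn": [False, False, True],
})

summary = summarize_quality(toy)
print(summary)
```

On the real data this would be called as `summarize_quality(df)`; columns with a nonzero `n_missing` or an unexpected `dtype` are candidates for cleaning.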
