## **Logistic Regression**

### 1. Short History of Logistic Regression

The logistic function was introduced in the **19th century** by **Pierre-François Verhulst** to model population growth.

In the **early 20th century**, statisticians realized that the same function could model **binary outcomes** (yes/no, success/failure).

By the **1940s–1950s**, logistic regression was formally developed as a statistical method, especially in **biostatistics and social sciences**, to model probabilities of events that have only two possible outcomes.

**Today**, logistic regression is one of the foundational algorithms in machine learning, widely used for problems such as:

- **Churn prediction**
- **Fraud detection**
- **Medical diagnosis**

**Because it is**:
- **Interpretable**
- **Probabilistic** 
- **Mathematically well-grounded**

---

### 2. What is Logistic Regression?

**Logistic regression** is a **supervised learning algorithm** used for **binary classification**.

**Its goal**: Model the probability that an outcome belongs to the **positive class** (e.g., customer churns).

**Key difference**: Instead of predicting a class label directly, logistic regression predicts a **probability**, which is then converted into a class decision using a **threshold** (commonly **0.5**).
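The thresholding step can be sketched in a few lines of Python. This is only an illustration: the function name `classify` and the sample probabilities are hypothetical, and the strict `>` comparison is an assumption about how ties at the threshold are handled.

```python
def classify(probability, threshold=0.5):
    """Return 1 (positive class, e.g. churn) if probability exceeds the threshold, else 0."""
    return 1 if probability > threshold else 0


# Hypothetical predicted churn probabilities for three customers.
churn_probabilities = [0.12, 0.50, 0.87]
labels = [classify(p) for p in churn_probabilities]
print(labels)
```

Lowering the threshold (for example `classify(0.3, threshold=0.25)`) flags more cases as positive, which may be preferable when missing a positive case is costly.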

---

### 3. Why the Linear Model Output is Not a Probability

Logistic regression starts with a **linear model**, just like linear regression:

$z = 0.8x - 1.2$


**Key issue**:

**Linear model outputs**: `z ∈ (-∞, +∞)` (any real number)

**Probabilities must satisfy**: `0 ≤ p ≤ 1`

**Therefore**:
- Linear output `z` **cannot** be interpreted as a probability
- We need a transformation that maps all real numbers into the interval `(0,1)`
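A quick check makes the problem concrete. The sketch below evaluates the example line `z = 0.8x - 1.2` at a few arbitrarily chosen inputs (the input values and the name `linear_score` are illustrative assumptions) and flags outputs that fall outside `[0, 1]`.

```python
def linear_score(x):
    """Raw linear model output z = 0.8x - 1.2 from the example above."""
    return 0.8 * x - 1.2


# Arbitrary inputs chosen to show both failure modes (below 0 and above 1).
scores = {x: linear_score(x) for x in (-5, 0, 5)}
for x, z in scores.items():
    valid = 0.0 <= z <= 1.0
    print(f"x={x:>2}  z={z:+.2f}  valid probability? {valid}")
```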

---

### 4. Log-Odds (Logit): Linking Linear Models to Probability

Instead of modeling probability directly, logistic regression models the **log-odds**, also called the **logit**.

**Odds**:

$$\text{odds} = \frac{p}{1 - p}$$


**Log-odds (logit)**:

$$\log\left(\frac{p}{1 - p}\right)$$


**Important property**:

- Log-odds range: (-∞, +∞)
- Linear model range: (-∞, +∞)

✓ Perfect match!
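To see how the two quantities behave, here is a small sketch that computes odds and log-odds for a few sample probabilities. The helper names `odds` and `logit` and the sample values are chosen for illustration only.

```python
import math


def odds(p):
    """Odds of an event with probability p (requires 0 <= p < 1)."""
    return p / (1 - p)


def logit(p):
    """Log-odds of an event with probability p (requires 0 < p < 1)."""
    return math.log(odds(p))


# p = 0.5 gives odds of 1 and log-odds of 0; probabilities near 0 or 1
# push the log-odds toward large negative or large positive values.
for p in (0.01, 0.5, 0.99):
    print(f"p={p:.2f}  odds={odds(p):.4f}  log-odds={logit(p):+.4f}")
```

The log-odds are symmetric around `p = 0.5`: `logit(0.01)` and `logit(0.99)` have the same magnitude and opposite signs.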


---

### 5. Logistic Regression Model (Example)

**Logistic regression fits a linear model to the log-odds**:

$$\log\left(\frac{p}{1-p}\right) = 0.8x - 1.2$$



**Left-hand side**: Log-odds (logit) of churning  
**Right-hand side**: Linear function of the feature `x`

**This equation means**:
- Features influence the **log-odds of churn linearly**
- **Not** the probability directly
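One way to read the coefficient: each one-unit increase in `x` adds 0.8 to the log-odds, which multiplies the odds by `e^0.8` (roughly 2.23) regardless of the starting value of `x`. The sketch below checks this numerically; the starting points `x = 1` and `x = 2` are arbitrary.

```python
import math


def log_odds(x):
    """Log-odds of churn from the example model: 0.8x - 1.2."""
    return 0.8 * x - 1.2


# Odds are recovered from log-odds by exponentiating.
odds_at_1 = math.exp(log_odds(1.0))
odds_at_2 = math.exp(log_odds(2.0))

# Ratio of odds after a one-unit increase in x.
odds_ratio = odds_at_2 / odds_at_1
print(f"odds ratio per unit of x: {odds_ratio:.4f}")
print(f"exp(0.8):                 {math.exp(0.8):.4f}")
```

The change in probability, by contrast, depends on where `x` starts, which is why the coefficient is interpreted on the odds scale.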

---

### 6. Converting Log-Odds into a Valid Probability

To recover the probability `p`, we solve the log-odds equation.

This leads to the **logistic (sigmoid) function**:

$$p = \frac{1}{1 + e^{-(0.8x - 1.2)}}$$
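For reference, the algebra behind this result, writing $z = 0.8x - 1.2$ for brevity:

$$\log\left(\frac{p}{1-p}\right) = z$$

$$\frac{p}{1-p} = e^{z}$$

$$p = e^{z}(1 - p) \;\Rightarrow\; p\,(1 + e^{z}) = e^{z}$$

$$p = \frac{e^{z}}{1 + e^{z}} = \frac{1}{1 + e^{-z}}$$

The last step divides the numerator and denominator by $e^{z}$.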


**Properties of the logistic function**:
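- Maps **any real number** to `(0,1)`, so the output is **always a valid probability**
- **Smooth and strictly increasing**: a larger `z` always gives a larger `p`
- Equals **0.5 at `z = 0`**, so a 0.5 threshold on `p` corresponds to `z = 0`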
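A minimal sketch of the logistic function in plain Python follows. Splitting on the sign of `z` is one common way to avoid overflow in `math.exp` for large negative inputs; it is an implementation choice made here, not something the formula requires. Note that in floating point, extreme inputs can round to exactly `0.0` or `1.0`.

```python
import math


def sigmoid(z):
    """Logistic function 1 / (1 + e^(-z)), written to avoid overflow."""
    if z >= 0:
        # e^(-z) is at most 1 here, so this form cannot overflow.
        return 1.0 / (1.0 + math.exp(-z))
    # For negative z, use the equivalent form e^z / (1 + e^z).
    ez = math.exp(z)
    return ez / (1.0 + ez)


for z in (-1000, -2, 0, 2, 1000):
    print(f"z={z:>5}  p={sigmoid(z):.6f}")
```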

---

### 7. Final Interpretation

1. Linear model → z = 0.8x - 1.2 (log-odds score)

2. Logistic function → p = sigmoid(z) (probability)

3. Threshold → Class decision (p > 0.5 = churn)


**Complete flow**:

Features → Linear Model → Log-odds → Sigmoid → Probability → Class
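Putting the steps together, the flow can be sketched end to end as a single function. The coefficients come from the running example; the function name `predict_churn` and the sample inputs are hypothetical, and a fitted model would learn the coefficients from data rather than hard-code them.

```python
import math


def predict_churn(x, threshold=0.5):
    """Return (log-odds, probability, class label) for a single feature value x."""
    z = 0.8 * x - 1.2                 # linear model: log-odds score
    p = 1.0 / (1.0 + math.exp(-z))    # sigmoid: log-odds -> probability
    label = int(p > threshold)        # threshold: probability -> class (1 = churn)
    return z, p, label


# Illustrative inputs on either side of the decision boundary.
for x in (0.0, 4.0):
    z, p, label = predict_churn(x)
    print(f"x={x}  z={z:+.2f}  p={p:.4f}  class={label}")
```

With a 0.5 threshold, the decision boundary sits where `z = 0`, that is at `x = 1.5` for these coefficients.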


**That's logistic regression!** 🎯


### **Data Preparation**