# Logistic Regression: Intuition and Motivation

> **Note:** Logistic regression is used for solving **binary classification problems**.

We will also discuss its **use cases**, the **cost function**, and other key concepts.

---

## Binary Classification Problem

Whenever we say **binary classification problem**, what does it mean?

Consider a dataset with:

- One feature: **study hours**
- Output (dependent) feature: **pass/fail** (binary categories)

Example data:

| Study Hours | Result  |
|------------|---------|
| 2          | Fail    |
| 3          | Fail    |
| 4          | Fail    |
| 5          | Pass    |
| 6          | Pass    |

The goal is:  

> Given the **study hours**, predict whether the student will **pass or fail**.

This is a **binary classification problem** because the output has **two categories**: 0 (fail) or 1 (pass).

---

## Can Linear Regression Solve This?

Let's try to use **linear regression** to predict a binary outcome.

- Assign **Pass = 1** and **Fail = 0**
- Plot the data points with **x-axis = study hours** and **y-axis = output (0 or 1)**

Linear regression tries to fit a **best-fit line**:

$$
y = \beta_0 + \beta_1 x
$$

Where:

- $x$ = study hours
- $y$ = predicted output, which we would like to read as the probability of passing (ideally between 0 and 1)
- $\beta_0$, $\beta_1$ = coefficients

---

### Predicting with Linear Regression

- For a new data point, predict $y$.
- Convert it to binary output:

$$
\hat{y} = 
\begin{cases} 
1 & \text{if } y > 0.5 \\
0 & \text{if } y \leq 0.5 
\end{cases}
$$

This seems to work initially.

---

### Problems with Linear Regression for Classification

1. **Sensitive to outliers:**

   - If a new data point is an outlier (e.g., 12 study hours → Pass), the best-fit line shifts.
   - The shifted line can then misclassify earlier points (e.g., 5 hours → Fail).

2. **Output can be outside [0,1]:**

   - Linear regression can predict $y > 1$ or $y < 0$.
   - A predicted probability must lie **between 0 and 1**, so such outputs are invalid.
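
Both problems can be reproduced on the study-hours table. The sketch below is a minimal illustration, assuming ordinary least squares on a single feature; the helper names `fit_line` and `predict` are hypothetical and written here only for this example.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (beta0, beta1)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    beta1 = sxy / sxx
    beta0 = mean_y - beta1 * mean_x
    return beta0, beta1


def predict(beta0, beta1, x):
    """Raw linear-regression output (not guaranteed to lie in [0, 1])."""
    return beta0 + beta1 * x


# Study-hours data from the table above (Fail = 0, Pass = 1).
hours = [2, 3, 4, 5, 6]
result = [0, 0, 0, 1, 1]

b0, b1 = fit_line(hours, result)
labels = [1 if predict(b0, b1, x) > 0.5 else 0 for x in hours]
# The thresholded labels match the table, yet the raw output for
# 2 study hours is already below 0.

# Add the outlier from the text: 12 study hours -> Pass.
b0_out, b1_out = fit_line(hours + [12], result + [1])
pred_5_hours = predict(b0_out, b1_out, 5)    # drops below 0.5 -> "Fail"
pred_12_hours = predict(b0_out, b1_out, 12)  # exceeds 1
```

With the outlier included, the student with 5 study hours is pushed below the 0.5 threshold, and the prediction for 12 hours is larger than 1.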

---

## Why Logistic Regression?

To overcome the above issues, we need a method that **squashes** the output between 0 and 1.

- Logistic regression applies the **sigmoid function**:

$$
\sigma(z) = \frac{1}{1 + e^{-z}}
$$

Where:

$$
z = \beta_0 + \beta_1 x
$$

- This ensures:

$$
0 < \sigma(z) < 1
$$

- We can then classify using:

$$
\hat{y} = 
\begin{cases} 
1 & \text{if } \sigma(z) > 0.5 \\
0 & \text{if } \sigma(z) \leq 0.5
\end{cases}
$$
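
The sigmoid and the thresholding rule can be sketched in a few lines. The coefficients `beta0` and `beta1` below are hypothetical values chosen by hand for illustration; they are not fitted to the data.

```python
import math


def sigmoid(z):
    """Logistic (sigmoid) function: maps any real z into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical coefficients, picked by hand (not the result of training).
beta0, beta1 = -4.5, 1.0

# Study hours from the table, plus the 12-hour outlier.
hours = [2, 3, 4, 5, 6, 12]
probs = [sigmoid(beta0 + beta1 * x) for x in hours]
labels = [1 if p > 0.5 else 0 for p in probs]
```

Every value in `probs` stays strictly between 0 and 1, including the one for the 12-hour outlier, and `sigmoid(0)` sits exactly at the 0.5 threshold.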

---

### Summary: Why Not Linear Regression?

1. **Outliers** affect the line drastically.
2. **Predictions can exceed [0,1]**, which is invalid for probabilities.
3. Logistic regression **squashes** the line using the **sigmoid function**, ensuring valid probabilities.

---




# Performance Metrics in Classification

In this video, we are going to discuss **performance metrics** which are specifically used in:

- **Binary Classification**
- **Multi-class Classification**

We will cover:

- Confusion Matrix  
- Accuracy  
- Precision  
- Recall  
- F-beta Score  

---

## Logistic Regression Recap

Logistic regression is used for **classification problems**.  
We separate categories using a **linear decision boundary** (a straight line when there are two features).  

Example:  
- If a point lies above the line → category 1  
- If a point lies below the line → category 0  

To evaluate how the model is performing, we need **performance metrics**.  

For regression problems, we used:  
- $R^2$ score  
- Adjusted $R^2$ score  

For classification problems, we use:  
- Confusion Matrix  
- Accuracy  
- Precision  
- Recall  
- F-beta Score  

---

## Confusion Matrix

The **confusion matrix** is the foundation of classification metrics.  
For **binary classification**, it is a **2 × 2 matrix**.

|                | **Predicted 1** | **Predicted 0** |
|----------------|-----------------|-----------------|
| **Actual 1**   | True Positive (TP) | False Negative (FN) |
| **Actual 0**   | False Positive (FP) | True Negative (TN) |

- **TP (True Positive):** Model predicts **1**, actual is **1**  
- **TN (True Negative):** Model predicts **0**, actual is **0**  
- **FP (False Positive):** Model predicts **1**, actual is **0** (Type I Error)  
- **FN (False Negative):** Model predicts **0**, actual is **1** (Type II Error)  
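
The four cells can be counted directly from paired labels. This is a minimal sketch; the function name `confusion_counts` and the label lists are made up for illustration.

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, TN, FP, FN) for binary labels encoded as 0 and 1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # Type I error
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # Type II error
    return tp, tn, fp, fn


# Hypothetical labels: 4 actual positives, 3 actual negatives.
y_true = [1, 1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 1, 0]

tp, tn, fp, fn = confusion_counts(y_true, y_pred)
```

For these labels the counts are TP = 3, TN = 1, FP = 2, FN = 1.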

---

## Accuracy

Accuracy measures the fraction of **correct predictions**:

$$
\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}
$$

Example: If  
- $TP = 3$, $TN = 1$, $FP = 2$, $FN = 1$

Then:

$$
\text{Accuracy} = \frac{3 + 1}{3 + 1 + 2 + 1} = \frac{4}{7}
$$
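
The same calculation as a short sketch, using the counts from the example above (the function name `accuracy` is just illustrative):

```python
def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


acc = accuracy(tp=3, tn=1, fp=2, fn=1)  # 4/7, roughly 0.571
```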

---

## Problem with Accuracy: Imbalanced Dataset

If the dataset has **imbalanced categories**, accuracy may give a **misleading result**.

Example:  
- 1000 samples → 900 in Class 1, 100 in Class 0  
- If a model predicts **all as Class 1**, accuracy = $90\%$  
- But the model is **useless**, since it never predicts Class 0 correctly.  

Thus, in **imbalanced datasets**, we use **Precision and Recall**.
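
The imbalanced example can be checked with a few lines. This is a sketch with synthetic labels that mirror the 900/100 split described above.

```python
# 900 samples of Class 1 and 100 samples of Class 0, as in the example.
y_true = [1] * 900 + [0] * 100

# A "model" that ignores its input and always predicts Class 1.
y_pred = [1] * len(y_true)

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# How many Class 0 samples were identified correctly?
class0_found = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
```

Accuracy comes out at 0.9 even though not a single Class 0 sample is found.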

---

## Precision

Precision focuses on **False Positives**.  

Formula:

$$
\text{Precision} = \frac{TP}{TP + FP}
$$

Interpretation:  
- Out of all **predicted positives**, how many are actually positive?  

---

## Recall

Recall focuses on **False Negatives**.  

Formula:

$$
\text{Recall} = \frac{TP}{TP + FN}
$$

Interpretation:  
- Out of all **actual positives**, how many are correctly predicted?  
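
Precision and recall for the counts used in the accuracy example (TP = 3, FP = 2, FN = 1) can be sketched as follows. The function names are illustrative, and the sketch assumes the denominators are non-zero.

```python
def precision(tp, fp):
    """Of everything predicted positive, the fraction that is truly positive."""
    return tp / (tp + fp)


def recall(tp, fn):
    """Of everything actually positive, the fraction the model found."""
    return tp / (tp + fn)


p = precision(tp=3, fp=2)  # 3/5
r = recall(tp=3, fn=1)     # 3/4
```

Precision is lower than recall here because the model makes more false positives than false negatives.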

---

## Precision vs Recall: Use Cases

### 1. Spam Classification
- If a mail is **not spam**, but predicted as **spam** → **False Positive**  
- This is a **big blunder** (important mails lost).  
- **Focus:** Reduce False Positives → Use **Precision**  

### 2. Medical Diagnosis (e.g., Diabetes Detection)
- If a person **has diabetes**, but model predicts **no diabetes** → **False Negative**  
- This is **dangerous** (disease missed).  
- **Focus:** Reduce False Negatives → Use **Recall**

---

## F-beta Score

When **both FP and FN are important**, we use **F-beta score**:

$$
F_\beta = \frac{(1 + \beta^2) \cdot (\text{Precision} \cdot \text{Recall})}{(\beta^2 \cdot \text{Precision}) + \text{Recall}}
$$

Special cases:

1. **F1 Score** (balanced case, FP and FN equally important):  
   $\beta = 1$

   $$
   F_1 = \frac{2 \cdot (\text{Precision} \cdot \text{Recall})}{\text{Precision} + \text{Recall}}
   $$

   → Harmonic mean of Precision and Recall  

2. **F0.5 Score** (FP more important than FN):  
   $\beta = 0.5$

   $$
   F_{0.5} = \frac{1.25 \cdot (\text{Precision} \cdot \text{Recall})}{0.25 \cdot \text{Precision} + \text{Recall}}
   $$

3. **F2 Score** (FN more important than FP):  
   $\beta = 2$

   $$
   F_2 = \frac{5 \cdot (\text{Precision} \cdot \text{Recall})}{4 \cdot \text{Precision} + \text{Recall}}
   $$
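
The general formula and its three special cases can be sketched as one function. The name `f_beta` and the precision/recall values (0.6 and 0.75) are chosen only for illustration.

```python
def f_beta(precision, recall, beta):
    """F-beta score: beta < 1 favors precision, beta > 1 favors recall."""
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Illustrative values: precision 0.6, recall 0.75.
p, r = 0.6, 0.75

f1 = f_beta(p, r, 1.0)      # harmonic mean of p and r
f_half = f_beta(p, r, 0.5)  # weights precision (FP) more heavily
f2 = f_beta(p, r, 2.0)      # weights recall (FN) more heavily
```

Because recall is higher than precision in this example, the score rises as $\beta$ grows: $F_{0.5} < F_1 < F_2$.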

---

## Summary

- **Accuracy** → Use when the dataset is **balanced**  
- **Precision** → Use when **False Positives** are critical (e.g., spam classification)  
- **Recall** → Use when **False Negatives** are critical (e.g., medical diagnosis)  
- **F-beta Score** → Use when **both FP and FN are important**  

---




> Next: We will discuss the **ROC Curve** and AUC as performance metrics. 🚀



