# Confusion Matrix

#### In this notebook we have:

1. [Introduction](#introduction)
2. [Confusion Matrix](#confusion-matrix)
3. [Name of Each Combination](#name-of-each-combination)
4. [Scenario](#scenario)
   - [True Positive](#true-positive)
   - [True Negative](#true-negative)
   - [False Positive](#false-positive)
   - [False Negative](#false-negative)
5. [Type I and Type II Errors](#type-i-and-type-ii-errors)
6. [Confusion Matrix Representation](#confusion-matrix-representation)
7. [Confusion Matrix Example](#confusion-matrix-example)
8. [Performance Metrics](#performance-metrics)
9. [Accuracy](#accuracy)
10. [Accuracy Can Be Misleading](#accuracy-can-be-misleading)
11. [Recall or Sensitivity or True Positive Rate](#recall-or-sensitivity-or-true-positive-rate)
12. [Specificity or True Negative Rate](#specificity-or-true-negative-rate)
13. [Precision](#precision)
14. [F1 Score](#f1-score)

---

## Introduction

- A **confusion matrix** is a table that summarizes the results of a classification model by comparing its predictions with the actual labels.

---

## Confusion Matrix

- In **binary classification**, every case belongs to one of two classes:
  - **Positive** (true)
  - **Negative** (false)

- Example: A binary classifier scans MRI images and predicts whether a person has **cancer** or **not**.
- Comparing the outcome predicted by the classifier with the actual outcome gives **four possible combinations**, named in the next section.

---

## Name of Each Combination

| Combination       | Description |
|-------------------|-------------|
| **True Positive**  | Model predicts **cancer** (positive) and the patient actually has **cancer**. |
| **False Positive** | Model predicts **cancer** (positive) but the patient does **not** have cancer. |
| **False Negative** | Model predicts **no cancer** (negative) but the patient actually has **cancer**. |
| **True Negative**  | Model predicts **no cancer** (negative) and the patient does **not** have cancer. |
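The naming rule in the table can be sketched in a few lines of Python: the first word says whether the prediction was correct, and the second word is the predicted class. The function name `outcome_name` and its boolean parameters are illustrative assumptions, not part of the original notebook.

```python
def outcome_name(actual_positive: bool, predicted_positive: bool) -> str:
    """Name one (actual, predicted) combination.

    Hypothetical helper for illustration only.
    """
    # "True" means the prediction matched reality, "False" means it did not.
    correctness = "True" if actual_positive == predicted_positive else "False"
    # The second word always describes what the model predicted.
    predicted = "Positive" if predicted_positive else "Negative"
    return f"{correctness} {predicted}"


print(outcome_name(actual_positive=True, predicted_positive=True))    # True Positive
print(outcome_name(actual_positive=False, predicted_positive=True))   # False Positive
print(outcome_name(actual_positive=True, predicted_positive=False))   # False Negative
print(outcome_name(actual_positive=False, predicted_positive=False))  # True Negative
```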

---

## Scenario

Consider a machine learning model that classifies each person as **pregnant** (positive) or **not pregnant** (negative).

### True Positive
- A person who is **actually pregnant** (positive) and classified as **pregnant** (positive).
- **TRUE POSITIVE (TP)**

### True Negative
- A person who is **not pregnant** (negative) and classified as **not pregnant** (negative).
- **TRUE NEGATIVE (TN)**

### False Positive
- A person who is **not pregnant** (negative) but classified as **pregnant** (positive).
- **FALSE POSITIVE (FP)**

### False Negative
- A person who is **pregnant** (positive) but classified as **not pregnant** (negative).
- **FALSE NEGATIVE (FN)**

---

## Type I and Type II Errors

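- **Type I Error** (False Positive):
  - Occurs when you reject the **null hypothesis** although it is **true**.

- **Type II Error** (False Negative):
  - Occurs when you fail to reject the **null hypothesis** although it is **false**.

- In the classification setting, the null hypothesis is that the case is **negative** (for example, the patient does not have cancer).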

---

## Confusion Matrix Representation

A **Confusion Matrix** can be represented as:

|               | Predicted Positive | Predicted Negative |
|---------------|-------------------|-------------------|
| **Actual Positive** | True Positive (TP)  | False Negative (FN) |
| **Actual Negative** | False Positive (FP) | True Negative (TN)  |

- **Total Predictions** = TP + FP + FN + TN
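As a minimal sketch, the four cells can be counted directly from a list of actual labels and a list of predicted labels. The function name `confusion_counts`, the `1`/`0` label encoding, and the sample labels are assumptions made for this example.

```python
from collections import Counter


def confusion_counts(actual, predicted):
    """Return [[TP, FN], [FP, TN]], laid out like the table above.

    Assumes labels are encoded as 1 (positive) and 0 (negative).
    """
    counts = Counter(zip(actual, predicted))  # keys are (actual, predicted) pairs
    tp = counts[(1, 1)]
    fn = counts[(1, 0)]
    fp = counts[(0, 1)]
    tn = counts[(0, 0)]
    return [[tp, fn], [fp, tn]]


# Hypothetical labels, for illustration only.
actual = [1, 1, 1, 0, 0, 0, 0, 1]
predicted = [1, 0, 1, 0, 1, 0, 0, 1]

matrix = confusion_counts(actual, predicted)
print(matrix)                         # [[3, 1], [1, 3]]
print(sum(sum(row) for row in matrix))  # 8, the total number of predictions
```

Libraries may order the cells differently. For example, scikit-learn's `confusion_matrix` sorts the labels, so with `0`/`1` labels the negative class comes first.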

---

## Confusion Matrix Example

Let’s use concrete numbers to illustrate the confusion matrix:

- **True Positives (TP)** = 57  
- **False Positives (FP)** = 14  
- **False Negatives (FN)** = 23  
- **True Negatives (TN)** = 171  

**Total Predictions** = 57 + 14 + 23 + 171 = **265**

---

## Performance Metrics

A confusion matrix allows us to calculate several performance metrics, each covered in the sections below.

---

## Accuracy

- **Accuracy** is the fraction of correct predictions (both **TP** and **TN**) out of total predictions.
  
$$
\text{Accuracy} = \frac{TP + TN}{TP + FP + FN + TN}
$$

Example:

$$
\text{Accuracy} = \frac{57 + 171}{265} \approx 0.86 \text{ or } 86\%
$$
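The same calculation can be checked in Python using the counts from the example. The variable names are chosen for this sketch.

```python
# Counts from the confusion matrix example.
tp, fp, fn, tn = 57, 14, 23, 171

total = tp + fp + fn + tn
accuracy = (tp + tn) / total  # correct predictions over all predictions

print(total)               # 265
print(f"{accuracy:.2f}")   # 0.86
```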

---

## Accuracy Can Be Misleading

- Accuracy alone may be misleading, especially with **class imbalance**.

Example:
- If a classifier predicts **all negatives** (no cancer cases identified), it can still achieve high accuracy when **negatives** dominate the dataset, even though it never detects a single positive case.
- This is why we need metrics beyond accuracy.
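A small sketch makes the problem concrete. The dataset below is hypothetical (10 cancer cases among 1,000 patients) and the classifier simply predicts "no cancer" for everyone.

```python
# Hypothetical imbalanced dataset: 1 = cancer, 0 = no cancer.
actual = [1] * 10 + [0] * 990

# A classifier that predicts "negative" for every patient.
predicted = [0] * len(actual)

correct = sum(a == p for a, p in zip(actual, predicted))
accuracy = correct / len(actual)

# How many actual cancer cases did the classifier identify?
cancer_cases_found = sum(a == 1 and p == 1 for a, p in zip(actual, predicted))

print(accuracy)            # 0.99
print(cancer_cases_found)  # 0
```

The classifier scores 99% accuracy while missing every cancer case.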

---

## Recall or Sensitivity or True Positive Rate

- **Recall** is the fraction of positive cases correctly predicted out of all **actual positive cases**:

$$
\text{Recall} = \frac{TP}{TP + FN}
$$

Example:

$$
\text{Recall} = \frac{57}{57 + 23} \approx 0.71 \text{ or } 71\%
$$
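As a sketch, recall can be written as a small function (the name `recall` and its parameters are chosen for this example). The second call uses hypothetical counts for a classifier that predicts every case as negative, which shows how recall exposes a model that never finds a positive.

```python
def recall(tp: int, fn: int) -> float:
    """Fraction of actual positives that were predicted positive."""
    return tp / (tp + fn)


# Counts from the confusion matrix example.
print(f"{recall(tp=57, fn=23):.2f}")  # 0.71

# Hypothetical all-negative classifier with 10 actual positives: TP = 0, FN = 10.
print(recall(tp=0, fn=10))            # 0.0
```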

---

## Specificity or True Negative Rate

- **Specificity** is the fraction of negative cases correctly predicted out of all **actual negative cases**:

$$
\text{Specificity} = \frac{TN}{TN + FP}
$$

Example:

$$
\text{Specificity} = \frac{171}{171 + 14} \approx 0.92 \text{ or } 92\%
$$
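A matching sketch for specificity follows, with the function name `specificity` assumed for illustration. The second call uses hypothetical counts to show that a classifier predicting every case as negative has no false positives and so reaches a perfect specificity, which is why specificity should be read together with recall.

```python
def specificity(tn: int, fp: int) -> float:
    """Fraction of actual negatives that were predicted negative."""
    return tn / (tn + fp)


# Counts from the confusion matrix example.
print(f"{specificity(tn=171, fp=14):.2f}")  # 0.92

# Hypothetical all-negative classifier with 100 actual negatives: FP = 0.
print(specificity(tn=100, fp=0))            # 1.0
```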

---

## Precision

- **Precision** is the fraction of correctly predicted positive cases out of all predicted **positive cases**:

$$
\text{Precision} = \frac{TP}{TP + FP}
$$

Example:

$$
\text{Precision} = \frac{57}{57 + 14} \approx 0.80 \text{ or } 80\%
$$
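Precision has one edge case worth handling: if the model predicts no positives at all, then TP + FP = 0 and the ratio is undefined. The sketch below returns `None` in that case. This is a convention chosen for this example, and other tools may report 0 or raise a warning instead.

```python
from typing import Optional


def precision(tp: int, fp: int) -> Optional[float]:
    """Fraction of predicted positives that are actually positive.

    Returns None when nothing was predicted positive (an assumption
    made for this sketch, since the ratio is undefined there).
    """
    predicted_positives = tp + fp
    if predicted_positives == 0:
        return None
    return tp / predicted_positives


# Counts from the confusion matrix example.
print(f"{precision(tp=57, fp=14):.2f}")  # 0.80

# A classifier that never predicts positive: TP = 0, FP = 0.
print(precision(tp=0, fp=0))             # None
```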

---

## F1 Score

- The **F1 score** is the harmonic mean of **Precision** and **Recall**, combining both into a single metric:

$$
F1 = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}
$$
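Applying the formula to the counts from the confusion matrix example gives an F1 score of about 0.75. The sketch below also checks an algebraically equivalent form written in terms of the raw counts, $2TP / (2TP + FP + FN)$. The variable names are chosen for this example.

```python
import math

# Counts from the confusion matrix example.
tp, fp, fn = 57, 14, 23

precision = tp / (tp + fp)
recall = tp / (tp + fn)

# F1 as the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)

# Equivalent form using the counts directly.
f1_from_counts = 2 * tp / (2 * tp + fp + fn)

print(f"{f1:.2f}")                       # 0.75
print(math.isclose(f1, f1_from_counts))  # True
```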

---

