# Evaluation Metrics in Machine Learning
The real way to measure how good your model is.

##  1️⃣ Accuracy

### What is Accuracy?

Accuracy tells you **how often your model is correct**.

**Accuracy = Correct Predictions / Total Predictions**

| Actual   | Predicted    | Result |
|--------  |------------  |------- |
| Spam     | Spam         | ✅    |
| Not Spam | Not Spam     | ✅    |
| Spam     | Not Spam     | ❌    |
| Not Spam | Not Spam     | ✅    |
| Spam     | Spam         | ✅    |

✅ Correct Predictions = 4  
❌ Incorrect Predictions = 1  

**Accuracy = 4/5 = 0.8 = 80%**
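
To make the arithmetic concrete, here is a minimal Python sketch that recomputes accuracy from the five rows of the table above. The helper name `accuracy` and the list names are illustrative choices, not part of any library.

```python
# (actual, predicted) labels copied from the five rows of the table above.
actual    = ["Spam", "Not Spam", "Spam",     "Not Spam", "Spam"]
predicted = ["Spam", "Not Spam", "Not Spam", "Not Spam", "Spam"]

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the actual labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

acc = accuracy(actual, predicted)
print(f"Accuracy = {acc:.0%}")  # 4 correct out of 5 -> 80%
```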

### Why it matters:
- Gives you a **quick overall idea** of performance.  
- But it can be **misleading** if your data is **imbalanced**.

###  Example of Imbalance

Suppose 95% of emails are **Not Spam** and only 5% are **Spam**.  
If your model **always predicts "Not Spam"** →  
✅ 95% Accurate  
❌ 0% useful at detecting spam!

That’s why we need other metrics like **Precision**, **Recall**, and **F1-Score**.
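
The imbalance example can be reproduced in a few lines. This is only a sketch on a made-up dataset of 100 emails (95 "Not Spam", 5 "Spam") with a hypothetical "model" that always answers "Not Spam".

```python
# Hypothetical dataset: 95 "Not Spam" emails and 5 "Spam" emails.
actual = ["Not Spam"] * 95 + ["Spam"] * 5

# A "model" that ignores its input and always predicts "Not Spam".
predicted = ["Not Spam"] * len(actual)

accuracy = sum(a == p for a, p in zip(actual, predicted)) / len(actual)
spam_caught = sum(a == "Spam" and p == "Spam" for a, p in zip(actual, predicted))

print(f"Accuracy: {accuracy:.0%}")                                    # 95%
print(f"Spam caught: {spam_caught} of {actual.count('Spam')}")        # 0 of 5
```

High accuracy here says nothing about the spam class: not a single spam email is caught.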


##  2️⃣ Precision

###  What is Precision?

Precision answers:  
> “Out of all the positive predictions, how many were actually correct?”

**Precision = True Positives / (True Positives + False Positives)**

### 💡 Example

Your spam filter predicted 10 emails as spam:
- 7 were actually spam ✅  
- 3 were not spam ❌

**Precision = 7 / (7 + 3) = 0.7 = 70%**
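
The same calculation as a small Python sketch. The function name `precision` and the choice to return `0.0` when there are no positive predictions are assumptions made for this illustration.

```python
def precision(true_positives, false_positives):
    """Share of positive predictions that were actually positive."""
    predicted_positives = true_positives + false_positives
    # Assumption: report 0.0 when the model made no positive predictions.
    if predicted_positives == 0:
        return 0.0
    return true_positives / predicted_positives

# 10 emails flagged as spam: 7 really were spam, 3 were not.
p = precision(true_positives=7, false_positives=3)
print(f"Precision = {p:.0%}")  # 70%
```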

### Why it matters:
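Precision focuses on **how reliable your positive predictions are**.
It matters most when a false positive is costly, such as a legitimate email landing in the spam folder.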


##  3️⃣ Recall (Sensitivity or True Positive Rate)

###  What is Recall?

Recall answers:  
> “Out of all the actual positive cases, how many did the model correctly identify?”

**Recall = True Positives/(True Positives + False Negatives)**
	
### Example

- There are 10 actual spam emails.  
- Your model correctly catches 7, but misses 3.

**Recall = 7 / (7 + 3) = 0.7 = 70%**
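
A matching Python sketch for recall. As before, the function name `recall` and the `0.0` fallback for a dataset with no actual positives are assumptions for illustration.

```python
def recall(true_positives, false_negatives):
    """Share of actual positives that the model correctly identified."""
    actual_positives = true_positives + false_negatives
    # Assumption: report 0.0 when the data contains no actual positives.
    if actual_positives == 0:
        return 0.0
    return true_positives / actual_positives

# 10 actual spam emails: 7 caught, 3 missed.
r = recall(true_positives=7, false_negatives=3)
print(f"Recall = {r:.0%}")  # 70%
```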

### Why it matters:
Recall focuses on **how many real positives your model captures**.
It matters most when a missed positive is costly, such as a spam email reaching the inbox.


## 4️⃣ F1-Score

###  What is F1-Score?

It’s the **harmonic mean** of precision and recall.

**F1 = 2 × (Precision × Recall) / (Precision + Recall)**

### Why use it:
- Great for **imbalanced datasets**
- Gives a **balanced measure** of model performance


F1 rewards you only if you’re **good at both** —  
like a **balanced student**, not just strong in one subject.
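
A short Python sketch of why the harmonic mean rewards balance. The first call reuses the 70% precision and 70% recall from the examples above. The second uses made-up values (perfect precision, 10% recall) to show how F1 is dragged toward the weaker score, where a plain average would not be.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    # Assumption: report 0.0 when both inputs are zero.
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

balanced = f1_score(0.7, 0.7)   # equal scores: F1 equals them (0.70)
lopsided = f1_score(1.0, 0.1)   # hypothetical: perfect precision, poor recall
plain_average = (1.0 + 0.1) / 2

print(f"Balanced F1:   {balanced:.2f}")
print(f"Lopsided F1:   {lopsided:.2f}")       # about 0.18
print(f"Plain average: {plain_average:.2f}")  # 0.55, hides the weak recall
```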

## 5️⃣ Logarithmic Loss (Log Loss)

###  What is Log Loss?

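Unlike accuracy or F1 (which only check correct vs incorrect),  
**Log Loss measures how well your predicted probabilities match the actual labels**.  
A confident wrong prediction is penalized far more than an uncertain one.  
It’s used with **probabilistic models** like Logistic Regression.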


###  Example

Suppose for one email:

Actual: Spam (1)

Predicted probability: 0.95 → great confidence ✅
→ Log loss = small (good)

If predicted 0.51 → still correct but uncertain 😬
→ Log loss = higher

If predicted 0.05 → confidently wrong ❌
→ Log loss = huge (very bad)

| Case  | Actual   | Predicted Probability  | Log Loss  | Interpretation          |
|-------|--------- |------------------------|-----------|----------------         |
| 1     | 1        | 0.95                   | Small     | ✅ Confident & Correct  |
| 2     | 1        | 0.51                   | Moderate  | 😬 Uncertain            |
| 3     | 1        | 0.05                   | Huge      | ❌ Confidently Wrong    |
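
To put numbers on "Small", "Moderate", and "Huge", here is a minimal sketch using the standard binary log loss for a single prediction, `-(y * ln(p) + (1 - y) * ln(1 - p))`. The formula is not spelled out in the text above, and the function name `log_loss_single` and the clipping value `eps` are assumptions for this illustration.

```python
import math

def log_loss_single(y_true, p, eps=1e-15):
    """Binary log loss for one prediction; p is the predicted probability of class 1."""
    # Clip p away from 0 and 1 so that log() never receives 0.
    p = min(max(p, eps), 1 - eps)
    return -(y_true * math.log(p) + (1 - y_true) * math.log(1 - p))

# The three cases from the table: the actual label is always Spam (1).
losses = {p: log_loss_single(1, p) for p in (0.95, 0.51, 0.05)}
for p, loss in losses.items():
    print(f"p = {p:.2f} -> log loss = {loss:.3f}")
# p = 0.95 -> log loss = 0.051
# p = 0.51 -> log loss = 0.673
# p = 0.05 -> log loss = 2.996
```

For a whole dataset, log loss is the average of these per-prediction values.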
