# 📊 **Performance Metrics in Logistic Regression**

---

## **1. Confusion Matrix**

* A **summary table** of predicted classes vs. actual outcomes at a fixed decision threshold (commonly 0.5 for logistic regression).

|                     | Predicted Positive  | Predicted Negative  |
| ------------------- | ------------------- | ------------------- |
| **Actual Positive** | True Positive (TP)  | False Negative (FN) |
| **Actual Negative** | False Positive (FP) | True Negative (TN)  |

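* The four counts are the basis for every threshold-based metric below (accuracy, precision, recall, specificity, F1, F-beta); ROC-AUC and log loss additionally need the predicted probabilities.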
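As a minimal sketch of how the four cells are counted, assuming binary labels where 1 is the positive class. The function name `confusion_counts` and the sample labels are made up for illustration.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fn, fp, tn) for binary labels where 1 = positive."""
    tp = fn = fp = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if actual == 1 and predicted == 1:
            tp += 1  # positive correctly predicted
        elif actual == 1 and predicted == 0:
            fn += 1  # positive missed
        elif actual == 0 and predicted == 1:
            fp += 1  # negative flagged as positive
        else:
            tn += 1  # negative correctly predicted
    return tp, fn, fp, tn


# Hypothetical labels, invented for this example.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

tp, fn, fp, tn = confusion_counts(y_true, y_pred)
print(tp, fn, fp, tn)  # 3 1 1 5
```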

---

## **2. Accuracy**

* **Overall correctness** of the model.

$$
\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}
$$

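* **Reliable when classes are balanced**; misleading on imbalanced data, where always predicting the majority class already scores high (e.g., 95% accuracy on data that is 95% negative).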
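The imbalance problem can be illustrated with a small sketch. The counts below are hypothetical: 95 negatives, 5 positives, and a model that always predicts negative.

```python
def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


# Hypothetical counts for a model that predicts "negative" for every sample.
tp, fn = 0, 5    # all 5 positives are missed
fp, tn = 0, 95   # all 95 negatives are (trivially) correct

print(accuracy(tp, tn, fp, fn))  # 0.95
```

The score looks strong even though the model never finds a single positive case.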

---

## **3. Precision (Positive Predictive Value)**

* Out of all predicted positives, how many are actually positive?

$$
\text{Precision} = \frac{TP}{TP + FP}
$$

* Useful when **false positives are costly** (e.g., spam filters).
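A minimal sketch of the formula, using hypothetical spam-filter counts. Returning 0.0 when nothing is predicted positive is an assumed convention here, not something the formula itself defines.

```python
def precision(tp, fp):
    """TP / (TP + FP); returns 0.0 if no positives were predicted (assumed convention)."""
    predicted_positive = tp + fp
    if predicted_positive == 0:
        return 0.0
    return tp / predicted_positive


# Hypothetical spam filter: 50 emails flagged, 40 of them really spam.
print(precision(tp=40, fp=10))  # 0.8
```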

---

## **4. Recall (Sensitivity / True Positive Rate)**

* Out of all actual positives, how many did we correctly predict?

$$
\text{Recall} = \frac{TP}{TP + FN}
$$

* Useful when **false negatives are costly** (e.g., disease detection).
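Because logistic regression outputs probabilities, recall depends on the decision threshold. The sketch below, with made-up scores and a hypothetical helper `recall_at_threshold`, shows recall rising as the threshold is lowered.

```python
def recall_at_threshold(y_true, scores, threshold):
    """Recall when samples with score >= threshold are predicted positive."""
    tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(y_true, scores) if y == 1 and s < threshold)
    return tp / (tp + fn) if (tp + fn) else 0.0


# Hypothetical labels and predicted probabilities.
y_true = [1, 1, 1, 1, 0, 0]
scores = [0.9, 0.7, 0.4, 0.2, 0.6, 0.1]

print(recall_at_threshold(y_true, scores, 0.5))  # 0.5
print(recall_at_threshold(y_true, scores, 0.3))  # 0.75
```

Lowering the threshold catches more positives, usually at the price of more false positives.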

---

## **5. Specificity (True Negative Rate)**

* Out of all actual negatives, how many did we correctly predict?

$$
\text{Specificity} = \frac{TN}{TN + FP}
$$

* Complements recall: measures how well the model identifies the **negative class**.
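A short sketch with hypothetical counts, also showing that the false positive rate is the complement of specificity. The function names are illustrative.

```python
def specificity(tn, fp):
    """TN / (TN + FP): share of actual negatives predicted negative."""
    return tn / (tn + fp)


def false_positive_rate(tn, fp):
    """FP / (TN + FP): share of actual negatives predicted positive."""
    return fp / (tn + fp)


# Hypothetical counts: 4 actual negatives, 1 of them wrongly flagged.
print(specificity(tn=3, fp=1))          # 0.75
print(false_positive_rate(tn=3, fp=1))  # 0.25
```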

---

## **6. F1 Score**

* **Harmonic mean** of precision and recall.

$$
F1 = 2 \cdot \frac{\text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}
$$

* Good when we need a **balance between precision and recall**.
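The sketch below illustrates why the harmonic mean is used: it stays low when either precision or recall is low, unlike the arithmetic mean. The input values are hypothetical, and returning 0.0 when both inputs are zero is an assumed convention.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero, by assumption)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical model with perfect precision but very low recall.
p, r = 1.0, 0.1
print(f1_score(p, r))  # about 0.18
print((p + r) / 2)     # arithmetic mean, 0.55, hides the weak recall
```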

---

## **7. F-beta Score**

* Generalization of F1 that treats recall as β times as important as precision.

$$
F_\beta = (1+\beta^2)\cdot \frac{\text{Precision}\cdot\text{Recall}}{(\beta^2 \cdot \text{Precision}) + \text{Recall}}
$$

* β > 1 → more weight on recall.
* β < 1 → more weight on precision.
* β = 1 → reduces to F1.
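A minimal sketch of the formula above, evaluated on a hypothetical model whose recall (1.0) is higher than its precision (0.5). The zero-denominator guard is an assumed convention.

```python
def f_beta(precision, recall, beta):
    """F-beta score; beta > 1 favors recall, beta < 1 favors precision."""
    denominator = beta**2 * precision + recall
    if denominator == 0:
        return 0.0  # assumed convention when both inputs are zero
    return (1 + beta**2) * precision * recall / denominator


# Hypothetical precision and recall values.
p, r = 0.5, 1.0
for beta in (0.5, 1.0, 2.0):
    print(beta, round(f_beta(p, r, beta), 3))
```

Since recall is the stronger of the two here, the score rises as β grows and more weight moves to recall.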

---

## **8. ROC Curve & AUC**

* **ROC Curve**: Plots **TPR (Recall)** vs. **FPR (1 - Specificity)** at different thresholds.
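* **AUC (Area Under Curve)**: Measures how well the model separates classes across all thresholds; equivalently, the probability that a randomly chosen positive is scored higher than a randomly chosen negative.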

  * AUC = 1 → perfect model.
  * AUC = 0.5 → random guessing.
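One way to compute AUC without drawing the curve is the rank-based formulation: count the fraction of (positive, negative) pairs in which the positive gets the higher score, with ties counted as half. This is a sketch of that pairwise approach on made-up scores, not the threshold-sweeping method a library would typically use; it is quadratic in the number of samples.

```python
def roc_auc(y_true, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(pos) * len(neg))


# Hypothetical labels and predicted probabilities.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
print(round(roc_auc(y_true, scores), 3))  # 0.889 (8 of 9 pairs ranked correctly)
```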

---

## **9. Log Loss (Cross-Entropy Loss)**

* Measures how well predicted probabilities match actual labels.

$$
\text{Log Loss} = -\frac{1}{N} \sum_{i=1}^N \big[ y_i \log(p_i) + (1-y_i) \log(1-p_i) \big]
$$

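* Here y_i is the true label (0 or 1) and p_i is the predicted probability that sample i is positive.
* **Lower is better**: 0 means perfect predictions, and confident wrong predictions are penalized heavily.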
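A minimal sketch of the formula, using natural logarithms. The clipping constant `eps` is an assumption added to avoid `log(0)`, and the sample probabilities are invented for illustration.

```python
import math


def log_loss(y_true, probs, eps=1e-15):
    """Mean binary cross-entropy; probabilities are clipped to [eps, 1 - eps]."""
    total = 0.0
    for y, p in zip(y_true, probs):
        p = min(max(p, eps), 1 - eps)  # avoid log(0)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(y_true)


# Hypothetical labels with three sets of predicted probabilities.
y_true = [1, 0]
print(round(log_loss(y_true, [0.9, 0.1]), 3))  # confident and right: small loss
print(round(log_loss(y_true, [0.5, 0.5]), 3))  # uninformative: log(2), about 0.693
print(round(log_loss(y_true, [0.1, 0.9]), 3))  # confident and wrong: large loss
```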

---

✅ **Summary:**

* **Accuracy** → overall correctness (works best for balanced data).
* **Precision** → good when FP is costly.
* **Recall** → good when FN is costly.
* **F1 / F-beta** → balance precision and recall.
* **ROC-AUC** → threshold-independent; good for comparing models.
* **Log Loss** → probability-based evaluation.

