# Chapter 4: Introduction to Diagnostic Testing Metrics for Model Performance



### Many students and professionals misinterpret diagnostic test metrics (e.g., confusing a test's error rates with its predictive value).
- Example: Casscells et al. (1978) found that most participants misinterpreted the PPV of a diagnostic test given a disease prevalence of 1/1000 and a false positive rate of 5%. The correct answer is about 2% (assuming the test detects every true case); the most common incorrect response was 95%.

- If a model says a person has a disease, read that as "the prediction says the person has the disease", not "the person has the disease". It may sound odd, but the point is to treat the prediction as a measurement, not as the truth (ground truth).

- The prediction reflects a probability or classification that should be interpreted alongside the model's sensitivity, specificity, and predictive values.
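
The roughly 2% figure in the Casscells example follows from Bayes' theorem. The snippet below is a minimal sketch of that arithmetic. The function name `ppv_from_rates` is hypothetical, and the 100% sensitivity is an assumption, since the problem statement gives only the prevalence and the false positive rate.

```python
def ppv_from_rates(prevalence, sensitivity, false_positive_rate):
    """Positive predictive value from population rates (Bayes' theorem)."""
    # Fraction of the population that is diseased AND tests positive.
    true_pos = sensitivity * prevalence
    # Fraction of the population that is healthy AND tests positive.
    false_pos = false_positive_rate * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


# Prevalence 1/1000 and false positive rate 5% come from the example above.
# Sensitivity = 1.0 is an assumption made for this sketch.
ppv = ppv_from_rates(prevalence=0.001, sensitivity=1.0, false_positive_rate=0.05)
print(f"PPV = {ppv:.4f}")  # PPV = 0.0196
```

In words: out of 1000 people, about 1 has the disease and tests positive, while about 50 of the 999 healthy people also test positive, so only about 1 in 51 positive results is a true case.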

### Definitions for Sensitivity, Specificity, and Related Metrics

1. **True Positive (TP):**  
   A result where the test predicts a positive outcome, and the subject actually has the condition.

2. **False Positive (FP):**  
   A result where the test predicts a positive outcome, but the subject does not have the condition.

3. **True Negative (TN):**  
   A result where the test predicts a negative outcome, and the subject does not have the condition.

4. **False Negative (FN):**  
   A result where the test predicts a negative outcome, but the subject actually has the condition.

5. **Sensitivity (Recall, True Positive Rate):**  
   $$
   \text{Sensitivity} = \frac{\text{TP}}{\text{TP} + \text{FN}}
   $$  
   Measures the ability of a test to correctly identify positive cases.

6. **Specificity (True Negative Rate):**  
   $$
   \text{Specificity} = \frac{\text{TN}}{\text{TN} + \text{FP}}
   $$  
   Measures the ability of a test to correctly identify negative cases.

7. **Positive Predictive Value (PPV):**  
   $$
   \text{PPV} = \frac{\text{TP}}{\text{TP} + \text{FP}}
   $$  
   Probability that a subject with a positive test result actually has the condition.

8. **Negative Predictive Value (NPV):**  
   $$
   \text{NPV} = \frac{\text{TN}}{\text{TN} + \text{FN}}
   $$  
   Probability that a subject with a negative test result does not have the condition.

9. **Accuracy:**  
   $$
   \text{Accuracy} = \frac{\text{TP} + \text{TN}}{\text{TP} + \text{FP} + \text{TN} + \text{FN}}
   $$  
   Measures the overall correctness of the test across all cases.

10. **Prevalence:**  
    $$
    \text{Prevalence} = \frac{\text{TP} + \text{FN}}{\text{TP} + \text{FP} + \text{TN} + \text{FN}}
    $$  
    The proportion of the population with the condition.

11. **False Positive Rate (FPR):**  
    $$
    \text{FPR} = \frac{\text{FP}}{\text{FP} + \text{TN}}
    $$  
    The probability of falsely identifying a negative case as positive.

12. **False Negative Rate (FNR):**  
    $$
    \text{FNR} = \frac{\text{FN}}{\text{TP} + \text{FN}}
    $$  
    The probability of failing to identify a positive case.

13. **Receiver Operating Characteristic (ROC) Curve:**  
    A plot of **Sensitivity** ($y$-axis) vs. $1 - \text{Specificity}$, i.e. the FPR ($x$-axis), traced out as the decision threshold varies. It is used to evaluate the performance of a diagnostic test.

14. **Area Under the Curve (AUC):**  
    The area under the ROC curve, summarizing the test's ability to discriminate between positive and negative cases: 0.5 corresponds to chance-level discrimination and 1.0 to perfect discrimination.
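
To make the ROC and AUC definitions concrete, the following sketch sweeps a threshold over a set of scores, records one (FPR, Sensitivity) point per threshold, and integrates with the trapezoidal rule. The function names and the six-sample toy dataset are hypothetical and purely illustrative. In practice a library routine would usually be preferred.

```python
def roc_points(labels, scores):
    """Return (FPR, TPR) points from sweeping a threshold over the scores.

    labels: 1 if the condition is present, 0 otherwise.
    scores: higher score means "more likely positive".
    Assumes at least one positive and one negative label.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = [(0.0, 0.0)]  # threshold above every score: nothing called positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc_trapezoid(points):
    """Area under a piecewise-linear curve given as (x, y) points sorted by x."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Toy data (made up for illustration only).
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]

auc = auc_trapezoid(roc_points(labels, scores))
print(f"AUC = {auc:.4f}")  # AUC = 0.8889
```

The result, 8/9, equals the fraction of (positive, negative) pairs in which the positive case receives the higher score, which is the usual probabilistic reading of AUC.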

---

### Example Table Representation

|                       | Disease (D)                      | No Disease (C)                   | Total                            |
|-----------------------|----------------------------------|----------------------------------|----------------------------------|
| **Test Positive (P)** | TP                               | FP                               | $n_P = \text{TP} + \text{FP}$    |
| **Test Negative (N)** | FN                               | TN                               | $n_N = \text{FN} + \text{TN}$    |
| **Total**             | $n_D = \text{TP} + \text{FN}$    | $n_C = \text{FP} + \text{TN}$    | $n = n_D + n_C$                  |
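
As a sketch of how the table's four cells feed the definitions above, the function below computes each metric from the counts. The function name `diagnostic_metrics` and the example counts are hypothetical, and the code assumes every marginal total is nonzero.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Compute the chapter's metrics from the four cells of the 2x2 table.

    Assumes all marginal totals (n_D, n_C, n_P, n_N) are nonzero.
    """
    n_d = tp + fn  # total with the disease
    n_c = fp + tn  # total without the disease
    n_p = tp + fp  # total testing positive
    n_n = fn + tn  # total testing negative
    n = n_d + n_c  # everyone
    return {
        "sensitivity": tp / n_d,
        "specificity": tn / n_c,
        "ppv": tp / n_p,
        "npv": tn / n_n,
        "accuracy": (tp + tn) / n,
        "prevalence": n_d / n,
        "fpr": fp / n_c,
        "fnr": fn / n_d,
    }


# Made-up counts for illustration: 1000 subjects, 100 of them diseased.
m = diagnostic_metrics(tp=90, fp=50, fn=10, tn=850)
for name, value in m.items():
    print(f"{name:12s} {value:.3f}")
```

With these counts the test has 90% sensitivity and 94% accuracy, yet the PPV is only about 64%, because the 50 false positives are drawn from the much larger disease-free group.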
