### **When to Use Recall vs. Precision?**  
The choice between **recall** (sensitivity) and **precision** depends on the **cost of false negatives (FN) vs. false positives (FP)** in your application. Here’s a simple guide:

| **Metric**  | **Focus**               | **When to Prioritize**                                  | **Example Use Cases**                     |
|-------------|-------------------------|-------------------------------------------------------|------------------------------------------|
| **Recall**  | Minimize **False Negatives (FN)** | When missing a positive case is **dangerous/costly**. | Cancer detection, spam filtering (security), fraud detection |
| **Precision** | Minimize **False Positives (FP)** | When false alarms are **harmful/expensive**.          | Email spam (business), recommendation systems, legal document review |

---

### **1. When to Prioritize Recall (High Recall)**
**Goal:** Catch **as many true positives as possible**, even if it means some false alarms.  
**Use cases:**  
- **Medical Diagnosis (Cancer, HIV):** Missing a case (FN) could be fatal.  
- **Spam Detection (Security):** Letting phishing emails (FN) into the inbox is risky.  
- **Fraud Detection:** Missing fraud (FN) costs money.  
- **Search & Rescue:** Missing a survivor (FN) is unacceptable.  

**Trade-off:** Higher recall often means **more false positives (FP)**.  

**Example:**  
- A cancer test with **90% recall** catches 90% of cancers but may flag some healthy patients (FP).  

---

### **2. When to Prioritize Precision (High Precision)**
**Goal:** Ensure **positive predictions are correct**, even if some true positives are missed.  
**Use cases:**  
- **Email Spam (Business):** Incorrectly blocking important emails (FP) is worse than some spam slipping through.  
- **Recommendation Systems:** Showing irrelevant products (FP) hurts user trust.  
- **Legal/Financial Docs:** Wrongly flagging a legal doc as fraudulent (FP) causes delays.  
- **Autonomous Vehicles:** False alarms (FP) could make the car brake unnecessarily.  

**Trade-off:** Higher precision often means **more false negatives (FN)**.  

**Example:**  
- A spam filter with **95% precision** is right about 95% of the emails it flags as spam, so few good emails are blocked, but it may let some spam through (FN).  

---

### **3. When to Balance Both (F1-Score)**
If both **FN and FP** are important, use the **F1-score** (harmonic mean of precision and recall).  

**Use cases:**  
- **Moderate-risk scenarios** (e.g., customer churn prediction, defect detection).  
- **When no single error type is drastically worse** than the other.  

**Example:**  
- A social media content filter balances **removing harmful posts (recall)** and **avoiding over-censorship (precision)**.  
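
To see why the harmonic mean is used here, the sketch below compares a balanced operating point with a lopsided one. The function name `f1_score` and the precision/recall values are illustrative assumptions, not figures from a real model.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical operating points, chosen only for illustration.
balanced = f1_score(0.80, 0.80)  # both metrics equal
lopsided = f1_score(1.00, 0.10)  # perfect precision, very poor recall

print(f"balanced F1: {balanced:.3f}")
print(f"lopsided F1: {lopsided:.3f}")
print(f"lopsided arithmetic mean: {(1.00 + 0.10) / 2:.3f}")
```

The lopsided case has an arithmetic mean of 0.55 but an F1 of roughly 0.18, so a model cannot score well on F1 by excelling at only one of the two metrics.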

---

### **4. Practical Decision Flow**  
Ask:  
1. **Is a missed detection (FN) worse than a false alarm (FP)?** → **Recall**.  
   - *Example:* Missing cancer is worse than a false scare.  
2. **Is a false alarm (FP) worse than a missed case (FN)?** → **Precision**.  
   - *Example:* Blocking a legitimate email is worse than missing spam.  
3. **Are both important?** → **F1-score or adjust threshold**.  
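
One way to make the questions above concrete is to attach a cost to each error type and compare the total. This is a minimal sketch: the error counts, the cost values, and the names `total_error_cost` and `cheaper_option` are all hypothetical.

```python
def total_error_cost(fn: int, fp: int, cost_fn: float, cost_fp: float) -> float:
    """Total cost of a classifier's errors given per-error costs."""
    return fn * cost_fn + fp * cost_fp


# Hypothetical error counts for two operating points of the same classifier.
HIGH_RECALL = {"fn": 2, "fp": 40}      # few misses, many false alarms
HIGH_PRECISION = {"fn": 20, "fp": 3}   # few false alarms, many misses


def cheaper_option(cost_fn: float, cost_fp: float) -> str:
    """Return the name of the operating point with the lower total cost."""
    recall_cost = total_error_cost(HIGH_RECALL["fn"], HIGH_RECALL["fp"], cost_fn, cost_fp)
    precision_cost = total_error_cost(HIGH_PRECISION["fn"], HIGH_PRECISION["fp"], cost_fn, cost_fp)
    return "high_recall" if recall_cost < precision_cost else "high_precision"


# A missed case is 100x worse than a false alarm (e.g. cancer screening).
print(cheaper_option(cost_fn=100, cost_fp=1))
# A false alarm is 10x worse than a missed case (e.g. business email).
print(cheaper_option(cost_fn=1, cost_fp=10))
```

With these assumed numbers the first call favors the high-recall setting and the second favors the high-precision setting, which mirrors questions 1 and 2.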

---

### **5. How to Adjust Recall vs. Precision?**
- **Increase Recall:** Lower the classification threshold (predict "positive" more liberally).  
- **Increase Precision:** Raise the threshold (only predict "positive" with high confidence).  

**Example in Spam Detection:**  
- **For security (high recall):** Classify even slightly suspicious emails as spam.  
- **For business (high precision):** Only block emails that are almost certainly spam.  
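
The effect of moving the threshold can be sketched on a small toy dataset. The scores and labels below are made up for illustration, and `precision_recall_at` is a hypothetical helper rather than a library function.

```python
def precision_recall_at(threshold, scores, labels):
    """Precision and recall when predicting positive for score >= threshold."""
    tp = fp = fn = 0
    for score, label in zip(scores, labels):
        predicted_positive = score >= threshold
        if predicted_positive and label == 1:
            tp += 1
        elif predicted_positive and label == 0:
            fp += 1
        elif not predicted_positive and label == 1:
            fn += 1
    # Convention used here: report 0.0 when a denominator is zero.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy spam scores (higher = more likely spam) and true labels (1 = spam).
scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.40, 0.30, 0.20]
labels = [1, 1, 0, 1, 1, 0, 1, 0]

for threshold in (0.85, 0.50, 0.25):
    p, r = precision_recall_at(threshold, scores, labels)
    print(f"threshold={threshold:.2f}  precision={p:.2f}  recall={r:.2f}")
```

On this toy data, lowering the threshold from 0.85 to 0.25 raises recall from 0.40 to 1.00 while precision falls from 1.00 to about 0.71.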

---

### **Summary Table**
| **Priority** | **Optimize For** | **Cost of Error**               | **Example**                |
|--------------|------------------|---------------------------------|----------------------------|
| **Recall**   | Minimize FN      | Missing a positive is costly    | Cancer tests, fraud detection |
| **Precision**| Minimize FP      | False alarms are costly         | Email spam (business), legal checks |
| **F1-Score** | Balance both     | Both errors matter moderately   | Customer churn, defect detection |



### **How to Use Precision, Recall, and F1-Score to Evaluate Model Performance**  

When evaluating a classification model (e.g., spam detection, cancer diagnosis), **precision, recall, and F1-score** help assess performance beyond just accuracy. But **whether "larger is better" depends on your goal**. Here’s how to interpret and use them:

---

## **1. Key Definitions & Formulas**
| Metric      | Formula                          | Focus                     |
|-------------|----------------------------------|---------------------------|
| **Precision** | $$\frac{TP}{TP + FP}$$        | **How many predicted positives are correct?** (Avoid FP) |
| **Recall**    | $$\frac{TP}{TP + FN}$$        | **How many actual positives were caught?** (Avoid FN) |
| **F1-Score**  | $$2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}$$ | **Balances precision and recall** |
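
The formulas in the table translate directly into code. The sketch below uses made-up counts (80 TP, 20 FP, 40 FN), and `precision_recall_f1` is an illustrative name. It also checks the equivalent count-based form of F1, 2TP / (2TP + FP + FN).

```python
def precision_recall_f1(tp: int, fp: int, fn: int):
    """Compute precision, recall, and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts, chosen only for illustration.
tp, fp, fn = 80, 20, 40
precision, recall, f1 = precision_recall_f1(tp, fp, fn)
f1_from_counts = 2 * tp / (2 * tp + fp + fn)  # same quantity, written in counts

print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
print(f"f1 from counts={f1_from_counts:.3f}")
```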

---

## **2. When to Prefer Higher Values?**
### **(A) "Larger is Better" Depends on the Problem**
| **Goal**                | **Optimize For** | **When to Use** |
|-------------------------|------------------|----------------|
| **Avoid False Alarms (FP)** | **High Precision** | Spam filtering (business), legal docs |
| **Avoid Missed Cases (FN)**  | **High Recall**    | Cancer detection, fraud prevention |
| **Balance FP & FN**         | **High F1-Score** | Moderate-risk scenarios (e.g., customer churn) |

### **(B) Trade-offs**
- **↑ Precision → ↓ Recall** (Fewer FP but more FN)  
- **↑ Recall → ↓ Precision** (Fewer FN but more FP)  

---

## **3. How to Evaluate Model Performance?**
### **Step 1: Define Business Impact**
- **Is a False Positive (FP) worse?** → Optimize **Precision**.  
  - *Example:* Blocking a legitimate email (FP) is bad.  
- **Is a False Negative (FN) worse?** → Optimize **Recall**.  
  - *Example:* Missing cancer (FN) is deadly.  
- **Need a balance?** → Use **F1-Score**.  

### **Step 2: Check the Confusion Matrix**
|                     | **Actual Positive** | **Actual Negative** |
|---------------------|---------------------|---------------------|
| **Predicted Positive** | True Positive (TP)  | False Positive (FP) |
| **Predicted Negative** | False Negative (FN) | True Negative (TN) |

- **High Recall?** → FN should be **low**.  
- **High Precision?** → FP should be **low**.  
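
As a rough illustration, the four cells of the matrix can be counted from true and predicted labels. The labels below are invented, and `confusion_counts` is a hypothetical helper that assumes binary labels encoded as 0 and 1.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FP, FN, TN for binary labels encoded as 0/1."""
    counts = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for actual, predicted in zip(y_true, y_pred):
        if predicted == 1 and actual == 1:
            counts["TP"] += 1
        elif predicted == 1 and actual == 0:
            counts["FP"] += 1
        elif predicted == 0 and actual == 1:
            counts["FN"] += 1
        else:
            counts["TN"] += 1
    return counts


# Toy labels for illustration only.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

counts = confusion_counts(y_true, y_pred)
print(counts)
```

Here one actual positive was missed (FN = 1) and one actual negative was flagged (FP = 1), so both recall and precision come out at 3/4.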

### **Step 3: Adjust the Classification Threshold**
- **To increase Recall:** Lower the threshold (predict more positives).  
- **To increase Precision:** Raise the threshold (only predict positives with high confidence).  
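
In practice the threshold is often chosen to meet a target rather than adjusted by feel. The sketch below picks the highest threshold that still reaches a minimum recall. The data is a toy example, the function names are hypothetical, and the code assumes the labels contain at least one positive.

```python
def recall_at(threshold, scores, labels):
    """Recall when predicting positive for score >= threshold."""
    positives = sum(labels)  # assumes at least one positive label
    caught = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    return caught / positives


def highest_threshold_for_recall(scores, labels, min_recall):
    """Largest observed score usable as a threshold with recall >= min_recall."""
    for threshold in sorted(set(scores), reverse=True):
        if recall_at(threshold, scores, labels) >= min_recall:
            return threshold
    return None  # no threshold reaches the target (e.g. min_recall > 1)


# Toy scores and labels (1 = positive), for illustration only.
scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.40, 0.30, 0.20]
labels = [1, 1, 0, 1, 1, 0, 1, 0]

print(highest_threshold_for_recall(scores, labels, min_recall=0.8))
print(highest_threshold_for_recall(scores, labels, min_recall=1.0))
```

Choosing the highest threshold that meets the recall target keeps precision as high as the target allows, since every lower threshold can only add more predicted positives.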

### **Step 4: Compare Models**
- If **FN is costly**, pick the model with the **highest recall** at a precision you can tolerate.  
- If **FP is costly**, pick the model with the **highest precision** at a recall you can tolerate.  
- If both matter, pick the model with the **best F1-score**.  
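
The comparison can be sketched as a simple selection over candidate models. The model names and validation metrics below are invented for illustration, and `pick_model` is a hypothetical helper.

```python
# Hypothetical validation metrics for three candidate models.
candidates = {
    "model_a": {"precision": 0.95, "recall": 0.70},
    "model_b": {"precision": 0.62, "recall": 0.97},
    "model_c": {"precision": 0.84, "recall": 0.86},
}


def f1(metrics):
    """F1 from a dict holding 'precision' and 'recall'."""
    p, r = metrics["precision"], metrics["recall"]
    return 2 * p * r / (p + r) if p + r else 0.0


def pick_model(candidates, priority):
    """Return the model name that maximizes 'precision', 'recall', or 'f1'."""
    def score(name):
        metrics = candidates[name]
        return f1(metrics) if priority == "f1" else metrics[priority]
    return max(candidates, key=score)


for priority in ("recall", "precision", "f1"):
    print(priority, "->", pick_model(candidates, priority))
```

With these assumed numbers, each priority selects a different model: `model_b` for recall, `model_a` for precision, and `model_c` for F1.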

---

## **4. Real-World Examples**
### **Example 1: Spam Detection**
- **Goal:** Avoid blocking important emails (FP).  
- **Optimize:** **Precision** (even if some spam slips through).  
- **Evaluation:**  
  - Precision = 95% → Only 5% of flagged emails are mistakes.  
  - Recall = 70% → 30% of spam reaches the inbox (acceptable).  

### **Example 2: Cancer Screening**
- **Goal:** Catch as many cancers as possible (minimize FN).  
- **Optimize:** **Recall** (even if healthy patients get extra tests).  
- **Evaluation:**  
  - Recall = 98% → Only 2% of cancers missed.  
  - Precision = 60% → 40% of "positive" tests are false alarms (acceptable).  
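
The percentages in Examples 1 and 2 can be turned into counts to make them easier to picture. The sketch below assumes 1,000 actual spam emails and 100 actual cancer cases. Those totals are not given above and are chosen only for illustration, as is the helper name `implied_counts`.

```python
def implied_counts(actual_positives, precision, recall):
    """Expected TP, FN, FP implied by a precision/recall pair (may be fractional)."""
    tp = actual_positives * recall   # positives that were caught
    fn = actual_positives - tp       # positives that were missed
    flagged = tp / precision         # all predicted positives (TP + FP)
    fp = flagged - tp                # false alarms
    return {"TP": tp, "FN": fn, "FP": fp}


# Example 1: spam filter with precision 95%, recall 70% (assumed 1,000 spam emails).
spam = implied_counts(1000, precision=0.95, recall=0.70)
# Example 2: cancer screening with recall 98%, precision 60% (assumed 100 cancers).
cancer = implied_counts(100, precision=0.60, recall=0.98)

for name, counts in (("spam", spam), ("cancer", cancer)):
    rounded = {cell: round(value, 1) for cell, value in counts.items()}
    print(name, rounded)
```

Under these assumptions the spam filter lets about 300 spam emails through while blocking roughly 37 legitimate ones, and the cancer screen misses about 2 cases while raising roughly 65 false alarms.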

### **Example 3: Fraud Detection**
- **Goal:** Balance missed fraud (FN) against false accusations (FP), in a setting where both errors are costly.  
- **Optimize:** **F1-Score**.  
- **Evaluation:**  
  - F1 = 85% → Precision and recall are both reasonably high, since F1 drops sharply if either one is low.  

---

## **5. Summary: How to Choose?**
| **Scenario**               | **Priority Metric** | **Why?** |
|----------------------------|---------------------|----------|
| **Medical diagnosis**      | **Recall**          | Missing a disease (FN) is worse than false alarms (FP). |
| **Spam filtering (business)** | **Precision**      | Blocking legitimate emails (FP) is worse than missing spam (FN). |
| **Fraud detection**        | **F1-Score**        | Both errors are costly: too many FPs annoy users, too many FNs lose money. |

### **Final Rule of Thumb**
- **"Larger is better" for the metric that aligns with your goal.**  
- **Trade-offs exist:** For a given model, raising precision usually lowers recall, and vice versa.  
- **Use F1 when both FP and FN matter.**  

