<p>--

### **Q1: Define overfitting and underfitting in machine learning. What are the consequences of each, and how can they be mitigated?**

#### **Overfitting**
Overfitting happens when a model learns the training data too well, capturing noise and fluctuations rather than general patterns. As a result, it performs well on the training set but poorly on new, unseen data.  
**Consequences:**  
- Poor generalization to new data  
- High accuracy on training data but low accuracy on test data  

**Mitigation Strategies:**  
- Reduce model complexity  
- Use more training data  
- Apply regularization (L1, L2)  
- Use dropout in neural networks  
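
To illustrate the dropout item above, here is a minimal sketch of "inverted" dropout written with NumPy. The function name `dropout`, the layer shape, and the drop probability are assumptions chosen for illustration; deep learning frameworks ship their own implementations.

```python
import numpy as np

def dropout(activations, drop_prob, rng, training=True):
    """Inverted dropout: zero each unit with probability drop_prob (must be < 1)."""
    if not training or drop_prob == 0.0:
        return activations  # inference: every unit is used, nothing is rescaled
    keep_prob = 1.0 - drop_prob
    # Bernoulli mask: True means the unit is kept for this forward pass
    mask = rng.random(activations.shape) < keep_prob
    # Rescale the kept units so the expected activation stays the same
    return activations * mask / keep_prob

rng = np.random.default_rng(0)
hidden = np.ones((4, 1000))  # hypothetical layer output
dropped = dropout(hidden, drop_prob=0.5, rng=rng)

print("fraction zeroed:", np.mean(dropped == 0.0))
print("mean activation:", dropped.mean())
```

Because a different random subset of units is silenced on every pass, the network cannot depend on any single unit, which is the intuition behind dropout as a regularizer.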

#### **Underfitting**  
Underfitting happens when a model is too simple and fails to capture the underlying patterns in the data. It performs poorly on both training and test data.  
**Consequences:**  
- High bias and poor accuracy on both training and test data  
- The model is unable to learn meaningful patterns  

**Mitigation Strategies:**  
- Increase model complexity (use more layers or features)  
- Train for a longer duration  
- Reduce regularization  
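
Both failure modes can be seen in one small experiment. The sketch below fits polynomials of increasing degree to noisy samples of a sine wave and compares training and test error. The data, the noise level, and the chosen degrees are all assumptions made for illustration.

```python
import numpy as np
from numpy.polynomial import Polynomial

rng = np.random.default_rng(42)

def make_data(n):
    """Noisy samples of a sine wave (synthetic, for illustration only)."""
    x = rng.uniform(0.0, 1.0, n)
    y = np.sin(2 * np.pi * x) + rng.normal(0.0, 0.2, n)
    return x, y

x_train, y_train = make_data(30)
x_test, y_test = make_data(200)

def mse(model, x, y):
    return float(np.mean((model(x) - y) ** 2))

results = {}
for degree in (1, 5, 12):
    # Least-squares polynomial fit; a higher degree means a more complex model
    model = Polynomial.fit(x_train, y_train, degree)
    results[degree] = (mse(model, x_train, y_train), mse(model, x_test, y_test))
    print(f"degree {degree:2d}: train MSE {results[degree][0]:.3f}, "
          f"test MSE {results[degree][1]:.3f}")
```

Training error can only go down as the degree grows, so it cannot reveal overfitting on its own. The degree-1 line should show high error on both sets (underfitting), while the gap between training and test error for the highest degree is the signal to watch for overfitting.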

---

### **Q2: How can we reduce overfitting? Explain in brief.**  
Overfitting can be reduced using the following techniques:

1. **Cross-validation:** Use k-fold cross-validation to estimate how well the model generalizes and to tune hyperparameters; it exposes overfitting rather than removing it on its own.  
2. **Regularization:** Apply L1 (Lasso) or L2 (Ridge) regularization to penalize large weights.  
3. **Dropout:** Randomly drop neurons during training in neural networks.  
4. **More training data:** Adding more representative samples usually improves generalization.  
5. **Early stopping:** Stop training when validation loss stops improving or starts to rise.  
6. **Pruning:** Reduce the complexity of decision trees.  
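
As one concrete case from the list above, early stopping can be sketched as a small patience-based rule applied to the per-epoch validation loss. The function name, the `patience` and `min_delta` parameters, and the loss values are hypothetical; real training loops usually also restore the weights saved at the best epoch.

```python
def early_stopping(val_losses, patience=3, min_delta=0.0):
    """Return (stop_epoch, best_epoch) for a sequence of per-epoch validation losses.

    Training stops once the loss has failed to improve by more than
    min_delta for `patience` consecutive epochs.
    """
    best_loss = float("inf")
    best_epoch = -1
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            best_loss, best_epoch = loss, epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch, best_epoch  # stop here; best_epoch holds the best model
    return len(val_losses) - 1, best_epoch  # rule never triggered: ran all epochs

# Hypothetical validation curve: improves, then starts rising (overfitting)
val_curve = [0.90, 0.70, 0.55, 0.50, 0.52, 0.56, 0.61, 0.70]
stop_epoch, best_epoch = early_stopping(val_curve, patience=3)
print(f"stopped at epoch {stop_epoch}, best epoch was {best_epoch}")
# prints: stopped at epoch 6, best epoch was 3
```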

---

### **Q3: Explain underfitting. List scenarios where underfitting can occur in ML.**  
Underfitting occurs when a model is too simple to learn the underlying structure of the data.  

**Scenarios where underfitting can occur:**  
- Using a linear model for complex, non-linear relationships.  
- Training a neural network with too few layers or neurons.  
- Insufficient training epochs leading to poor learning.  
- Over-regularization (high lambda values in Ridge/Lasso).  
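
The over-regularization scenario is easy to reproduce. The sketch below uses the closed-form ridge solution on synthetic data whose true relationship is exactly linear, so any loss of fit comes from the penalty alone. The data, the weights in `true_w`, and the lambda values are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_features = 100, 3
X = rng.normal(size=(n_samples, n_features))
true_w = np.array([2.0, -1.0, 0.5])  # made-up ground-truth weights
y = X @ true_w + rng.normal(0.0, 0.1, n_samples)

def ridge_fit(X, y, lam):
    """Closed-form ridge solution: w = (X^T X + lam * I)^(-1) X^T y."""
    identity = np.eye(X.shape[1])
    return np.linalg.solve(X.T @ X + lam * identity, X.T @ y)

fits = {}
for lam in (0.0, 1.0, 1e4):
    w = ridge_fit(X, y, lam)
    train_mse = float(np.mean((X @ w - y) ** 2))
    fits[lam] = (w, train_mse)
    print(f"lambda={lam:>8}: weights={np.round(w, 3)}, train MSE={train_mse:.3f}")
```

With a very large lambda the weights are pushed close to zero and the training error rises sharply, even though a linear model is the right model family for this data.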

---

### **Q4: Explain the bias-variance tradeoff in machine learning. What is the relationship between bias and variance, and how do they affect model performance?**  

The **bias-variance tradeoff** describes the balance between two sources of error:  

- **Bias (Underfitting)**: Error due to overly simplistic assumptions in the model.  
- **Variance (Overfitting)**: Error due to excessive sensitivity to training data noise.  

**Effect on model performance:**  
- **High bias → Underfitting:** Poor performance on both training and test data.  
- **High variance → Overfitting:** Good training accuracy but poor test accuracy.  

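As model complexity increases, bias typically falls while variance rises, so the goal is to find the balance that minimizes total error (for squared loss: bias² + variance + irreducible noise).  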
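
Bias and variance can also be estimated empirically by refitting the same model on many fresh training sets. The sketch below does this for polynomial fits of three degrees. The target function, the noise level, the number of trials, and the helper name `bias_variance` are assumptions chosen for illustration.

```python
import numpy as np
from numpy.polynomial import Polynomial

rng = np.random.default_rng(1)
NOISE_STD = 0.3

def true_fn(x):
    return np.sin(2 * np.pi * x)

x_eval = np.linspace(0.05, 0.95, 50)  # fixed points where predictions are compared

def bias_variance(degree, n_trials=200, n_train=30):
    """Estimate squared bias and variance of a polynomial fit by refitting on fresh samples."""
    preds = np.empty((n_trials, x_eval.size))
    for t in range(n_trials):
        x = rng.uniform(0.0, 1.0, n_train)
        y = true_fn(x) + rng.normal(0.0, NOISE_STD, n_train)
        preds[t] = Polynomial.fit(x, y, degree)(x_eval)
    mean_pred = preds.mean(axis=0)
    bias_sq = float(np.mean((mean_pred - true_fn(x_eval)) ** 2))  # systematic error
    variance = float(np.mean(preds.var(axis=0)))  # sensitivity to the training sample
    return bias_sq, variance

estimates = {degree: bias_variance(degree) for degree in (1, 4, 10)}
for degree, (b, v) in estimates.items():
    print(f"degree {degree:2d}: bias^2={b:.4f}, variance={v:.4f}")
```

The low-degree model should show the largest squared bias and the high-degree model the largest variance, which is the tradeoff described above.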

---

### **Q5: Discuss some common methods for detecting overfitting and underfitting in machine learning models. How can you determine whether your model is overfitting or underfitting?**  

**Methods to detect overfitting:**  
- High training accuracy but low validation/test accuracy.  
- Increasing validation loss while training loss keeps decreasing.  
- Large difference between train and test scores (high variance).  

**Methods to detect underfitting:**  
- Low training and test accuracy.  
- Both training and validation loss are high.  
- The model fails to capture patterns in the data.  
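
These checks can be turned into a rough rule of thumb. The sketch below classifies a model from its training and validation accuracy. The function name and both thresholds are illustrative assumptions rather than standard values, and sensible settings depend on the task and the metric.

```python
def diagnose_fit(train_score, val_score, min_acceptable=0.80, max_gap=0.10):
    """Classify a model from its training and validation accuracy (both in [0, 1]).

    The two thresholds are illustrative assumptions, not standard values.
    """
    gap = train_score - val_score
    if train_score < min_acceptable:
        return "underfitting"  # the model cannot even fit the training data
    if gap > max_gap:
        return "overfitting"  # fits the training data but does not generalize
    return "reasonable fit"

print(diagnose_fit(0.99, 0.72))  # large train/validation gap -> overfitting
print(diagnose_fit(0.61, 0.59))  # low scores on both sets -> underfitting
print(diagnose_fit(0.91, 0.88))  # high scores, small gap -> reasonable fit
```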

---

### **Q6: Compare and contrast bias and variance in machine learning. What are some examples of high bias and high variance models, and how do they differ in terms of their performance?**  

| Aspect | High Bias (Underfitting) | High Variance (Overfitting) |
|--------|----------------------|------------------------|
| Definition | Model is too simple and fails to learn patterns | Model is too complex and captures noise |
| Training Error | High | Low |
| Test Error | High | High |
| Cause | Oversimplification | Over-sensitivity to training data |
| Example | Linear regression on non-linear data | Deep neural networks without regularization |

Examples:  
- **High Bias:** Linear regression on non-linear data.  
- **High Variance:** Decision trees without pruning, deep neural networks without regularization.  

---

### **Q7: What is regularization in machine learning, and how can it be used to prevent overfitting? Describe some common regularization techniques and how they work.**  

**Regularization** is a technique to prevent overfitting by adding a penalty term to the loss function. It discourages complex models and large weight values.
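
To make the penalty term concrete, consider a single weight whose unregularized optimum is `z`. The sketch below gives the exact minimizers of a squared-error loss plus an L2 penalty and plus an L1 penalty in that simplified setting. The function names and the example weights are assumptions for illustration; full models need an iterative or matrix-based solver.

```python
import numpy as np

def ridge_shrink(z, lam):
    """Minimizer of 0.5*(w - z)**2 + 0.5*lam*w**2: scales every weight toward zero."""
    return z / (1.0 + lam)

def lasso_shrink(z, lam):
    """Minimizer of 0.5*(w - z)**2 + lam*|w| (soft-thresholding): small weights become zero."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

unregularized = np.array([3.0, -0.5, 0.2, -2.0])  # made-up weights for illustration
lam = 1.0

ridge_w = ridge_shrink(unregularized, lam)
lasso_w = lasso_shrink(unregularized, lam)
print("L2 (ridge):", ridge_w)
print("L1 (lasso):", lasso_w)
print("non-zero weights after L1:", np.count_nonzero(lasso_w))
```

The L2 penalty shrinks every weight by the same factor and leaves none at exactly zero, while the L1 penalty subtracts a fixed amount and zeroes out any weight smaller than `lam`.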

**Common Regularization Techniques:**  
1. **L1 Regularization (Lasso)** – Shrinks some weights to zero, effectively performing feature selection.  
2. **L2 Regularization (Ridge)** – Shrinks weights but does not eliminate them. Helps reduce overfitting.  
3. **Dropout (for Neural Networks)** – Randomly drops neurons during training to prevent reliance on specific neurons.</p>