Q1. Explain the concept of R-squared in linear regression models. How is it calculated, and what does it
represent?
Q2. Define adjusted R-squared and explain how it differs from the regular R-squared.
Q3. When is it more appropriate to use adjusted R-squared?
Q4. What are RMSE, MSE, and MAE in the context of regression analysis? How are these metrics
calculated, and what do they represent?
Q5. Discuss the advantages and disadvantages of using RMSE, MSE, and MAE as evaluation metrics in
regression analysis.
Q6. Explain the concept of Lasso regularization. How does it differ from Ridge regularization, and when is
it more appropriate to use?
Q7. How do regularized linear models help to prevent overfitting in machine learning? Provide an
example to illustrate.
Q8. Discuss the limitations of regularized linear models and explain why they may not always be the best
choice for regression analysis.
Q9. You are comparing the performance of two regression models using different evaluation metrics.
Model A has an RMSE of 10, while Model B has an MAE of 8. Which model would you choose as the better
performer, and why? Are there any limitations to your choice of metric?
Q10. You are comparing the performance of two regularized linear models using different types of
regularization. Model A uses Ridge regularization with a regularization parameter of 0.1, while Model B
uses Lasso regularization with a regularization parameter of 0.5. Which model would you choose as the
better performer, and why? Are there any trade-offs or limitations to your choice of regularization
method?

### **Q1: R-squared in Linear Regression**
- **Definition**: R-squared (\( R^2 \)) measures the proportion of variance in the dependent variable that is explained by the independent variables in the model.
- **Formula**:

  \[
  R^2 = 1 - \frac{\sum (y_i - \hat{y}_i)^2}{\sum (y_i - \bar{y})^2}
  \]

  where \( y_i \) are actual values, \( \hat{y}_i \) are predicted values, and \( \bar{y} \) is the mean of \( y \).

- **Interpretation**:  
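  - \( R^2 = 1 \) → Model fits the data perfectly.  
  - \( R^2 = 0 \) → Model does no better than always predicting the mean \( \bar{y} \).  
  - \( R^2 < 0 \) is possible on held-out data (or for a model without an intercept) when the model predicts worse than the mean.  
  - A higher \( R^2 \) indicates a closer fit to the data at hand; it does not imply causation or good generalization.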
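
The formula can be checked by hand with a few lines of Python. This is a minimal sketch using only the standard library; the function name `r_squared` and the sample values are made up for illustration.

```python
def r_squared(y_true, y_pred):
    """Return 1 - SS_res / SS_tot for two equal-length sequences."""
    mean_y = sum(y_true) / len(y_true)
    # Residual sum of squares: squared gaps between actual and predicted values.
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    # Total sum of squares: squared gaps between actual values and their mean.
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1 - ss_res / ss_tot


# Hypothetical actual and predicted values.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.5, 6.5, 9.5]

print(r_squared(y_true, y_pred))            # close to 0.95
print(r_squared(y_true, [6.0] * 4))         # predicting the mean gives 0.0
```

Here the residual sum of squares is 1 and the total sum of squares is 20, so \( R^2 = 1 - 1/20 = 0.95 \).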

---

### **Q2: Adjusted R-squared vs. Regular R-squared**
- **Adjusted R-squared** penalizes \( R^2 \) for the number of predictors in the model, so that adding uninformative predictors does not inflate the apparent fit.

  \[
  R^2_{adj} = 1 - \left( \frac{(1 - R^2)(n - 1)}{n - p - 1} \right)
  \]

  where \( n \) is the number of observations, and \( p \) is the number of predictors.

- **Difference**:
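  - **Regular \( R^2 \)** never decreases when a predictor is added, even if the predictor is irrelevant.
  - **Adjusted \( R^2 \)** increases only if the new predictor improves the fit by more than would be expected by chance; otherwise it decreases.
  - Adjusted \( R^2 \) is never larger than regular \( R^2 \).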
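
A short sketch of the adjustment, assuming hypothetical \( R^2 \) values for two models fitted to the same 50 observations. The numbers are invented to show that adjusted \( R^2 \) can fall even when regular \( R^2 \) rises slightly.

```python
def adjusted_r_squared(r2, n, p):
    """Adjusted R^2 for n observations and p predictors."""
    if n - p - 1 <= 0:
        raise ValueError("need more observations than predictors plus one")
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)


# Hypothetical models on the same n = 50 observations.
small = adjusted_r_squared(r2=0.800, n=50, p=5)    # 5 predictors
large = adjusted_r_squared(r2=0.805, n=50, p=10)   # 5 extra, weak predictors

print(small)  # roughly 0.777
print(large)  # roughly 0.755
```

The five extra predictors raise \( R^2 \) by only 0.005, which is not enough to offset the penalty for the added terms, so the adjusted value drops.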

---

### **Q3: When to Use Adjusted R-squared?**
- When comparing models with different numbers of predictors.
- When adding new features, to check whether they genuinely improve the model.
- When guarding against overfitting from too many predictors relative to the number of observations.

---

### **Q4: RMSE, MSE, and MAE in Regression Analysis**
- **Mean Absolute Error (MAE)**: Average absolute difference between actual and predicted values.

  \[
  MAE = \frac{1}{n} \sum |y_i - \hat{y}_i|
  \]

- **Mean Squared Error (MSE)**: Average squared difference between actual and predicted values.

  \[
  MSE = \frac{1}{n} \sum (y_i - \hat{y}_i)^2
  \]

- **Root Mean Squared Error (RMSE)**: Square root of MSE, measuring error in the original units.

  \[
  RMSE = \sqrt{MSE}
  \]

- **Interpretation**:
  - MAE weights every error in proportion to its size, so it is less sensitive to large errors (outliers).
  - MSE and RMSE square the errors, so they penalize large errors more heavily.
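
The three formulas, and their different reactions to one large error, can be sketched as follows. The function names and the two sets of predictions are hypothetical and chosen so that both have the same MAE.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(y - p) for y, p in zip(y_true, y_pred)) / len(y_true)


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as y."""
    return math.sqrt(mse(y_true, y_pred))


y_true = [10.0, 12.0, 14.0, 16.0]
y_pred_even = [12.0, 14.0, 16.0, 18.0]     # every prediction is off by 2
y_pred_outlier = [10.0, 12.0, 14.0, 24.0]  # three exact, one off by 8

for name, y_pred in [("even", y_pred_even), ("outlier", y_pred_outlier)]:
    print(name, mae(y_true, y_pred), mse(y_true, y_pred), rmse(y_true, y_pred))
```

Both sets of predictions have an MAE of 2, but the single error of 8 raises MSE from 4 to 16 and RMSE from 2 to 4.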

---

### **Q5: Advantages and Disadvantages of RMSE, MSE, and MAE**
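| Metric  | Advantages  | Disadvantages  |
|---------|------------|---------------|
| **MAE**  | Same units as the dependent variable; easy to interpret; robust to outliers | Weights all errors linearly, so it does not single out large errors; not differentiable at zero, which complicates gradient-based optimization. |
| **MSE**  | Penalizes large errors more heavily; smooth and differentiable, so convenient to optimize | In squared units, so harder to interpret; sensitive to outliers. |
| **RMSE**  | Same units as the dependent variable; still penalizes large errors more heavily | Sensitive to outliers, so a few large errors can dominate it. |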

---

### **Q6: Lasso Regularization vs. Ridge Regularization**
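- **Lasso (Least Absolute Shrinkage and Selection Operator)**:
  - Adds an **L1 penalty** to the least-squares loss, scaled by the regularization strength \( \lambda \):

    \[
    \lambda \sum |\beta_j|
    \]

  - Shrinks coefficients and can set some to exactly **zero** (feature selection).
  - More appropriate when many features are expected to be irrelevant and a sparse, interpretable model is wanted.

- **Ridge Regularization**:
  - Adds an **L2 penalty**, scaled the same way:

    \[
    \lambda \sum \beta_j^2
    \]

  - Shrinks coefficients toward zero but **does not** set them exactly to zero.
  - More appropriate when most features carry some signal or when features are correlated.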
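
The different behavior of the two penalties is easiest to see with a single feature and no intercept, where both problems have closed-form solutions. The sketch below assumes the objectives \( \sum (y_i - \beta x_i)^2 + \lambda |\beta| \) for Lasso and \( \sum (y_i - \beta x_i)^2 + \lambda \beta^2 \) for Ridge; the function names and data are hypothetical.

```python
def soft_threshold(z, t):
    """Shrink z toward zero by t, returning 0.0 when |z| <= t."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def lasso_coef(x, y, lam):
    """Minimizer of sum((y - b*x)^2) + lam*|b| for one feature, no intercept."""
    sxy = sum(xi * yi for xi, yi in zip(x, y))
    sxx = sum(xi * xi for xi in x)
    return soft_threshold(sxy, lam / 2) / sxx


def ridge_coef(x, y, lam):
    """Minimizer of sum((y - b*x)^2) + lam*b^2 for one feature, no intercept."""
    sxy = sum(xi * yi for xi, yi in zip(x, y))
    sxx = sum(xi * xi for xi in x)
    return sxy / (sxx + lam)


x = [1.0, 2.0, 3.0]
y_weak = [0.1, -0.1, 0.1]    # hypothetical: almost no relationship with x
y_strong = [2.0, 4.0, 6.0]   # hypothetical: y = 2x exactly

for name, y in [("weak", y_weak), ("strong", y_strong)]:
    print(name, "lasso:", lasso_coef(x, y, 1.0), "ridge:", ridge_coef(x, y, 1.0))
```

For the weak feature, Lasso returns exactly zero while Ridge returns a small nonzero coefficient. For the strong feature, both methods shrink the coefficient slightly below 2.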

---

### **Q7: How Regularized Models Prevent Overfitting**
- **Regularization (Lasso/Ridge)** reduces overfitting by penalizing large coefficients.
- **Example**:
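  - Without regularization, a model with many features relative to the number of observations can fit noise in the training data.
  - Ridge keeps all features but shrinks their coefficients, reducing each feature's influence.
  - Lasso can set the coefficients of irrelevant features to zero, making the model simpler.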
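
A toy illustration of the effect, assuming a single feature, no intercept, and Ridge with objective \( \sum (y_i - \beta x_i)^2 + \lambda \beta^2 \). The data are hand-picked (training targets are pure noise around a true relationship of \( y = 0 \)) to make the effect visible. On real data the benefit depends on the noise level and on how \( \lambda \) is tuned.

```python
def fit_ridge_1d(x, y, lam):
    """Closed-form ridge coefficient for one feature; lam = 0 gives least squares."""
    sxy = sum(xi * yi for xi, yi in zip(x, y))
    sxx = sum(xi * xi for xi in x)
    return sxy / (sxx + lam)


def mse(x, y, b):
    """Mean squared error of the prediction b * x."""
    return sum((yi - b * xi) ** 2 for xi, yi in zip(x, y)) / len(x)


# Hypothetical data: the true relationship is y = 0, training targets are noise.
x_train, y_train = [1.0, 2.0, 3.0], [0.5, -0.2, 0.9]
x_test, y_test = [4.0, 5.0, 6.0], [0.0, 0.0, 0.0]

results = {}
for lam in (0.0, 14.0):
    b = fit_ridge_1d(x_train, y_train, lam)
    results[lam] = (b, mse(x_train, y_train, b), mse(x_test, y_test, b))
    print("lambda", lam, "coef", b, "train/test MSE", results[lam][1:])
```

The unregularized fit chases the noise (coefficient about 0.2) and has the lower training error. The regularized fit (coefficient about 0.1) has slightly higher training error but clearly lower test error.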

---

### **Q8: Limitations of Regularized Models**
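- **Lasso**: May remove important features if \(\lambda\) is too high; among correlated features it tends to keep one and drop the others somewhat arbitrarily.
- **Ridge**: Does not perform feature selection, so every feature stays in the model, which makes interpretation harder when there are many features.
- **Both**: Regularization strength (\(\lambda\)) must be carefully tuned, typically by cross-validation; features should be put on a common scale first, because the penalty depends on coefficient magnitude; shrinkage biases the coefficient estimates.
- **Why they may not be the best choice**: The model is still linear, so regularization cannot capture nonlinear relationships or interactions that are not encoded as features. With few predictors and plenty of data, ordinary least squares may already generalize well.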

---

### **Q9: Comparing Model A (RMSE = 10) and Model B (MAE = 8)**
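- The two numbers are not directly comparable: RMSE and MAE are different metrics, and for the same set of predictions RMSE is always greater than or equal to MAE.
- Model A's RMSE of 10 therefore says nothing definite about its MAE, which could be above or below 8; likewise, Model B's RMSE is at least 8 and could exceed 10.
- To choose, compute the same metric for both models on the same test data. Use RMSE if large errors are especially costly, and MAE if all errors matter in proportion to their size.
- **Limitations**: RMSE can be dominated by a few large errors, while MAE can hide them. A single summary number also ignores how the errors are distributed, so inspecting the residuals is worthwhile.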
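
RMSE and MAE summarize the same residuals differently, and RMSE is never smaller than MAE. The sketch below uses hypothetical residuals (not taken from the question) to show that a model with RMSE 10 can still have a lower MAE than a model with MAE 8, so the ranking can flip depending on the metric.

```python
import math


def mae_from_residuals(residuals):
    """Mean absolute error computed directly from residuals."""
    return sum(abs(r) for r in residuals) / len(residuals)


def rmse_from_residuals(residuals):
    """Root mean squared error computed directly from residuals."""
    return math.sqrt(sum(r * r for r in residuals) / len(residuals))


# Hypothetical residuals consistent with the reported numbers.
residuals_a = [0.0, 0.0, 0.0, 20.0]  # one large miss: RMSE is 10
residuals_b = [8.0, 8.0, 8.0, 8.0]   # evenly spread: MAE is 8

for name, res in [("A", residuals_a), ("B", residuals_b)]:
    print(name, "RMSE", rmse_from_residuals(res), "MAE", mae_from_residuals(res))
```

With these residuals, Model A has RMSE 10 and MAE 5, while Model B has RMSE 8 and MAE 8. Model A wins on MAE and Model B wins on RMSE, which is why both models need to be scored with the same metric.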

---

### **Q10: Comparing Ridge (0.1) vs. Lasso (0.5)**
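- The regularization parameters alone do not show which model performs better: \(\lambda\) scales a different penalty in each method (L1 vs. L2), so 0.1 and 0.5 are not comparable, and their effect also depends on feature scaling.
- **Choice**: Compare both models on the same validation data (e.g., with cross-validation) using the same metric, ideally after tuning \(\lambda\) for each.
  - If feature selection or a sparse model is needed → **Lasso**, which at \(\lambda = 0.5\) may set some coefficients to zero.
  - If correlated features exist and all should be kept → **Ridge**, which at \(\lambda = 0.1\) shrinks coefficients without removing any.
- **Trade-offs**: A higher \(\lambda\) increases bias but reduces variance (overfitting). Lasso may arbitrarily keep only one of several correlated features; Ridge yields a denser model that is harder to interpret.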
