### Q1. What is R-squared in Linear Regression?

R-squared (also called the coefficient of determination) tells us how much of the variation in the dependent variable can be explained by the independent variables in the model.

**How it’s calculated**:  
R-squared = 1 - (SS_res / SS_total)  
- SS_res = sum of squared residuals (errors)  
- SS_total = total sum of squares (total variation)

**What it means**:  
An R-squared of 0.80 means 80% of the variation in the output can be explained by the model.
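
The formula above can be sketched in a few lines of plain Python. The function name `r_squared` and the sample values are made up for illustration; in practice a library routine would normally be used instead.

```python
def r_squared(actual, predicted):
    """Return 1 - SS_res / SS_total for two equal-length sequences."""
    mean_actual = sum(actual) / len(actual)
    # SS_res: squared differences between actual and predicted values
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # SS_total: squared differences between actual values and their mean
    ss_total = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_total


# Hypothetical data, chosen only to keep the arithmetic easy to follow
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.5, 5.5, 7.5, 8.5]

r2 = r_squared(actual, predicted)
print(round(r2, 3))  # SS_res = 1.0, SS_total = 20.0, so this prints 0.95
```

Here the model would be said to explain 95% of the variation in `actual`.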

---

### Q2. What is Adjusted R-squared?

Adjusted R-squared is a modified version of R-squared that adjusts for the number of predictors in the model. It increases only if the new predictor improves the model more than expected by chance.

**Why it’s useful**:  
With ordinary least squares, R-squared never goes down when a variable is added, even a useless one. Adjusted R-squared can decrease if the new variable doesn’t help much.
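
The notes above do not spell out the formula. The commonly used one is `1 - (1 - R2) * (n - 1) / (n - p - 1)`, where `n` is the number of observations and `p` the number of predictors. The sketch below applies it to two hypothetical models; the function name and all numbers are assumptions for illustration.

```python
def adjusted_r_squared(r2, n_samples, n_predictors):
    """Adjust R-squared for the number of predictors in the model."""
    return 1 - (1 - r2) * (n_samples - 1) / (n_samples - n_predictors - 1)


# Hypothetical case: three extra predictors raise R-squared only slightly
adj_small = adjusted_r_squared(0.800, n_samples=50, n_predictors=3)
adj_large = adjusted_r_squared(0.802, n_samples=50, n_predictors=6)

print(round(adj_small, 4), round(adj_large, 4))
```

The larger model has the higher R-squared (0.802 vs 0.800) but the lower adjusted R-squared, which is the penalty at work.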

---

### Q3. When to Use Adjusted R-squared?

Use **adjusted R-squared** when comparing models with different numbers of independent variables. It helps guard against overfitting by penalizing variables that add little explanatory power.

---

### Q4. What are RMSE, MSE, and MAE?

These are evaluation metrics used to measure prediction errors in regression:

- **MAE (Mean Absolute Error)**: Average of absolute errors.  
  Formula: sum(abs(actual - predicted)) / n

- **MSE (Mean Squared Error)**: Average of squared errors.  
  Formula: sum((actual - predicted)^2) / n

- **RMSE (Root Mean Squared Error)**: Square root of MSE.  
  Formula: sqrt(MSE)
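
The three formulas translate directly into code. This is a minimal sketch in plain Python with made-up values; the function names `mae`, `mse`, and `rmse` are chosen here for illustration.

```python
import math


def mae(actual, predicted):
    """Mean Absolute Error: average of |actual - predicted|."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mse(actual, predicted):
    """Mean Squared Error: average of (actual - predicted)^2."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root Mean Squared Error: square root of MSE."""
    return math.sqrt(mse(actual, predicted))


# Hypothetical values; the errors are -1, 0, 1, -4
actual = [10.0, 12.0, 14.0, 16.0]
predicted = [11.0, 12.0, 13.0, 20.0]

print(mae(actual, predicted))             # 1.5
print(mse(actual, predicted))             # 4.5
print(round(rmse(actual, predicted), 4))  # 2.1213
```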

---

### Q5. Advantages and Disadvantages of RMSE, MSE, MAE

- **MAE**:
  - + Easy to understand
  - – Weights errors in proportion to their size, so large mistakes get no extra penalty

- **MSE**:
  - + Penalizes large errors more (useful if big mistakes are worse)
  - – Expressed in squared units, so harder to interpret; also sensitive to outliers

- **RMSE**:
  - + Penalizes large errors, and result is in original units
  - – Sensitive to outliers
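
To make the outlier point concrete, the sketch below computes MAE and RMSE from two hypothetical lists of errors that differ in a single value. The helper names and numbers are assumptions for illustration.

```python
import math


def mae_from_errors(errors):
    """Average absolute error."""
    return sum(abs(e) for e in errors) / len(errors)


def rmse_from_errors(errors):
    """Square root of the average squared error."""
    return math.sqrt(sum(e ** 2 for e in errors) / len(errors))


# Same errors, except the last one is replaced by a large miss
clean_errors = [1.0, -1.0, 1.0, -1.0]
outlier_errors = [1.0, -1.0, 1.0, -10.0]

mae_clean = mae_from_errors(clean_errors)
rmse_clean = rmse_from_errors(clean_errors)
mae_outlier = mae_from_errors(outlier_errors)
rmse_outlier = rmse_from_errors(outlier_errors)

print(mae_clean, rmse_clean)                # 1.0 1.0
print(mae_outlier, round(rmse_outlier, 3))  # 3.25 5.074
```

One bad prediction roughly triples MAE but multiplies RMSE by about five, because the error is squared before averaging.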

---

### Q6. What is Lasso Regularization?

Lasso (Least Absolute Shrinkage and Selection Operator) adds a penalty proportional to the sum of the **absolute** values of the coefficients (the L1 norm) to the loss function.

**Key difference from Ridge**:
- **Lasso** can shrink some coefficients to zero (feature selection).
- **Ridge** shrinks them towards zero but generally not to exactly zero.

Use **Lasso** when you want to remove unnecessary features.
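
The difference is easiest to see in a textbook simplification. Assuming the predictors are orthonormal and the loss is scaled so the threshold equals `lam`, Lasso subtracts a fixed amount from each least-squares coefficient (soft-thresholding), while Ridge divides each one by a constant. The sketch below illustrates that special case only; it is not how a general solver works, and the coefficient values are made up.

```python
def lasso_shrink(coef, lam):
    """Soft-thresholding: move coef toward zero by lam, stopping at zero."""
    if coef > lam:
        return coef - lam
    if coef < -lam:
        return coef + lam
    return 0.0


def ridge_shrink(coef, lam):
    """Proportional shrinkage: scale coef toward zero without reaching it."""
    return coef / (1 + lam)


# Hypothetical least-squares coefficients; two of them are small
ols_coefs = [3.0, -0.4, 0.2, -2.0]
lam = 0.5

lasso_coefs = [lasso_shrink(c, lam) for c in ols_coefs]
ridge_coefs = [ridge_shrink(c, lam) for c in ols_coefs]

print(lasso_coefs)  # [2.5, 0.0, 0.0, -1.5]
print([round(c, 3) for c in ridge_coefs])
```

The two small coefficients become exactly zero under Lasso, while Ridge keeps every coefficient non-zero.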

---

### Q7. How Do Regularized Models Prevent Overfitting?

Regularized models add a penalty to the loss function to reduce model complexity. This discourages the model from fitting noise in the training data.

**Example**: In a dataset with many predictors, Lasso can drop irrelevant ones, and Ridge can reduce the impact of less useful features.
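
The idea of "loss plus penalty" can be sketched as a single function. This is an illustrative form only: it uses MSE as the base loss, and real libraries scale the two terms in their own ways. The name `penalized_loss` and all values are assumptions.

```python
def penalized_loss(actual, predicted, coefs, alpha, penalty="l1"):
    """Return MSE plus alpha times an L1 (Lasso) or L2 (Ridge) penalty."""
    n = len(actual)
    mse = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / n
    if penalty == "l1":
        penalty_term = sum(abs(c) for c in coefs)   # Lasso-style
    elif penalty == "l2":
        penalty_term = sum(c ** 2 for c in coefs)   # Ridge-style
    else:
        raise ValueError("penalty must be 'l1' or 'l2'")
    return mse + alpha * penalty_term


# Hypothetical fit: one prediction is off by 1, coefficients are 2 and -1
actual = [1.0, 2.0, 3.0]
predicted = [1.0, 2.0, 4.0]
coefs = [2.0, -1.0]

l1_loss = penalized_loss(actual, predicted, coefs, alpha=0.1, penalty="l1")
l2_loss = penalized_loss(actual, predicted, coefs, alpha=0.1, penalty="l2")
print(round(l1_loss, 4), round(l2_loss, 4))  # 0.6333 0.8333
```

Because larger coefficients raise the total loss, the optimizer only keeps them large when they reduce the prediction error by more than the penalty costs.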

---

### Q8. Limitations of Regularized Linear Models

- Might not perform well on **non-linear relationships** unless combined with polynomial features.
- Choosing the right **regularization strength (alpha)** requires tuning.
- **Lasso** might drop too many features if alpha is too high.
- **Ridge** can’t perform feature selection.

---

### Q9. Comparing Model A (RMSE = 10) and Model B (MAE = 8)

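You can’t directly compare Model A’s RMSE with Model B’s MAE: they are different metrics. For any single set of predictions, RMSE is always at least as large as MAE, so an RMSE of 10 being higher than an MAE of 8 says nothing about which model is better.

But generally:
- RMSE emphasizes large errors more than MAE.
- For the same model, an RMSE much higher than the MAE points to a few large errors rather than many similar-sized ones.

**Limitations**: RMSE might look worse even if most predictions are good but a few are really bad. MAE is less affected by such outliers.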

Best approach: compare both metrics **on both models**, not one per model.
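
A like-for-like comparison might look like the sketch below, which scores two hypothetical models on the same targets with both metrics. All values and names are invented for illustration.

```python
import math


def mae(actual, predicted):
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


# Hypothetical test targets and predictions from two models
actual = [50.0, 60.0, 70.0, 80.0, 90.0]
predictions = {
    "A": [51.0, 59.0, 71.0, 79.0, 75.0],  # mostly close, one large miss
    "B": [55.0, 55.0, 75.0, 75.0, 95.0],  # consistently off by 5
}

scores = {
    name: {"mae": mae(actual, pred), "rmse": rmse(actual, pred)}
    for name, pred in predictions.items()
}

for name, s in scores.items():
    print(name, round(s["mae"], 2), round(s["rmse"], 2))
```

In this made-up case Model A wins on MAE (3.8 vs 5.0) and Model B wins on RMSE (5.0 vs about 6.77), so the preferred model depends on how costly large errors are.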

---

### Q10. Comparing Ridge (Model A) vs Lasso (Model B)

- **Model A** uses Ridge (alpha = 0.1)
- **Model B** uses Lasso (alpha = 0.5)

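The two alpha values are not directly comparable, because they scale different penalties (L1 for Lasso, L2 for Ridge). If Lasso matches or improves performance **and reduces model complexity** by removing irrelevant features, it might be preferred.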

**Trade-offs**:
- Lasso may ignore important but weak predictors.
- Ridge keeps all features, useful if all have some contribution.

Final decision depends on:
- Model accuracy on test data
- Need for feature selection
- Simplicity vs completeness
