Q1. Explain the concept of R-squared in linear regression models. How is it calculated, and what does it
represent?

Answer = **R-squared in Linear Regression**

R-squared, or the coefficient of determination, measures how well the regression line fits the data. It shows the proportion of variation in the dependent variable (Y) that is explained by the independent variable(s) (X).

**Calculation:**

   **R^2 = 1 - (SS_residual / SS_total)**


- SS_residual: Sum of squared differences between actual and predicted values.

- SS_total: Sum of squared differences between actual values and their mean.

**Interpretation:**

- 0: Model explains none of the variance.
- 1: Model explains all the variance.

A higher R-squared means a better fit to the data the model was trained on, but it does not guarantee the model is good, so always check other metrics.
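The calculation above can be sketched in a few lines of plain Python. The function name `r_squared` and the sample values are illustrative assumptions, not part of the original answer.

```python
def r_squared(actual, predicted):
    """Return R^2 = 1 - SS_residual / SS_total for two equal-length sequences."""
    mean_actual = sum(actual) / len(actual)
    # SS_residual: squared differences between actual and predicted values.
    ss_residual = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # SS_total: squared differences between actual values and their mean.
    ss_total = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_residual / ss_total


# Hypothetical data, for illustration only.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.5, 6.5, 9.5]
print(r_squared(y_true, y_pred))
```

Predicting the mean of Y for every point gives an R-squared of 0, and perfect predictions give 1, matching the interpretation above.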

Q2. Define adjusted R-squared and explain how it differs from the regular R-squared.



Answer =

 **Adjusted R-squared:** This measures the proportion of variation explained by a multiple regression model, adjusted for the number of predictors, and helps us determine goodness of fit.

**Formula**

Adjusted R^2 = 1 - ((1 - R^2)(N - 1)) / (N - P - 1)

where,

- N = no. of data points

- R^2 = R-squared

- P = no. of independent features

**Key Difference:**

**R-squared:** Increases or stays the same as more predictors are added, even if they don’t improve the model.

**Adjusted R-squared:** Increases only if an added predictor improves the fit by more than would be expected by chance; otherwise it decreases.
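A minimal sketch of the formula, assuming hypothetical R-squared values for a model before and after adding one weak predictor. The function name `adjusted_r_squared` and all numbers are made up for illustration.

```python
def adjusted_r_squared(r2, n, p):
    """Adjusted R^2 = 1 - (1 - R^2)(N - 1) / (N - P - 1)."""
    if n - p - 1 <= 0:
        raise ValueError("need more data points than predictors plus one")
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)


# Hypothetical case: a third predictor nudges R^2 from 0.800 to 0.801 with N = 30.
before = adjusted_r_squared(0.800, n=30, p=2)
after = adjusted_r_squared(0.801, n=30, p=3)
print(round(before, 4), round(after, 4))
```

Here R-squared rises slightly, yet adjusted R-squared falls, because the small gain does not offset the penalty for the extra predictor.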


Q3. When is it more appropriate to use adjusted R-squared?

Answer = **When to Use Adjusted R-squared**:

Adjusted R-squared is more appropriate when dealing with multiple predictors in a regression model. It accounts for the number of predictors and avoids overestimating the model’s performance by penalizing irrelevant variables.

**Key Scenarios:**
- When adding predictors to check if they genuinely improve the model.
- When comparing models with different numbers of predictors to ensure fairness.

Q4. What are RMSE, MSE, and MAE in the context of regression analysis? How are these metrics
calculated, and what do they represent?

Answer =

 1. **Mean Absolute Error (MAE):**
Measures the average absolute difference between predicted and actual values. It gives equal weight to all errors.

2. **Mean Squared Error (MSE):**
Calculates the average of squared differences between predicted and actual values. It penalizes larger errors more heavily.

3. **Root Mean Squared Error (RMSE):**
Square root of MSE. It is in the same units as the target variable, making it easier to interpret.

**Key Differences:**

- MAE: Focuses on absolute errors and is less sensitive to outliers.
- MSE: Penalizes larger errors more due to squaring.
- RMSE: Adds interpretability by bringing errors back to the original scale.
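The three metrics can be computed as follows. This is a plain-Python sketch with hypothetical function names and data; libraries such as scikit-learn provide their own equivalents.

```python
import math


def mae(actual, predicted):
    """Mean of absolute differences."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mse(actual, predicted):
    """Mean of squared differences."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Square root of MSE, in the same units as the target."""
    return math.sqrt(mse(actual, predicted))


# Hypothetical data: the absolute errors are 2, 2, 3 and 5.
y_true = [10.0, 20.0, 30.0, 40.0]
y_pred = [12.0, 18.0, 33.0, 35.0]
print(mae(y_true, y_pred), mse(y_true, y_pred), rmse(y_true, y_pred))
```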

Q5. Discuss the advantages and disadvantages of using RMSE, MSE, and MAE as evaluation metrics in
regression analysis.


Answer =
1. **Mean Absolute Error (MAE):**

 - Advantages:
 1. Easy to understand and interpret.
 2. Treats all errors equally, avoiding overemphasis on outliers.
 - Disadvantages:
 1. May not reflect model performance well when large errors matter more.


2. **Mean Squared Error (MSE):**

 - Advantages:
 1. Penalizes large errors more, making it useful when larger deviations are critical.
 2. Differentiable everywhere with smooth gradients, which aids gradient-based optimization in machine learning models.
 - Disadvantages:
 1. Harder to interpret as it’s not in the same units as the target variable.
 2. Overly sensitive to outliers.

3. **Root Mean Squared Error (RMSE):**

 - Advantages:
 1. Same units as the target variable, making interpretation intuitive.
 2. Highlights large errors, useful when big mistakes are costly.
 - Disadvantages:
 1. Also sensitive to outliers, similar to MSE.
 2. Marginally more computation than MAE because of the squaring and square root, though the difference is rarely significant in practice.


**Choosing the Metric:**

- Use MAE when the data contain outliers that should not dominate the evaluation.
- Use MSE or RMSE when large errors need greater emphasis.
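To make the outlier sensitivity concrete, the sketch below compares MAE and RMSE on two hypothetical sets of residuals that differ only in one large error. The helper names and values are assumptions for illustration.

```python
import math


def mae(errors):
    return sum(abs(e) for e in errors) / len(errors)


def rmse(errors):
    return math.sqrt(sum(e * e for e in errors) / len(errors))


# Hypothetical residuals: identical except for the last one.
clean = [1.0, 1.0, 1.0, 1.0, 1.0]
with_outlier = [1.0, 1.0, 1.0, 1.0, 21.0]

mae_growth = mae(with_outlier) / mae(clean)
rmse_growth = rmse(with_outlier) / rmse(clean)
print("MAE grows by a factor of", mae_growth)
print("RMSE grows by a factor of", round(rmse_growth, 2))
```

A single large residual raises both metrics, but RMSE rises by a larger factor because the error is squared before averaging.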

Q6. Explain the concept of Lasso regularization. How does it differ from Ridge regularization, and when is
it more appropriate to use?


Answer= **Lasso Regularization (L1 Regularization)**: Lasso regression, or least absolute shrinkage and selection operator, is a statistical method that uses regularization to improve the accuracy of predictive models.


**How it differs from Ridge regularization:**

- **Lasso regression**
Uses the L1 penalty, which penalizes the sum of the absolute values of coefficients. This allows lasso to set some coefficients exactly to zero, which is useful for feature selection.

- **Ridge regression**
Uses the L2 penalty, which penalizes the sum of the squared values of coefficients. This shrinks coefficients toward zero without setting them exactly to zero, which is useful for reducing model complexity.

**When to Use Lasso:**

- We need a simpler model with fewer predictors.
- Our data has high dimensionality with potentially irrelevant features.
- Feature selection is critical for interpretability.
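The difference between the two penalties shows up clearly in a simplified setting. The sketch below assumes an orthonormal design matrix, where each coefficient can be shrunk independently: the lasso objective ½‖y − Xβ‖² + λ‖β‖₁ reduces to soft-thresholding of the least-squares coefficient, and the ridge objective ½‖y − Xβ‖² + (λ/2)‖β‖² reduces to dividing it by (1 + λ). The function names and coefficient values are hypothetical.

```python
def lasso_shrink(beta_ols, lam):
    """Soft-thresholding: move the coefficient toward zero by lam, stopping at zero."""
    if beta_ols > lam:
        return beta_ols - lam
    if beta_ols < -lam:
        return beta_ols + lam
    return 0.0


def ridge_shrink(beta_ols, lam):
    """Proportional shrinkage: scales the coefficient but never zeroes it."""
    return beta_ols / (1 + lam)


# Hypothetical least-squares coefficients: two strong, two weak.
ols_coefficients = [3.0, -1.5, 0.2, -0.1]
lam = 0.5
lasso = [lasso_shrink(b, lam) for b in ols_coefficients]
ridge = [ridge_shrink(b, lam) for b in ols_coefficients]
print("lasso:", lasso)
print("ridge:", ridge)
```

With these values the two weak coefficients become exactly zero under the lasso rule, while the ridge rule keeps all four nonzero. Real datasets rarely have an orthonormal design, so this is an intuition aid rather than a general solver.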

Q7. How do regularized linear models help to prevent overfitting in machine learning? Provide an
example to illustrate.

Answer =
**Regularized Linear Models and Overfitting**

Regularized models like Lasso and Ridge add a penalty on large coefficients, reducing model complexity. This discourages the model from fitting noise in the training data, which is what causes overfitting.

**Example:**
In a dataset with 100 features, a plain linear regression may overfit by assigning large weights to irrelevant features. Ridge shrinks all coefficients, while Lasso eliminates irrelevant ones by setting them to zero, creating a more generalized model.

**Result:**
Regularization balances bias and variance, improving performance on unseen data.
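As a much smaller illustration than the 100-feature case, the sketch below fits a single-feature, no-intercept ridge model in closed form and shows the coefficient shrinking as the penalty grows. The closed form w = Σxy / (Σx² + λ) follows from minimizing Σ(y − wx)² + λw²; the data and the name `ridge_slope` are hypothetical.

```python
def ridge_slope(xs, ys, lam):
    """Closed-form slope minimizing sum((y - w*x)^2) + lam * w^2 (no intercept)."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)


# Hypothetical noisy data with a true slope near 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [1.2, 1.9, 3.2, 3.8]

slopes = [ridge_slope(xs, ys, lam) for lam in (0.0, 10.0, 100.0)]
print([round(w, 3) for w in slopes])
```

A penalty of 0 recovers ordinary least squares. Larger penalties pull the slope toward zero, trading some bias for lower variance; whether that trade improves performance on unseen data depends on the dataset.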

Q8. Discuss the limitations of regularized linear models and explain why they may not always be the best
choice for regression analysis.

Answer:
Limitations of Regularized Linear Models

 1. **Interpretability Issues:**
Regularization alters coefficients, making it harder to interpret feature importance directly.

2. **Feature Scaling Required:**
The penalty depends on coefficient magnitude, so features should be standardized or normalized first; otherwise features measured on different scales are penalized unevenly.

3. **Sensitivity to Hyperparameters:**
Performance depends on choosing the right regularization parameter (λ), requiring cross-validation.

4. **Feature Selection in Ridge:**
Ridge retains all features, even irrelevant ones, as it only shrinks coefficients, unlike Lasso.

5. **Non-linear Relationships:**
Regularized linear models assume linearity and may perform poorly with non-linear relationships without proper feature engineering.

6. **Correlated Features:**
Lasso may struggle when features are highly correlated, as it tends to select one feature from the group somewhat arbitrarily and ignore the others.
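For the feature scaling point in the list above, a minimal z-score standardization sketch is shown below. The column names and values are hypothetical, and the population standard deviation is used as one common convention.

```python
import math


def standardize(column):
    """Rescale a column to mean 0 and (population) standard deviation 1."""
    mean = sum(column) / len(column)
    variance = sum((v - mean) ** 2 for v in column) / len(column)
    std = math.sqrt(variance)
    if std == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in column]
    return [(v - mean) / std for v in column]


# Hypothetical features on very different scales.
income = [30000.0, 45000.0, 60000.0, 75000.0]
age = [25.0, 35.0, 45.0, 55.0]

income_z = standardize(income)
age_z = standardize(age)
print([round(v, 3) for v in income_z])
print([round(v, 3) for v in age_z])
```

After standardization both columns are on the same scale, so a single penalty strength treats their coefficients comparably.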



Q9. You are comparing the performance of two regression models using different evaluation metrics.
Model A has an RMSE of 10, while Model B has an MAE of 8. Which model would you choose as the better
performer, and why? Are there any limitations to your choice of metric?

Answer= Choosing Between Model A and Model B

The choice depends on the context and the importance of error types:

- **Model A (RMSE = 10):** RMSE penalizes larger errors more, so it’s better if large errors significantly impact outcomes.
- **Model B (MAE = 8):** MAE treats all errors equally, making it preferable if all errors, regardless of size, are equally important.

**Better Model:**
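Without more context, neither model can be declared better. Model B's number is lower, but an MAE of 8 and an RMSE of 10 are different measurements: for any single model, RMSE is at least as large as MAE, so Model A could have an MAE below 8.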

**Limitations of Metrics:**

1. Different Scales: RMSE and MAE are not directly comparable as they measure errors differently.
2. Outliers: RMSE is sensitive to outliers, which may skew results if the dataset contains extreme values.
3. Context Dependency: The metric should align with the problem. For instance, in predicting house prices, large errors might matter more, favoring RMSE.

**Conclusion:** Evaluate both metrics for each model or choose based on problem-specific priorities.
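The sketch below shows why the two reported numbers cannot be ranked directly. The residuals are hypothetical, chosen so that Model A has an RMSE of 10 and Model B has an MAE of 8, matching the question; nothing is known about the real models' errors.

```python
import math


def mae(errors):
    return sum(abs(e) for e in errors) / len(errors)


def rmse(errors):
    return math.sqrt(sum(e * e for e in errors) / len(errors))


# Hypothetical residuals consistent with the reported metrics.
errors_a = [0.0, 0.0, 0.0, 20.0]  # mostly exact, one large miss
errors_b = [8.0, 8.0, 8.0, 8.0]   # consistently off by 8

for name, errors in [("A", errors_a), ("B", errors_b)]:
    print(name, "MAE =", mae(errors), "RMSE =", rmse(errors))
```

Under these assumed residuals, Model A wins on MAE (5 versus 8) while Model B wins on RMSE (8 versus 10), so the ranking flips depending on the metric. Computing the same metric for both models removes the ambiguity.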

Q10. You are comparing the performance of two regularized linear models using different types of
regularization. Model A uses Ridge regularization with a regularization parameter of 0.1, while Model B
uses Lasso regularization with a regularization parameter of 0.5. Which model would you choose as the
better performer, and why? Are there any trade-offs or limitations to your choice of regularization
method?

Answer= Choosing Between Model A (Ridge) and Model B (Lasso)

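The better model depends on the dataset and goals. The parameter values 0.1 and 0.5 are not directly comparable, because the L1 and L2 penalties scale differently: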

- Model A (Ridge): Ridge regularization (L2) is ideal when all features are relevant and multicollinearity exists, as it shrinks coefficients without setting them to zero.

- Model B (Lasso): Lasso regularization (L1) is better for feature selection as it can shrink irrelevant coefficients to zero, simplifying the model.

**Choice:**

- If interpretability and feature selection are priorities, choose Model B.
- If multicollinearity or the presence of many small but relevant features matters, choose Model A.

**Trade-offs and Limitations:**

1. **Lasso:**

   - May arbitrarily exclude one feature in correlated groups.
   - Can struggle with high-dimensional data where relevant features are weakly correlated with the target.

2. **Ridge:**

   - Retains all features, even irrelevant ones, leading to less interpretability.
   - May not reduce complexity as effectively as Lasso.

**Conclusion:**
The choice depends on the specific dataset, the presence of irrelevant features, and the importance of feature interpretability. Consider cross-validation to evaluate both models comprehensively.
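A rough sketch of how cross-validation can compare regularization strengths is shown below. To stay self-contained it uses a single-feature, no-intercept ridge model with a closed-form fit; the data, the candidate values, and the helper names are all hypothetical. The same loop structure would apply to comparing a Ridge and a Lasso model on held-out error.

```python
def fit_ridge_1d(xs, ys, lam):
    """Closed-form no-intercept ridge slope: sum(x*y) / (sum(x^2) + lam)."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)


def cv_mse(xs, ys, lam, k):
    """Average held-out MSE over k folds; fold i holds out every k-th point from i."""
    fold_errors = []
    for fold in range(k):
        pairs = list(enumerate(zip(xs, ys)))
        train = [(x, y) for i, (x, y) in pairs if i % k != fold]
        held_out = [(x, y) for i, (x, y) in pairs if i % k == fold]
        w = fit_ridge_1d([x for x, _ in train], [y for _, y in train], lam)
        fold_errors.append(sum((y - w * x) ** 2 for x, y in held_out) / len(held_out))
    return sum(fold_errors) / k


# Hypothetical data with a slope near 1.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [1.1, 2.3, 2.8, 4.2, 4.9, 6.1]

candidates = [0.0, 0.1, 0.5, 1.0, 10.0]
scores = {lam: cv_mse(xs, ys, lam, k=3) for lam in candidates}
best_lam = min(scores, key=scores.get)
print("best lambda:", best_lam)
```

The candidate with the lowest average held-out error is selected. On this clean toy data a heavy penalty such as 10 hurts, but on noisier or higher-dimensional data a nonzero penalty often wins.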

