
## Q1: Simple Linear Regression vs. Multiple Linear Regression

### Simple Linear Regression
Simple Linear Regression models the relationship between two variables: one independent variable (predictor) and one dependent variable (response). It assumes the relationship is linear and finds the best-fitting line, defined as the line that minimizes the sum of squared differences between the observed and predicted values (ordinary least squares).

**Example:**
Suppose we want to predict a student's final exam score (dependent variable) based on the number of hours they studied (independent variable).
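
A minimal sketch of this example in plain Python, using the closed-form least squares solution (slope = covariance of x and y divided by variance of x). The `hours` and `scores` values and the function name `fit_simple_linear_regression` are made up for illustration.

```python
# Hypothetical data: hours studied and final exam scores (invented for illustration).
hours = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = [52.0, 58.0, 61.0, 70.0, 74.0]


def fit_simple_linear_regression(x, y):
    """Return (slope, intercept) of the ordinary least squares line."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Slope = sum of cross-deviations / sum of squared deviations of x.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    # The least squares line always passes through (mean_x, mean_y).
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_simple_linear_regression(hours, scores)
predicted = [intercept + slope * h for h in hours]
# Sum of squared errors: the quantity the fitted line minimizes.
sse = sum((y - p) ** 2 for y, p in zip(scores, predicted))
print(f"slope={slope:.2f}, intercept={intercept:.2f}, SSE={sse:.2f}")
```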

### Multiple Linear Regression
Multiple Linear Regression extends the concept of linear regression to multiple independent variables. It models the relationship between one dependent variable and two or more independent variables. It aims to find the best-fitting hyperplane in a higher-dimensional space.

**Example:**
Predicting a house's price (dependent variable) based on various features like square footage, number of bedrooms, and distance to the city center (independent variables).

## Q2: Assumptions of Linear Regression and Checking Assumptions

Assumptions of Linear Regression:
1. Linearity: The relationship between independent and dependent variables is linear.
2. Independence: Residuals (differences between observed and predicted values) are independent of each other.
3. Homoscedasticity: Residuals have constant variance across all levels of the predictor variables.
4. Normality: Residuals are normally distributed.

To check assumptions:
- **Residual Plot:** Plot residuals vs. predicted values to check linearity and homoscedasticity.
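- **Normality Plot:** Use a normal Q-Q plot, which plots the quantiles of the residuals against the theoretical quantiles of a normal distribution; points close to a straight line suggest normality.
- **Durbin-Watson Test:** Checks for first-order autocorrelation in the residuals. The statistic ranges from 0 to 4, and values near 2 suggest the residuals are independent.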
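- **Variance Inflation Factor (VIF):** Checks for multicollinearity among the predictors. This is not one of the four residual assumptions above, but it matters in multiple regression (see Q6).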
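
As one illustration of these checks, the Durbin-Watson statistic can be computed directly from its definition: the sum of squared differences between successive residuals divided by the sum of squared residuals. The sketch below is plain Python; the two residual series are invented, and the residuals are assumed to be ordered by observation.

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic for residuals ordered by observation.

    Values near 2 suggest no first-order autocorrelation, values toward 0
    suggest positive autocorrelation, values toward 4 suggest negative.
    """
    numerator = sum(
        (residuals[i] - residuals[i - 1]) ** 2 for i in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator


# Hypothetical residual series (invented for illustration).
alternating = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]  # sign flips every step
trending = [-3.0, -2.0, -1.0, 1.0, 2.0, 3.0]     # neighbours are similar

dw_alternating = durbin_watson(alternating)  # well above 2
dw_trending = durbin_watson(trending)        # well below 2
print(f"alternating: {dw_alternating:.2f}, trending: {dw_trending:.2f}")
```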

## Q3: Interpretation of Slope and Intercept

In a simple linear regression equation (y = mx + b), the slope (m) represents the expected change in the dependent variable for a one-unit increase in the independent variable. The intercept (b) represents the predicted value of the dependent variable when the independent variable is zero; it is only meaningful in practice if zero lies within, or close to, the range of the observed data.

**Example:**
For the student's exam score prediction, the slope represents the increase in the final exam score for each additional hour studied. The intercept represents the expected score when the student hasn't studied at all.
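
The interpretation can be checked numerically. In this small sketch the `slope` and `intercept` values are assumed rather than estimated from real data, and `predict_score` is a hypothetical helper.

```python
# Assumed coefficients for the exam example (not estimated from real data).
slope = 5.6       # points gained per additional hour studied
intercept = 46.2  # predicted score at zero hours studied


def predict_score(hours):
    """Predicted exam score from y = m*x + b."""
    return slope * hours + intercept


score_at_zero = predict_score(0)                   # equals the intercept
gain_per_hour = predict_score(4) - predict_score(3)  # equals the slope
print(score_at_zero, round(gain_per_hour, 2))
```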

## Q4: Gradient Descent in Machine Learning

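Gradient Descent is an optimization algorithm used to minimize the loss function in machine learning models. Starting from an initial guess, it repeatedly moves the model parameters a small step in the direction of steepest descent (the negative gradient of the loss), with the step size controlled by a learning rate. For convex losses such as the mean squared error of linear regression it converges to the global minimum when the learning rate is small enough; for non-convex losses it may settle in a local minimum. It is widely used to train machine learning models, including linear regression and neural networks.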
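
A minimal sketch of batch gradient descent for a one-feature linear model, minimizing mean squared error. The data, the function name `gradient_descent`, and the `learning_rate` and `n_iterations` defaults are assumptions chosen so that this small example converges; other data would likely need different settings.

```python
def gradient_descent(x, y, learning_rate=0.05, n_iterations=5000):
    """Fit y ~ m*x + b by batch gradient descent on mean squared error."""
    m, b = 0.0, 0.0  # initial guess
    n = len(x)
    for _ in range(n_iterations):
        # Prediction errors under the current parameters.
        errors = [(m * xi + b) - yi for xi, yi in zip(x, y)]
        # Partial derivatives of MSE = (1/n) * sum(error^2) w.r.t. m and b.
        grad_m = (2.0 / n) * sum(e * xi for e, xi in zip(errors, x))
        grad_b = (2.0 / n) * sum(errors)
        # Step against the gradient.
        m -= learning_rate * grad_m
        b -= learning_rate * grad_b
    return m, b


# Hypothetical data: hours studied vs. exam score (invented for illustration).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [52.0, 58.0, 61.0, 70.0, 74.0]
m, b = gradient_descent(x, y)
print(f"m={m:.3f}, b={b:.3f}")
```

For this data the result should agree with the closed-form least squares solution; a learning rate that is too large would make the updates diverge instead.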

## Q5: Multiple Linear Regression Model

Multiple Linear Regression models the relationship between a dependent variable and multiple independent variables. The model equation is:

y = b0 + b1*x1 + b2*x2 + ... + bn*xn + ε

Where:
- y is the dependent variable.
- b0 is the intercept.
- b1, b2, ..., bn are the coefficients for the independent variables x1, x2, ..., xn.
- ε is the error term.
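
The equation above can be fitted by least squares. The sketch below assumes NumPy is available and uses invented housing data; the targets are generated without noise (ε = 0) from assumed coefficients, so the recovered values can be compared against them.

```python
import numpy as np

# Hypothetical features: [square footage, bedrooms, distance to city center].
X = np.array([
    [1400.0, 3.0, 10.0],
    [1600.0, 3.0, 8.0],
    [1700.0, 4.0, 12.0],
    [1875.0, 4.0, 5.0],
    [1100.0, 2.0, 15.0],
    [2350.0, 5.0, 3.0],
])

# Assumed "true" coefficients b0, b1, b2, b3 (invented for illustration).
true_coefficients = np.array([50_000.0, 120.0, 10_000.0, -2_000.0])

# Prepend a column of ones so the first coefficient acts as the intercept b0.
X_design = np.column_stack([np.ones(len(X)), X])

# Noise-free prices: y = b0 + b1*x1 + b2*x2 + b3*x3.
y = X_design @ true_coefficients

# Least squares estimate of the coefficients.
coefficients, *_ = np.linalg.lstsq(X_design, y, rcond=None)
print(np.round(coefficients, 2))
```

With real, noisy data the estimates would only approximate the underlying coefficients.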

## Q6: Multicollinearity in Multiple Linear Regression

Multicollinearity occurs when two or more independent variables in a multiple linear regression are highly correlated, making it difficult to isolate the individual effect of each variable. It can lead to unstable coefficient estimates and reduced interpretability.

To detect multicollinearity:
- **Correlation Matrix:** Calculate correlation coefficients between independent variables.
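- **VIF (Variance Inflation Factor):** Measures how much the variance of a coefficient is inflated by correlation with the other predictors. A VIF of 1 means no correlation; values above roughly 5 to 10 are a common rule-of-thumb sign of problematic multicollinearity.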

To address multicollinearity:
- Remove one of the correlated variables.
- Combine correlated variables into a single variable.
- Use regularization techniques like Ridge or Lasso regression.
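
To make the VIF check concrete, here is a sketch that computes it from its definition, VIF_j = 1 / (1 - R_j²), where R_j² comes from regressing predictor j on the remaining predictors. It assumes NumPy is available; the function name `variance_inflation_factors` and the synthetic data are illustrative.

```python
import numpy as np


def variance_inflation_factors(X):
    """Return the VIF of each column of X (rows = samples, columns = predictors)."""
    n_samples, n_features = X.shape
    vifs = []
    for j in range(n_features):
        target = X[:, j]
        others = np.delete(X, j, axis=1)
        # Regress predictor j on the other predictors plus an intercept.
        design = np.column_stack([np.ones(n_samples), others])
        coef, *_ = np.linalg.lstsq(design, target, rcond=None)
        residuals = target - design @ coef
        ss_res = np.sum(residuals ** 2)
        ss_tot = np.sum((target - target.mean()) ** 2)
        r_squared = 1.0 - ss_res / ss_tot
        vifs.append(1.0 / (1.0 - r_squared))
    return np.array(vifs)


# Synthetic predictors (invented): x2 is nearly a copy of x1, x3 is unrelated.
rng = np.random.default_rng(0)
x1 = rng.normal(size=200)
x2 = x1 + rng.normal(scale=0.1, size=200)
x3 = rng.normal(size=200)

vifs = variance_inflation_factors(np.column_stack([x1, x2, x3]))
print(np.round(vifs, 1))
```

The two nearly identical predictors should show large VIFs while the unrelated one stays close to 1, which is the pattern that suggests dropping or combining one of the correlated pair.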

## Q7: Polynomial Regression Model

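Polynomial Regression is an extension of linear regression where the relationship between the dependent and independent variables is modeled as an nth-degree polynomial, for example y = b0 + b1*x + b2*x^2 + ... + bn*x^n. The model is non-linear in x but still linear in its coefficients, so it can be fitted with ordinary least squares by treating the powers of x as additional features. This lets it capture non-linear relationships in the data.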
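
A short sketch of the idea, assuming NumPy is available. The data is generated without noise from a known quadratic (chosen for illustration), so the degree-2 fit can be compared with a straight-line fit on the same points.

```python
import numpy as np

# Hypothetical noise-free data from a known quadratic: y = 2x^2 - 3x + 1.
x = np.linspace(-3, 3, 13)
y = 2 * x**2 - 3 * x + 1

# np.polyfit returns coefficients ordered from highest degree to lowest.
quadratic_coefs = np.polyfit(x, y, deg=2)
linear_coefs = np.polyfit(x, y, deg=1)

# Sum of squared errors of each fit on the same data.
quadratic_sse = np.sum((np.polyval(quadratic_coefs, x) - y) ** 2)
linear_sse = np.sum((np.polyval(linear_coefs, x) - y) ** 2)

print("quadratic coefficients:", np.round(quadratic_coefs, 3))
print(f"linear SSE={linear_sse:.2f}, quadratic SSE={quadratic_sse:.2e}")
```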

## Q8: Advantages and Disadvantages of Polynomial Regression

Advantages:
- Can capture complex non-linear relationships.
- More flexible than a straight-line (first-degree) fit.

Disadvantages:
- Prone to overfitting with higher-degree polynomials.
- Interpretability may decrease with increased complexity.

**When to Use:**
Use polynomial regression when you suspect a non-linear relationship between variables and a straight-line fit leaves clear curvature in the residuals. Be cautious of overfitting: keep the degree low, choose it with a held-out set or cross-validation, and consider regularization to control model complexity.
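
The overfitting risk can be explored with a sketch like the following, which assumes NumPy is available. It fits polynomials of several degrees to synthetic noisy data (the generating function, noise level, and degrees are all invented) and reports error on the training points and on separate test points. Training error cannot increase as the degree grows, so it is a poor guide on its own; the held-out error is what indicates whether the extra flexibility helps.

```python
import numpy as np

rng = np.random.default_rng(42)


def noisy_quadratic(x):
    """Hypothetical data-generating process: a quadratic plus Gaussian noise."""
    return 0.5 * x**2 - x + 2 + rng.normal(scale=1.0, size=x.shape)


x_train = np.linspace(-3, 3, 15)
y_train = noisy_quadratic(x_train)
x_test = np.linspace(-2.8, 2.8, 15)
y_test = noisy_quadratic(x_test)


def mse(coefs, x, y):
    """Mean squared error of the polynomial with the given coefficients."""
    return float(np.mean((np.polyval(coefs, x) - y) ** 2))


# Map each degree to (training MSE, test MSE).
results = {}
for degree in (1, 2, 5, 9):
    coefs = np.polyfit(x_train, y_train, deg=degree)
    results[degree] = (mse(coefs, x_train, y_train), mse(coefs, x_test, y_test))

for degree, (train_mse, test_mse) in results.items():
    print(f"degree={degree}: train MSE={train_mse:.3f}, test MSE={test_mse:.3f}")
```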
