## <u>Introduction to Regression</u>

### What is Regression?

Regression analysis is a predictive modeling technique used to estimate the relationship between a dependent variable (target) and one or more independent variables (features). It is widely used for forecasting and predicting numerical outcomes.

### Types of Regression

-   **Linear Regression**: Assumes a linear relationship between the dependent and independent variables.
-   **Non-linear Regression**: Models complex relationships that cannot be captured by linear models.
-   **Polynomial Regression**: Models the relationship as an nth-degree polynomial of the features. It captures curved relationships while remaining linear in its coefficients.
-   **Regularized Regression**: Includes techniques like Ridge and Lasso that add regularization terms to prevent overfitting.
-   **Logistic Regression**: Used for binary classification problems, despite its name suggesting a regression model.




## 1. Linear Regression

### Simple Linear Regression

Simple linear regression models the relationship between two variables by fitting a linear equation to the observed data. The equation has the form:


<img src="images/simplelinearregression.jpg" alt="loaderror" />

Where:

-   $y$ is the dependent variable.
-   $x$ is the independent variable.
-   $\beta_0$ is the intercept.
-   $\beta_1$ is the slope.
-   $\epsilon$ is the error term.
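
To make the roles of the intercept and slope concrete, here is a minimal sketch of an ordinary least squares fit in plain Python. The function name `fit_simple_linear` and the toy data are illustrative assumptions, not part of the original material.

```python
def fit_simple_linear(xs, ys):
    """Return (intercept, slope) minimizing the sum of squared residuals."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = sum of cross-deviations / sum of squared x-deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # The fitted line always passes through (mean_x, mean_y)
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical toy data lying exactly on y = 1 + 2x
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
beta0, beta1 = fit_simple_linear(xs, ys)
print(beta0, beta1)  # intercept and slope
```

Because the toy data has no noise, the recovered intercept and slope match the generating line. With real data the residuals would be non-zero and the estimates only approximate.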

### Multiple Linear Regression

Multiple linear regression extends simple linear regression by using multiple independent variables to predict the dependent variable.


<img src="images/multilinearregression.jpg" alt="loaderror" />
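
One way to fit such a model is batch gradient descent on the mean squared error. The sketch below assumes small, similarly scaled features. The function names, learning rate, and step count are illustrative choices rather than recommended defaults.

```python
def predict(coefs, row):
    """beta0 + beta1*x1 + ... + betap*xp for one observation."""
    return coefs[0] + sum(b * x for b, x in zip(coefs[1:], row))


def fit_multiple_linear(rows, ys, learning_rate=0.1, steps=5000):
    """Minimize mean squared error with batch gradient descent."""
    n, p = len(rows), len(rows[0])
    coefs = [0.0] * (p + 1)  # intercept first, then one weight per feature
    for _ in range(steps):
        errors = [predict(coefs, r) - y for r, y in zip(rows, ys)]
        # Gradient of (1 / 2n) * sum(errors^2) with respect to each coefficient
        grad = [sum(errors) / n]
        grad += [sum(e * r[j] for e, r in zip(errors, rows)) / n for j in range(p)]
        coefs = [b - learning_rate * g for b, g in zip(coefs, grad)]
    return coefs


# Hypothetical data generated from y = 1 + 2*x1 + 3*x2
rows = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 1)]
ys = [1 + 2 * x1 + 3 * x2 for x1, x2 in rows]
coefs = fit_multiple_linear(rows, ys)
print([round(c, 4) for c in coefs])
```

Gradient descent is used here only because it keeps the code short. Solving the normal equations directly is the more common approach for small problems.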

### Assumptions of Linear Regression

1.  **Linearity**: The relationship between the dependent and independent variables is linear.
2.  **Independence**: Observations are independent of each other.
3.  **Homoscedasticity**: The residuals (errors) have constant variance.
4.  **Normality**: The residuals of the model are normally distributed.

### Evaluation Metrics

-   **Mean Absolute Error (MAE)**: Average of absolute errors.
-   **Mean Squared Error (MSE)**: Average of squared errors.
-   **Root Mean Squared Error (RMSE)**: Square root of MSE.
-   **R-squared ($R^2$)**: Proportion of variance explained by the model.
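
The four metrics above can be computed directly from their definitions. The following is a small sketch in plain Python. The function name `regression_metrics` and the sample values are made up for illustration.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, RMSE and R-squared for paired true/predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    # R-squared compares the model's squared error to that of predicting the mean
    mean_y = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    r2 = 1 - ss_res / ss_tot
    return {"mae": mae, "mse": mse, "rmse": rmse, "r2": r2}


# Hypothetical true values and predictions
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.0, 5.0, 8.0, 9.0]
metrics = regression_metrics(y_true, y_pred)
print(metrics)
```

Note that $R^2$ is undefined when all true values are identical, because the total sum of squares is then zero. This sketch does not guard against that case.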

## 2. Polynomial Regression

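Polynomial regression models the relationship as an nth-degree polynomial of the independent variable. Because the model is still linear in its coefficients, it can be fit by least squares like linear regression while capturing curved relationships.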

<img src="images/polynomialregression.jpg" alt="loaderror" />
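
As a rough illustration, a polynomial fit can be treated as a linear fit on the powers of $x$. The sketch below builds those powers and solves the least-squares normal equations with a small Gaussian elimination routine. All function names are hypothetical, and the solver assumes the system is non-singular.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b], copied so the inputs are not modified
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_polynomial(xs, ys, degree):
    """Return coefficients [b0, b1, ..., bd] of the least-squares polynomial."""
    # Each row holds the powers x^0, x^1, ..., x^degree
    design = [[x ** k for k in range(degree + 1)] for x in xs]
    p = degree + 1
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [sum(d[i] * y for d, y in zip(design, ys)) for i in range(p)]
    return solve_linear_system(xtx, xty)


# Hypothetical data generated from y = 1 - 2x + 0.5x^2
xs = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
ys = [1 - 2 * x + 0.5 * x ** 2 for x in xs]
poly_coefs = fit_polynomial(xs, ys, degree=2)
print([round(c, 6) for c in poly_coefs])
```

For high degrees the normal equations become numerically ill-conditioned, so this direct approach is only a teaching aid.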

## 3. Regularization Techniques

### Ridge Regression (L2 Regularization)

Ridge regression adds a penalty proportional to the sum of the squared coefficients to the loss function. This shrinks the coefficients toward zero, which reduces overfitting.

<img src="images/L2Regularization.jpg" alt="loaderror" />
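
The shrinking effect is easiest to see in the one-feature, no-intercept case, where the ridge solution has a closed form. The sketch below assumes the objective `sum((y - b*x)^2) + lam * b^2`. The name `lam` for the penalty strength and the data are illustrative assumptions.

```python
def ridge_slope(xs, ys, lam):
    """Slope b minimizing sum((y - b*x)^2) + lam * b^2 (no intercept)."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    # Setting the derivative to zero gives b = sxy / (sxx + lam)
    return sxy / (sxx + lam)


# Hypothetical centered data with a true slope close to 2
xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
ys = [-4.1, -1.9, 0.2, 2.1, 3.9]
slopes = [ridge_slope(xs, ys, lam) for lam in (0.0, 1.0, 10.0, 100.0)]
for lam, slope in zip((0.0, 1.0, 10.0, 100.0), slopes):
    print(lam, round(slope, 4))
```

With `lam = 0` the result is the ordinary least squares slope. Larger values pull the slope toward zero but never make it exactly zero.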

### Lasso Regression (L1 Regularization)

Lasso regression adds a penalty proportional to the sum of the absolute values of the coefficients. This can produce sparse models in which some coefficients are exactly zero, so it also acts as a form of feature selection.

<img src="images/L1Regularization.jpg" alt="loaderror" />
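
Why some coefficients become exactly zero can be seen in the one-feature, no-intercept case, where the solution is a soft-thresholding operation. This sketch assumes the objective `0.5 * sum((y - b*x)^2) + lam * abs(b)`. The function names and data are hypothetical.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam, returning exactly 0 when |z| <= lam."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_slope(xs, ys, lam):
    """Slope b minimizing 0.5 * sum((y - b*x)^2) + lam * |b| (no intercept)."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return soft_threshold(sxy, lam) / sxx


xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
strong_ys = [-4.1, -1.9, 0.2, 2.1, 3.9]   # hypothetical: clearly related to x
weak_ys = [0.3, -0.2, 0.1, 0.2, -0.1]     # hypothetical: almost unrelated to x

strong = lasso_slope(xs, strong_ys, lam=1.0)
weak = lasso_slope(xs, weak_ys, lam=1.0)
print(round(strong, 4), weak)
```

The strongly related feature keeps a shrunken but non-zero slope, while the weakly related one is dropped entirely.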

### Elastic Net

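Elastic Net combines L1 and L2 regularization by adding both penalties to the loss function, balancing the sparsity of Lasso against the stable shrinkage of Ridge.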

<img src="images/elasticnet.jpg" alt="loaderror" />
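
As a small sketch, the combined penalty can be written as a weighted sum of the two terms. The parameter names `lam1` and `lam2` are assumptions for illustration. Libraries often use a different parameterization, such as an overall strength plus a mixing ratio.

```python
def elastic_net_penalty(coefs, lam1, lam2):
    """lam1 * sum(|b|) + lam2 * sum(b^2) over the given coefficients."""
    l1_term = sum(abs(b) for b in coefs)
    l2_term = sum(b * b for b in coefs)
    return lam1 * l1_term + lam2 * l2_term


# Hypothetical coefficient vector (intercept excluded, as is conventional)
coefs = [1.0, -2.0, 0.0]
both = elastic_net_penalty(coefs, lam1=0.5, lam2=0.1)
lasso_only = elastic_net_penalty(coefs, lam1=0.5, lam2=0.0)
ridge_only = elastic_net_penalty(coefs, lam1=0.0, lam2=0.1)
print(both, lasso_only, ridge_only)
```

Setting `lam2 = 0` recovers the Lasso penalty and setting `lam1 = 0` recovers the Ridge penalty.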

## 4. Advanced Regression Techniques

### Support Vector Regression (SVR)

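SVR adapts the principles of Support Vector Machines (SVM) from classification to regression. It fits a function so that as many training points as possible lie within a margin of width ε around it. Errors smaller than ε are ignored, and only points outside the margin contribute to the loss.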
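
The margin idea corresponds to the ε-insensitive loss, sketched below in plain Python. This shows only the loss, not the full SVR optimization. The function name and sample values are illustrative.

```python
def epsilon_insensitive_loss(y_true, y_pred, epsilon):
    """Per-sample loss: zero inside the margin, linear outside it."""
    return [max(0.0, abs(t - p) - epsilon) for t, p in zip(y_true, y_pred)]


# Hypothetical targets and predictions
y_true = [1.0, 2.0, 3.0]
y_pred = [1.05, 2.5, 2.0]
losses = epsilon_insensitive_loss(y_true, y_pred, epsilon=0.1)
print([round(v, 4) for v in losses])
```

The first prediction is off by less than ε and therefore incurs no loss, while the other two are penalized only for the part of the error beyond the margin.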

### Decision Tree Regression

Decision tree regression uses a tree-like model of decisions. It recursively splits the data on feature thresholds, and each leaf typically predicts the mean target value of the training samples that reach it.
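
A full tree repeats one basic step: find the threshold that most reduces squared error. The sketch below performs that step once for a single feature, giving a one-split tree. The function names and data are hypothetical, and the code assumes at least two distinct feature values.

```python
def sse(values):
    """Sum of squared deviations from the mean (0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def best_split(xs, ys):
    """Return (threshold, left_mean, right_mean) minimizing total squared error."""
    best = None
    points = sorted(set(xs))
    # Candidate thresholds are midpoints between consecutive distinct values
    for lo, hi in zip(points, points[1:]):
        threshold = (lo + hi) / 2
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        cost = sse(left) + sse(right)
        if best is None or cost < best[0]:
            best = (cost, threshold, sum(left) / len(left), sum(right) / len(right))
    return best[1:]


# Hypothetical data with two clearly separated groups
xs = [1, 2, 3, 10, 11, 12]
ys = [5.0, 6.0, 5.0, 20.0, 21.0, 19.0]
threshold, left_mean, right_mean = best_split(xs, ys)
print(threshold, round(left_mean, 3), right_mean)
```

A complete regression tree would apply `best_split` again inside each branch until a stopping rule, such as a maximum depth, is reached.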

### Random Forest Regression

Random forest regression is an ensemble method that uses multiple decision trees to predict the target value. The final prediction is an average of all tree predictions.
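
The averaging step can be sketched with very small trees. The example below trains one-split trees on bootstrap samples and averages their predictions. It is a simplification: real random forests grow deeper trees and also sample a random subset of features at each split. All names and data here are illustrative.

```python
import random


def fit_stump(xs, ys):
    """Fit a one-split tree on one feature; returns a predict(x) function."""
    mean_all = sum(ys) / len(ys)
    # Start from "no split" so degenerate samples fall back to the mean
    best = (sum((y - mean_all) ** 2 for y in ys), None, mean_all, mean_all)
    points = sorted(set(xs))
    for lo, hi in zip(points, points[1:]):
        t = (lo + hi) / 2
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        cost = sum((y - lm) ** 2 for y in left) + sum((y - rm) ** 2 for y in right)
        if cost < best[0]:
            best = (cost, t, lm, rm)
    _, t, lm, rm = best
    if t is None:
        return lambda x: mean_all
    return lambda x: lm if x <= t else rm


def fit_forest(xs, ys, n_trees=25, seed=0):
    """Train stumps on bootstrap samples; predict with their average."""
    rng = random.Random(seed)
    n = len(xs)
    trees = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        trees.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return lambda x: sum(tree(x) for tree in trees) / len(trees)


# Hypothetical data with two clearly separated groups
xs = [1, 2, 3, 10, 11, 12]
ys = [5.0, 6.0, 5.0, 20.0, 21.0, 19.0]
forest = fit_forest(xs, ys)
low, high = forest(2), forest(11)
print(round(low, 2), round(high, 2))
```

Because each tree sees a slightly different sample, the averaged prediction is less sensitive to individual data points than any single tree.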

### Gradient Boosting Regression

Gradient boosting regression builds models sequentially, each trying to correct the errors of the previous model. It combines weak learners into a strong learner.
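
For squared-error loss, "correcting the errors of the previous model" means fitting each new weak learner to the current residuals. The sketch below does this with one-split trees on a single feature. The function names, learning rate, and number of rounds are illustrative assumptions.

```python
def fit_stump(xs, targets):
    """Best single-threshold split; returns (threshold, left_mean, right_mean)."""
    best = None
    points = sorted(set(xs))
    for lo, hi in zip(points, points[1:]):
        t = (lo + hi) / 2
        left = [v for x, v in zip(xs, targets) if x <= t]
        right = [v for x, v in zip(xs, targets) if x > t]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        cost = sum((v - lm) ** 2 for v in left) + sum((v - rm) ** 2 for v in right)
        if best is None or cost < best[0]:
            best = (cost, t, lm, rm)
    return best[1:]


def fit_boosting(xs, ys, n_rounds=20, learning_rate=0.5):
    """Sequentially fit stumps to residuals; returns (base, learning_rate, stumps)."""
    base = sum(ys) / len(ys)  # initial prediction: the mean target
    preds = [base] * len(ys)
    stumps = []
    for _ in range(n_rounds):
        # Residuals are the negative gradient of the squared-error loss
        residuals = [y - p for y, p in zip(ys, preds)]
        t, lm, rm = fit_stump(xs, residuals)
        stumps.append((t, lm, rm))
        preds = [p + learning_rate * (lm if x <= t else rm) for x, p in zip(xs, preds)]
    return base, learning_rate, stumps


def predict_boosting(model, x):
    """Base prediction plus the scaled contribution of every stump."""
    base, learning_rate, stumps = model
    return base + learning_rate * sum(lm if x <= t else rm for t, lm, rm in stumps)


# Hypothetical data with an increasing trend
xs = [1, 2, 3, 4, 5, 6]
ys = [1.0, 1.5, 3.0, 3.5, 6.0, 6.5]
model = fit_boosting(xs, ys)

mean_y = sum(ys) / len(ys)
baseline_mse = sum((y - mean_y) ** 2 for y in ys) / len(ys)
boosted_mse = sum((y - predict_boosting(model, x)) ** 2 for x, y in zip(xs, ys)) / len(ys)
print(round(baseline_mse, 4), round(boosted_mse, 4))
```

The comparison is on training data only, so it shows the fitting mechanism rather than generalization. In practice the number of rounds and the learning rate are tuned on held-out data.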

### K-Nearest Neighbors (KNN) Regression

KNN regression predicts the target value as the average of the target values of the k nearest training points in the feature space.
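
A minimal sketch of this idea, assuming Euclidean distance and an unweighted average. The function name and data are illustrative.

```python
import math


def knn_predict(train_x, train_y, query, k):
    """Average the targets of the k training points closest to `query`."""
    # Pair each distance with its target, then sort by distance
    distances = sorted(
        (math.dist(x, query), y) for x, y in zip(train_x, train_y)
    )
    nearest = distances[:k]
    return sum(y for _, y in nearest) / len(nearest)


# Hypothetical two-feature training set
train_x = [(1, 1), (2, 2), (3, 3), (8, 8), (9, 9)]
train_y = [10.0, 12.0, 14.0, 30.0, 32.0]
prediction = knn_predict(train_x, train_y, query=(2.5, 2.5), k=3)
print(prediction)
```

Because the method relies on distances, features on very different scales are usually standardized first. That step is omitted here.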

### Bayesian Regression

Bayesian regression applies Bayesian principles to linear regression. It involves prior distributions for the model parameters and updates these priors with data to form posterior distributions.
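
The prior-to-posterior update has a closed form in the simplest setting: a single slope with no intercept, a Gaussian prior on the slope, and Gaussian noise of known variance. The sketch below works under exactly those assumptions. The function name and all numbers are illustrative.

```python
def posterior_slope(xs, ys, prior_mean, prior_var, noise_var):
    """Posterior (mean, variance) of w in y = w*x + noise, with a Gaussian prior on w."""
    # Precisions (inverse variances) of the prior and the data add up
    precision = 1.0 / prior_var + sum(x * x for x in xs) / noise_var
    weighted = prior_mean / prior_var + sum(x * y for x, y in zip(xs, ys)) / noise_var
    return weighted / precision, 1.0 / precision


# Hypothetical data with a slope near 2, and a prior centered on 0
xs = [1.0, 2.0, 3.0]
ys = [2.1, 3.9, 6.2]
post_mean, post_var = posterior_slope(xs, ys, prior_mean=0.0, prior_var=1.0, noise_var=1.0)
print(round(post_mean, 4), round(post_var, 4))
```

The posterior mean falls between the prior mean and the least-squares estimate, and the posterior variance is smaller than the prior variance. This reflects the information gained from the data.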

