# POLYNOMIAL REGRESSION

To explain polynomial regression, consider a hypothetical dataset with two features ($x_1$ and $x_2$) and two data points. Unlike plain linear regression, polynomial regression models the relationship between the independent variables ($x_1$ and $x_2$) and the dependent variable ($y$) as a polynomial, which can capture more complex relationships.

### Example Dataset:

| Data Point | $x_1$ (Feature 1) | $x_2$ (Feature 2) | $y$ (Target) |
|------------|-------------------|-------------------|--------------|
| 1          | $x_{11}$          | $x_{12}$          | $y_1$        |
| 2          | $x_{21}$          | $x_{22}$          | $y_2$        |

### Polynomial Regression Model:

For illustrative purposes, let's consider a polynomial regression model that includes terms up to the second degree for each feature and an interaction term between $x_1$ and $x_2$. The model can be represented as:

$\hat{y} = \theta_0 + \theta_1 x_1 + \theta_2 x_2 + \theta_3 x_1^2 + \theta_4 x_2^2 + \theta_5 x_1 x_2$

Where:
- $\hat{y}$ is the predicted value of the target variable.
- $\theta_0$ is the intercept term.
- $\theta_1$ and $\theta_2$ are the coefficients for the linear terms of features $x_1$ and $x_2$, respectively.
- $\theta_3$ and $\theta_4$ are the coefficients for the squared terms $x_1^2$ and $x_2^2$, respectively, allowing the model to capture non-linear relationships.
- $\theta_5$ is the coefficient for the interaction term $x_1 x_2$, allowing the model to capture the combined effect of $x_1$ and $x_2$ on $y$.
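
To make the model equation concrete, here is a minimal Python sketch that expands $(x_1, x_2)$ into the six terms of the model and evaluates $\hat{y}$. The function names `poly_features` and `predict`, and the coefficient values, are made up for illustration and are not part of the original text.

```python
def poly_features(x1, x2):
    """Return [1, x1, x2, x1^2, x2^2, x1*x2], ordered to match theta_0..theta_5."""
    return [1.0, x1, x2, x1 ** 2, x2 ** 2, x1 * x2]


def predict(theta, x1, x2):
    """Evaluate y_hat as the dot product of theta with the expanded features."""
    return sum(t * f for t, f in zip(theta, poly_features(x1, x2)))


# Hypothetical coefficients theta_0..theta_5, chosen only for the example.
theta = [1.0, 2.0, -1.0, 0.5, 0.0, 3.0]

# 1 + 2*2 - 1*3 + 0.5*4 + 0*9 + 3*6 = 22
y_hat = predict(theta, 2.0, 3.0)
print(y_hat)
```

Note that the prediction is a plain weighted sum of the expanded features: the non-linearity lives in the features, not in the coefficients.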

### Objective:

The objective in polynomial regression, as in linear regression, is to find the values of the coefficients $\theta_0, \theta_1, \theta_2, \theta_3, \theta_4,$ and $\theta_5$ that minimize the difference between the predicted values $\hat{y}$ and the actual target values $y$ in the dataset. This is typically achieved by minimizing a cost function, such as the Mean Squared Error (MSE), which for our dataset with two data points ($n = 2$) becomes:

$\text{MSE} = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2$ \
$\text{MSE} = \frac{1}{2} \left[ \text{Residual}_1^2 + \text{Residual}_2^2 \right]$

Where:
- $\text{Residual}_1 = y_1 - (\theta_0 + \theta_1 x_{11} + \theta_2 x_{12} + \theta_3 x_{11}^2 + \theta_4 x_{12}^2 + \theta_5 x_{11} x_{12})$
- $\text{Residual}_2 = y_2 - (\theta_0 + \theta_1 x_{21} + \theta_2 x_{22} + \theta_3 x_{21}^2 + \theta_4 x_{22}^2 + \theta_5 x_{21} x_{22})$
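
The MSE above can be computed directly from the residuals. The sketch below assumes a made-up dataset of two points, given as `(x1, x2, y)` tuples; the numbers and the helper names are illustrative only.

```python
def poly_features(x1, x2):
    """Return [1, x1, x2, x1^2, x2^2, x1*x2], ordered to match theta_0..theta_5."""
    return [1.0, x1, x2, x1 ** 2, x2 ** 2, x1 * x2]


def mse(theta, data):
    """Mean of the squared residuals y_i - y_hat_i over all data points."""
    residuals = []
    for x1, x2, y in data:
        y_hat = sum(t * f for t, f in zip(theta, poly_features(x1, x2)))
        residuals.append(y - y_hat)
    return sum(r ** 2 for r in residuals) / len(residuals)


# Hypothetical data: (x1, x2, y) for data points 1 and 2.
data = [(1.0, 2.0, 5.0), (2.0, 1.0, 4.0)]

# All-zero coefficients predict 0, so the residuals are 5 and 4.
mse_zero = mse([0.0] * 6, data)  # (25 + 16) / 2 = 20.5

# Intercept of 1 only: the residuals shrink to 4 and 3.
mse_bias = mse([1.0, 0.0, 0.0, 0.0, 0.0, 0.0], data)  # (16 + 9) / 2 = 12.5

print(mse_zero, mse_bias)
```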

### Solution:

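Although the model is non-linear in the features, it is linear in the coefficients $\theta_0, \dots, \theta_5$. Polynomial regression is therefore ordinary linear regression on the expanded feature vector $[1, x_1, x_2, x_1^2, x_2^2, x_1 x_2]$, and the optimal coefficients can be found either in closed form with the Normal Equation, $\theta = (X^\top X)^{-1} X^\top y$, where $X$ is the matrix whose rows are the expanded feature vectors, or iteratively with an optimization method such as Gradient Descent. Note that with only two data points and six coefficients, $X^\top X$ is not invertible and infinitely many coefficient vectors fit the data exactly; a unique solution requires at least six data points with linearly independent feature rows, or some form of regularization.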
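
As an illustration of the iterative route, here is a minimal gradient descent sketch on the MSE. The dataset, the learning rate `lr`, and the step count are assumptions chosen so that this toy example converges; real data would usually need feature scaling and a tuned learning rate.

```python
def poly_features(x1, x2):
    """Return [1, x1, x2, x1^2, x2^2, x1*x2], ordered to match theta_0..theta_5."""
    return [1.0, x1, x2, x1 ** 2, x2 ** 2, x1 * x2]


def mse(theta, data):
    """Mean squared error of the model over (x1, x2, y) tuples."""
    total = 0.0
    for x1, x2, y in data:
        y_hat = sum(t * f for t, f in zip(theta, poly_features(x1, x2)))
        total += (y - y_hat) ** 2
    return total / len(data)


def fit_gradient_descent(data, lr=0.01, steps=2000):
    """Minimize the MSE by batch gradient descent, starting from theta = 0."""
    n = len(data)
    rows = [(poly_features(x1, x2), y) for x1, x2, y in data]
    theta = [0.0] * 6
    for _ in range(steps):
        grad = [0.0] * 6
        for phi, y in rows:
            residual = y - sum(t * f for t, f in zip(theta, phi))
            for j in range(6):
                # d(MSE)/d(theta_j) = -(2/n) * sum_i residual_i * phi_ij
                grad[j] += -2.0 / n * residual * phi[j]
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta


# Hypothetical data: (x1, x2, y) for data points 1 and 2.
data = [(1.0, 2.0, 5.0), (2.0, 1.0, 4.0)]

theta = fit_gradient_descent(data)
final_mse = mse(theta, data)
print(theta, final_mse)
```

With two data points and six coefficients, the fitted `theta` is only one of many coefficient vectors that drive the training error to zero, so a near-zero MSE here says nothing about how well the model generalizes.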

### Interpretation:

Once the optimal coefficients are determined, the polynomial regression model can predict the target variable $y$ for any given values of $x_1$ and $x_2$ using the model equation. This model can capture more complex relationships than a simple linear model due to the inclusion of higher-degree terms and interaction terms, making it more flexible in fitting the underlying pattern in the data. However, care must be taken to avoid overfitting, especially as the degree of the polynomial increases.