# Polynomial Regression

---

## What is Polynomial Regression?

**Polynomial Regression** is a type of regression analysis where the relationship between the **independent variable** \( x \) and the **dependent variable** \( y \) is modeled as an **nth-degree polynomial**.

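It is an extension of **Linear Regression**: instead of fitting a straight line, it fits a **curve** to the data. The model is still linear in the coefficients, so it can be fitted with the same methods as linear regression.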

### General form:

$$
\hat{y} = w_0 + w_1x + w_2x^2 + w_3x^3 + \cdots + w_nx^n
$$

- $\hat{y}$ : predicted output  
- $x$ : input variable  
- $w_0, w_1, w_2, \dots, w_n$ : coefficients (learned from data)
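
As a small illustration of the general form, the sketch below evaluates $\hat{y}$ for a given list of coefficients. The function name `predict` and the sample weights are assumptions chosen for this example; they do not come from any library.

```python
def predict(x, weights):
    """Evaluate w_0 + w_1*x + ... + w_n*x^n for a single input x.

    `weights` is ordered from w_0 (the constant term) up to w_n.
    """
    return sum(w * x**power for power, w in enumerate(weights))


# Hypothetical coefficients: y_hat = 1 + 0*x + 1*x^2
weights = [1, 0, 1]
print(predict(2, weights))  # 1 + 0*2 + 1*4 = 5
```

The degree of the polynomial is simply the length of `weights` minus one, so the same function covers a straight line (`[w_0, w_1]`) and higher-degree curves.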

---

## Why Use Polynomial Regression?

We use **Polynomial Regression** when the **data shows a curved or non-linear pattern**, meaning:

- A **straight line** (from linear regression) doesn't fit the data well
- A **curved line** (polynomial regression) fits much better

---

### Real-life examples:

- Predicting **population growth** (which often curves upward)
- Modeling **COVID-19 spread** over time
- Estimating **sales trends** that rise and fall seasonally
- Forecasting **temperature patterns** over months

---

## Comparison:

| Model Type            | Shape of Fit       | Use When...                        |
|----------------------|--------------------|------------------------------------|
| Linear Regression     | Straight line      | Data looks like a line             |
| Polynomial Regression | Curve (U, ∩, etc.) | Data curves or bends (non-linear)  |
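
To make the comparison concrete, here is a minimal sketch that fits both model types to the same curved data and compares their errors. The data is hypothetical (generated from $y = x^2 + 1$ without noise), and the helper name `fit_and_score` is an assumption made for this example. It uses NumPy's `np.polyfit` and `np.polyval`.

```python
import numpy as np

# Hypothetical noise-free data that follows a curve (y = x^2 + 1).
x = np.arange(6, dtype=float)
y = x**2 + 1.0


def fit_and_score(degree):
    """Least-squares polynomial fit; returns the mean squared error on (x, y)."""
    coeffs = np.polyfit(x, y, deg=degree)
    predictions = np.polyval(coeffs, x)
    return float(np.mean((y - predictions) ** 2))


mse_linear = fit_and_score(1)      # straight line
mse_quadratic = fit_and_score(2)   # degree-2 curve

print(f"straight line MSE: {mse_linear:.3f}")
print(f"quadratic MSE:     {mse_quadratic:.3f}")
```

On data like this, the straight line leaves a clearly non-zero error while the quadratic error is essentially zero, which is the situation described in the second row of the table.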

---

## In Simple Terms:

> Polynomial regression helps us draw a **smooth curve** through data when a **straight line is not enough**.

# Polynomial Regression (Degree 2) - Step-by-Step Example

---

## Goal:

We want to **fit a quadratic curve** (polynomial of degree 2) through 3 points.

Let’s say the data points are:

| x | y  |
|---|----|
| 1 | 2  |
| 2 | 5  |
| 3 | 10 |

We want to find the best-fitting curve, where $b$ is the intercept (the $w_0$ of the general form):

$$
\hat{y} = w_1 x + w_2 x^2 + b
$$

---

## Step 1: Write the equation for each point

We substitute each point's $x$ and $y$ into the equation:

$$
y = w_1 x + w_2 x^2 + b
$$

---

### For point (1, 2):

$$
2 = w_1 \cdot 1 + w_2 \cdot 1^2 + b \\
\Rightarrow w_1 + w_2 + b = 2 \quad \text{(Equation ①)}
$$

---

### For point (2, 5):

$$
5 = w_1 \cdot 2 + w_2 \cdot 4 + b \\
\Rightarrow 2w_1 + 4w_2 + b = 5 \quad \text{(Equation ②)}
$$

---

### For point (3, 10):

$$
10 = w_1 \cdot 3 + w_2 \cdot 9 + b \\
\Rightarrow 3w_1 + 9w_2 + b = 10 \quad \text{(Equation ③)}
$$
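
The three equations above can also be generated programmatically. The sketch below builds one row per data point with the columns $[x, x^2, 1]$, which are the multipliers of $(w_1, w_2, b)$. The variable name `design` is an assumption for this example.

```python
import numpy as np

# The three data points from the table above.
x = np.array([1.0, 2.0, 3.0])
y = np.array([2.0, 5.0, 10.0])

# One row per point, with columns [x, x^2, 1] matching (w_1, w_2, b).
design = np.column_stack([x, x**2, np.ones_like(x)])

# Print each row as the equation it represents.
for row, target in zip(design, y):
    print(f"{row[0]:.0f}*w_1 + {row[1]:.0f}*w_2 + {row[2]:.0f}*b = {target:.0f}")
```

Each printed line corresponds to one of the equations ①, ②, and ③.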

---

## Step 2: Solve the system of equations

We now solve the following system:

$$
w_1 + w_2 + b = 2 \quad \text{(Equation ①)} \\
2w_1 + 4w_2 + b = 5 \quad \text{(Equation ②)} \\
3w_1 + 9w_2 + b = 10 \quad \text{(Equation ③)}
$$


---

### Step A: Subtract Equation ① from Equation ②

$$
(2w_1 + 4w_2 + b) - (w_1 + w_2 + b) = 5 - 2 \\
\Rightarrow w_1 + 3w_2 = 3 \quad \text{(Equation ④)}
$$

---

### Step B: Subtract Equation ② from Equation ③

$$
(3w_1 + 9w_2 + b) - (2w_1 + 4w_2 + b) = 10 - 5 \\
\Rightarrow w_1 + 5w_2 = 5 \quad \text{(Equation ⑤)}
$$

---

### Step C: Subtract Equation ④ from Equation ⑤

$$
(w_1 + 5w_2) - (w_1 + 3w_2) = 5 - 3 \\
\Rightarrow 2w_2 = 2 \Rightarrow w_2 = 1
$$

---

### Step D: Plug \( w_2 = 1 \) into Equation ④

$$
w_1 + 3(1) = 3 \Rightarrow w_1 = 0
$$

---

### Step E: Plug \( w_1 = 0, w_2 = 1 \) into Equation ①

$$
0 + 1 + b = 2 \Rightarrow b = 1
$$

---

## Final Answer:

- \( w_1 = 0 \)
- \( w_2 = 1 \)
- \( b = 1 \)

So the final equation is:

$$
\hat{y} = x^2 + 1
$$
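
As a cross-check of the hand calculation, the same system can be solved numerically. This is a minimal sketch using NumPy's `np.linalg.solve`; the names `A` and `rhs` are assumptions for this example.

```python
import numpy as np

# Each row holds the multipliers of (w_1, w_2, b) in one equation.
A = np.array([
    [1.0, 1.0, 1.0],   # w_1 + w_2 + b = 2
    [2.0, 4.0, 1.0],   # 2w_1 + 4w_2 + b = 5
    [3.0, 9.0, 1.0],   # 3w_1 + 9w_2 + b = 10
])
rhs = np.array([2.0, 5.0, 10.0])

# Solve A @ [w_1, w_2, b] = rhs for the three unknowns.
w1, w2, b = np.linalg.solve(A, rhs)
print(np.round([w1, w2, b], 6))
```

Up to floating-point rounding, the solver should agree with the elimination result: $w_1 = 0$, $w_2 = 1$, $b = 1$.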

---

## Let’s Verify:

| x |         Prediction            | Actual y |
|---|-------------------------------|----------|
| 1 | \( 1^2 + 1 = 2 \)             | 2        |
| 2 | \( 2^2 + 1 = 5 \)             | 5        |
| 3 | \( 3^2 + 1 = 10 \)            | 10       |

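The model fits **perfectly**. This is expected: three points with distinct $x$ values determine exactly one quadratic. With more data points, an exact fit is usually impossible, and the coefficients are instead chosen to minimize the squared error.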
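
The verification can also be run in code. The sketch below assumes NumPy and uses its least-squares routine `np.polyfit` on the same three points, then checks that every residual (actual minus predicted) is zero up to rounding.

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([2.0, 5.0, 10.0])

# np.polyfit returns coefficients from the highest power down: [w_2, w_1, b].
w2, w1, b = np.polyfit(x, y, deg=2)

# Evaluate the fitted curve at the original x values.
predictions = np.polyval([w2, w1, b], x)
residuals = y - predictions

print("predictions:", np.round(predictions, 6))
print("all residuals ~ 0:", np.allclose(residuals, 0.0, atol=1e-8))
```

Because a degree-2 polynomial has three coefficients, a least-squares fit on three points reproduces the exact solution found by hand.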



## How to Decide the Polynomial Degree?

Choosing the right degree for polynomial regression depends on the **pattern in your data** and **model performance**.

---

### 1. Plot the Data

- If the data points look like a **U** or an **∩** shape → try **degree 2** (quadratic).
- If the data shows a **wave-like pattern** or **multiple bends** → try **degree 3** (cubic) or higher.

---

### 2. Use Trial and Error

- Start with degree 2  
- Then try degree 3  
- Compare results and **check the curve fit visually**
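
The trial-and-error loop can be sketched as follows. The data here is hypothetical: it is generated from a cubic with added noise and a fixed random seed, purely for illustration. The dictionary name `train_mse` is also an assumption.

```python
import numpy as np

# Hypothetical data: a cubic trend plus random noise (fixed seed for repeatability).
rng = np.random.default_rng(0)
x = np.linspace(-3.0, 3.0, 30)
y = x**3 - 2.0 * x + rng.normal(0.0, 1.0, size=x.shape)

train_mse = {}
for degree in (1, 2, 3, 4):
    coeffs = np.polyfit(x, y, deg=degree)   # least-squares fit of this degree
    fitted = np.polyval(coeffs, x)          # curve evaluated on the same data
    train_mse[degree] = float(np.mean((y - fitted) ** 2))
    print(f"degree {degree}: training MSE = {train_mse[degree]:.3f}")
```

Note that the error measured on the fitting data cannot increase as the degree grows, so it should not be the only criterion. Look for the degree after which the improvement becomes small, and confirm the choice visually.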

---

### 3. Use Evaluation Metrics

Compare performance using metrics such as:

- **Mean Squared Error (MSE)** — lower is better
- **R² Score** — closer to 1 is better

Evaluate these metrics on **test data**, not just training data: a higher degree always fits the training data at least as well, so only held-out data reveals overfitting.
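
A minimal sketch of this evaluation, assuming NumPy only. The data is hypothetical (a noisy quadratic with a fixed seed), the 45/15 split is an arbitrary choice, and the helper names `mse` and `r2` are assumptions written out here so the formulas are visible.

```python
import numpy as np


def mse(y_true, y_pred):
    """Mean Squared Error: average of the squared residuals."""
    return float(np.mean((y_true - y_pred) ** 2))


def r2(y_true, y_pred):
    """R^2 score: 1 - (residual sum of squares / total sum of squares)."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return float(1.0 - ss_res / ss_tot)


# Hypothetical data: y = x^2 + 1 plus noise, sampled in random order.
rng = np.random.default_rng(42)
x = rng.uniform(-3.0, 3.0, size=60)
y = x**2 + 1.0 + rng.normal(0.0, 0.5, size=x.shape)

# Hold out the last 15 points as test data.
x_train, x_test = x[:45], x[45:]
y_train, y_test = y[:45], y[45:]

results = {}
for degree in (1, 2, 6):
    coeffs = np.polyfit(x_train, y_train, deg=degree)  # fit on training data only
    results[degree] = {
        "train_mse": mse(y_train, np.polyval(coeffs, x_train)),
        "test_mse": mse(y_test, np.polyval(coeffs, x_test)),
        "test_r2": r2(y_test, np.polyval(coeffs, x_test)),
    }
    print(degree, {k: round(v, 3) for k, v in results[degree].items()})
```

A degree whose training MSE is low but whose test MSE is noticeably higher is a sign of overfitting; prefer the lowest degree that performs well on the test data.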
