# Optimization in Calculus (Maxima and Minima)

**Optimization** in calculus refers to the process of finding the **maximum** or **minimum** values of a function.

---

## Maxima
- Points where a function reaches its **highest value**, either **locally** or **globally**.

## Minima
- Points where a function reaches its **lowest value**, either **locally** or **globally**.

---

## How to Find These Points

1. **Take the first derivative** of the function.
2. **Set the derivative equal to zero** and solve to find the **critical points** (points where the derivative is undefined are also critical points).
3. **Evaluate the second derivative** at each critical point to classify it:
   - If the second derivative is **positive**: it's a **local minimum**.
   - If the second derivative is **negative**: it's a **local maximum**.
   - If the second derivative is **zero**: the test is **inconclusive**, so check how the sign of the first derivative changes around the point instead.
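
The steps above can be sketched in plain Python. The function \(f(x) = x^3 - 3x\) is an illustrative choice made for this sketch, its derivatives are written out by hand, and `classify_critical_point` is a hypothetical helper name.

```python
# Illustrative function (an assumption for this sketch): f(x) = x^3 - 3x
def f(x):
    return x**3 - 3 * x

def f_prime(x):
    # First derivative: f'(x) = 3x^2 - 3
    return 3 * x**2 - 3

def f_double_prime(x):
    # Second derivative: f''(x) = 6x
    return 6 * x

def classify_critical_point(x):
    """Apply the second derivative test at a critical point x."""
    curvature = f_double_prime(x)
    if curvature > 0:
        return "local minimum"
    if curvature < 0:
        return "local maximum"
    return "inconclusive"

# Solving 3x^2 - 3 = 0 by hand gives the critical points x = -1 and x = 1.
critical_points = [-1.0, 1.0]

for x in critical_points:
    print(f"x = {x}: f'(x) = {f_prime(x)}, f(x) = {f(x)}, {classify_critical_point(x)}")
```

Here \(x = -1\) is classified as a local maximum and \(x = 1\) as a local minimum. Neither is a global extremum, because this cubic is unbounded in both directions.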

---

The same idea underlies the training of machine learning models, where we minimize an objective function such as a loss, or maximize one such as a reward.


## Mathematical Example with a Very Small Dataset

Let’s take a simple **linear regression** problem:

### Dataset:

| x | y |
|---|---|
| 1 | 2 |
| 2 | 4 |

We want to fit a model with a single weight \(w\) and no intercept:

$$
y = wx
$$

---

### Loss Function (Mean Squared Error):

$$
L(w) = \frac{1}{2n} \sum_{i=1}^{n} (wx_i - y_i)^2
$$

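The extra factor of \(\frac{1}{2}\) is a common convention that cancels the 2 produced by differentiating the square; it does not change where the minimum is.

For our case (\(n = 2\)):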

$$
L(w) = \frac{1}{4} \left[(w \cdot 1 - 2)^2 + (w \cdot 2 - 4)^2\right]
$$

---

### Simplify the Loss:

$$
L(w) = \frac{1}{4} \left[(w - 2)^2 + (2w - 4)^2\right]
$$

$$
= \frac{1}{4} \left[w^2 - 4w + 4 + 4w^2 - 16w + 16\right]
$$

$$
= \frac{1}{4} \left[5w^2 - 20w + 20\right]
$$
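
As a sanity check on the algebra, the expanded form can be compared with the loss computed directly from the data. This is a minimal sketch; the names `loss_from_data` and `loss_expanded` are hypothetical, and the test values of \(w\) are arbitrary.

```python
# The two data points from the dataset table.
xs = [1.0, 2.0]
ys = [2.0, 4.0]

def loss_from_data(w):
    """L(w) = 1/(2n) * sum((w*x_i - y_i)^2), computed directly from the data."""
    n = len(xs)
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / (2 * n)

def loss_expanded(w):
    """The simplified form: L(w) = (5w^2 - 20w + 20) / 4."""
    return (5 * w**2 - 20 * w + 20) / 4

# Both columns should agree for every w.
for w in [0.0, 1.0, 2.0, 3.0]:
    print(w, loss_from_data(w), loss_expanded(w))
```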

---

### First Derivative:

$$
\frac{dL}{dw} = \frac{1}{4} \cdot (10w - 20) = \frac{10w - 20}{4}
$$

Set the derivative to zero:

$$
\frac{10w - 20}{4} = 0 \quad \Rightarrow \quad 10w - 20 = 0 \quad \Rightarrow \quad w = 2
$$
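
The same calculation works for any dataset under this one-parameter model: setting \(\frac{dL}{dw} = \frac{1}{n}\sum_i x_i (w x_i - y_i)\) to zero gives \(w = \sum_i x_i y_i \,/\, \sum_i x_i^2\). A minimal sketch of that closed form, where `fit_slope` is a hypothetical helper name:

```python
def fit_slope(xs, ys):
    """Closed-form minimizer of L(w) for the no-intercept model y = w*x."""
    numerator = sum(x * y for x, y in zip(xs, ys))  # sum of x_i * y_i
    denominator = sum(x * x for x in xs)            # sum of x_i^2
    if denominator == 0:
        raise ValueError("all x values are zero; the slope is undetermined")
    return numerator / denominator

w_star = fit_slope([1.0, 2.0], [2.0, 4.0])
print(w_star)  # (1*2 + 2*4) / (1 + 4) = 10 / 5 = 2.0
```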

---

### Second Derivative:

$$
\frac{d^2L}{dw^2} = \frac{d}{dw} \left( \frac{10w - 20}{4} \right) = \frac{10}{4} = 2.5 > 0
$$

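Because the second derivative is positive, **\( w = 2 \)** is a **local minimum**; since \(L(w)\) is an upward-opening parabola, it is also the **global minimum**. The loss there is \(L(2) = 0\), so the line \(y = 2x\) passes through both data points exactly.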
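
In practice, machine learning models usually approach the minimum iteratively rather than by solving \(\frac{dL}{dw} = 0\) by hand. The sketch below runs plain gradient descent on the same loss; the learning rate, step count, and starting point are arbitrary illustrative choices, not values taken from the derivation.

```python
def grad(w):
    """dL/dw = (10w - 20) / 4 for the two-point dataset."""
    return (10 * w - 20) / 4

def gradient_descent(w0=0.0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from w0."""
    w = w0
    for _ in range(steps):
        w = w - learning_rate * grad(w)
    return w

w_final = gradient_descent()
print(w_final)  # approaches the analytic minimizer w = 2
```

With these settings each update shrinks the distance to \(w = 2\) by a factor of 0.75, so the iterate converges to the same answer as the analytic solution.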
