---

<center>

# **Uncertainty Quantification**

</center>

---


## Least Squares Method:

---

Suppose you have two sets of data: a vector $\mathbf{f} = [f_1, f_2, \dots, f_n]$ of observed values and a vector $\mathbf{d} = [d_1, d_2, \dots, d_n]$ of predictor values. You want to find a constant $k$ such that $f_i \approx k d_i$ for all $i$. The least squares method picks the "best" $k$ by minimizing the sum of the squared differences between each observed $f_i$ and its prediction $k d_i$. This sum is called the error, denoted $S$:

$$S = \sum_{i=1}^n (f_i - k d_i)^2$$


Our goal is to choose $k$ to make $S$ as small as possible.
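To make the error concrete, here is a minimal sketch that evaluates $S$ for a few trial values of $k$. The function name `sum_squared_error` and the sample data are made up for illustration and do not come from any particular dataset.

```python
import numpy as np

def sum_squared_error(f, d, k):
    """Return S = sum_i (f_i - k * d_i)^2 for a trial value of k."""
    f = np.asarray(f, dtype=float)
    d = np.asarray(d, dtype=float)
    residuals = f - k * d          # observed minus predicted
    return float(np.sum(residuals ** 2))

# Illustrative data (assumed, not from the text): f is roughly 2 * d
d = [1.0, 2.0, 3.0, 4.0]
f = [2.1, 3.9, 6.2, 7.8]

# S is small near k = 2 and grows as k moves away from it
for trial_k in (1.5, 2.0, 2.5):
    print(trial_k, sum_squared_error(f, d, trial_k))
```

Trying values by hand like this shows that some $k$ fit better than others; the steps that follow find the best one exactly.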

### <u>Step 1: Define the Error Function</u>
We start by writing the error $S$ as a function of $k$:

$$S(k) = \sum_{i=1}^n (f_i - k d_i)^2$$

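This is a quadratic function of $k$. Because it is a sum of squares, it is never negative. As long as at least one $d_i$ is nonzero, the coefficient of $k^2$, which is $\sum_{i=1}^n d_i^2$, is positive, so $S(k)$ has exactly one minimum.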



### <u>Step 2: Minimize the Error</u>
To find the value of $k$ that minimizes $S(k)$, we use calculus. Take the derivative of $S(k)$ with respect to $k$ and set it equal to zero (this finds the critical point, which will be the minimum because $S(k)$ is a parabola opening upwards):

$$\frac{dS}{dk} = 0$$

Let’s compute the derivative. Expand the sum:

$$S(k) = \sum_{i=1}^n (f_i - k d_i)^2 = \sum_{i=1}^n (f_i^2 - 2 k f_i d_i + k^2 d_i^2)$$

Now, differentiate term by term with respect to $k$ (treating $f_i$ and $d_i$ as constants):

- Derivative of $\sum f_i^2$: 0 (no $k$ in this term).
- Derivative of $\sum -2 k f_i d_i$: $-2 \sum f_i d_i$ (since $f_i$ and $d_i$ are constants).
- Derivative of $\sum k^2 d_i^2$: $\sum 2 k d_i^2$ (since $\frac{d}{dk} k^2 = 2k$).

So:

$$\frac{dS}{dk} = -2 \sum_{i=1}^n f_i d_i + 2 k \sum_{i=1}^n d_i^2$$

Factor out the 2:

$$\frac{dS}{dk} = 2 \left( k \sum_{i=1}^n d_i^2 - \sum_{i=1}^n f_i d_i \right)$$

Set the derivative equal to zero:

$$2 \left( k \sum_{i=1}^n d_i^2 - \sum_{i=1}^n f_i d_i \right) = 0$$

Divide both sides by 2:

$$k \sum_{i=1}^n d_i^2 - \sum_{i=1}^n f_i d_i = 0$$
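One way to check the derivative formula is to compare it with a finite-difference estimate. The sketch below does that at an arbitrary point `k0`; the data, the step size `h`, and the helper names `S` and `dS_dk` are assumptions chosen for illustration.

```python
import numpy as np

# Illustrative data (assumed, not from the text)
d = np.array([1.0, 2.0, 3.0, 4.0])
f = np.array([2.1, 3.9, 6.2, 7.8])

def S(k):
    """Sum of squared errors for a given k."""
    return np.sum((f - k * d) ** 2)

def dS_dk(k):
    """Analytic derivative: 2 * (k * sum(d_i^2) - sum(f_i * d_i))."""
    return 2.0 * (k * np.sum(d ** 2) - np.sum(f * d))

k0 = 1.5      # arbitrary point at which to compare
h = 1e-6      # small step for the central difference
numeric = (S(k0 + h) - S(k0 - h)) / (2.0 * h)
analytic = dS_dk(k0)

# The two values should agree up to floating-point rounding
print(numeric, analytic)
```

The derivative is negative at `k0 = 1.5`, which says that $S$ is still decreasing there and the minimum lies at a larger $k$.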


### <u>Step 3: Solve for $k$</u>
Rearrange the equation:

$$k \sum_{i=1}^n d_i^2 = \sum_{i=1}^n f_i d_i$$

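Now, solve for $k$ by dividing both sides by $\sum_{i=1}^n d_i^2$, which is nonzero whenever $\mathbf{d}$ is not the zero vector: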

$$k = \frac{\sum_{i=1}^n f_i d_i}{\sum_{i=1}^n d_i^2}$$
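The closed-form result translates directly into a few lines of code. This is a minimal sketch in plain Python; the function name `best_scale` and the sample data are hypothetical, chosen only to illustrate the formula.

```python
def best_scale(f, d):
    """Least-squares k for f_i ≈ k * d_i: sum(f_i * d_i) / sum(d_i^2)."""
    if len(f) != len(d):
        raise ValueError("f and d must have the same length")
    numerator = sum(fi * di for fi, di in zip(f, d))
    denominator = sum(di * di for di in d)
    if denominator == 0:
        # Every d_i is zero, so the formula would divide by zero
        raise ValueError("all d_i are zero, so k is not determined")
    return numerator / denominator

# Illustrative data (assumed, not from the text)
d = [1.0, 2.0, 3.0, 4.0]
f = [2.1, 3.9, 6.2, 7.8]

k_best = best_scale(f, d)
print(k_best)
```

For this data the sums are $\sum f_i d_i = 59.7$ and $\sum d_i^2 = 30$, so $k$ comes out close to $1.99$.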


### <u>Step 4: Vector Notation</u>
In linear algebra, we can write this more compactly using vectors. The term $\sum_{i=1}^n f_i d_i$ is the dot product of $\mathbf{f}$ and $\mathbf{d}$, written as $\mathbf{d}^T \mathbf{f}$ (assuming column vectors, with $\mathbf{d}^T$ being the transpose). Similarly, $\sum_{i=1}^n d_i^2 = \mathbf{d}^T \mathbf{d}$, the dot product of $\mathbf{d}$ with itself. So the expression becomes:

$$k = \frac{\mathbf{d}^T \mathbf{f}}{\mathbf{d}^T \mathbf{d}}$$

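or, equivalently, in the form that generalizes to a matrix of several predictors:

$$k = (\mathbf{d}^T \mathbf{d})^{-1} \mathbf{d}^T \mathbf{f}$$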
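In NumPy the vector form is a one-liner. The sketch below computes $k$ from the two dot products and, as a cross-check, from the general-purpose solver `np.linalg.lstsq`, treating $\mathbf{d}$ as a design matrix with a single column. The data and variable names are assumptions for illustration.

```python
import numpy as np

# Illustrative data (assumed, not from the text)
d = np.array([1.0, 2.0, 3.0, 4.0])
f = np.array([2.1, 3.9, 6.2, 7.8])

# Vector form: k = (d^T f) / (d^T d)
k_dot = (d @ f) / (d @ d)

# Same fit via the general least-squares solver.
# lstsq expects a 2-D matrix, so d becomes an n-by-1 column.
D = d.reshape(-1, 1)
k_lstsq, *_ = np.linalg.lstsq(D, f, rcond=None)

# Both approaches should give the same value up to rounding
print(k_dot, k_lstsq[0])
```

The dot-product version is enough for a single predictor; a solver such as `lstsq` becomes useful once $\mathbf{d}$ is replaced by a matrix with several columns.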

---

### <u>Why This Makes Sense</u>
This $k$ is the optimal scalar that makes $k \mathbf{d}$ as close as possible to $\mathbf{f}$ in terms of squared error. Intuitively, when every $d_i$ is nonzero, $k$ is a weighted average of the ratios $f_i / d_i$ with weights $d_i^2$, so data points with larger $|d_i|$ have more influence on the result.
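Both interpretations can be checked numerically. The sketch below, using made-up data with all $d_i$ nonzero, computes $k$ as an average of the ratios $f_i / d_i$ weighted by $d_i^2$. It also confirms that the leftover residual $\mathbf{f} - k\mathbf{d}$ has zero dot product with $\mathbf{d}$, which is the condition from Step 2 rewritten in vector form.

```python
import numpy as np

# Illustrative data (assumed, not from the text); every d_i is nonzero
d = np.array([1.0, 2.0, 3.0, 4.0])
f = np.array([2.1, 3.9, 6.2, 7.8])

# Closed-form least-squares solution
k = (d @ f) / (d @ d)

# Average of the per-point ratios f_i / d_i, weighted by d_i^2
k_weighted = np.average(f / d, weights=d ** 2)

# Residual after the fit; its dot product with d should vanish
residual = f - k * d
residual_dot = d @ residual

print(k, k_weighted, residual_dot)
```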
