# MTH 651: Advanced Numerical Analysis

## Lecture 5

### Topics

* Inner-product spaces
* Hilbert spaces
* Projections onto subspaces

#### Textbook references

Sections 2.1, 2.2, 2.3

#### Inner-Product Spaces

> **_DEFINITION:_** Given a space $V$, a **bilinear form** is a mapping $b : V \times V \to \mathbb{R}$ such that
>
> * $w \mapsto b(v, w)$ is linear for all $v \in V$, and
> * $v \mapsto b(v, w)$ is linear for all $w \in V$
>
> i.e.
>
> * $b(u, av + w) = a b(u, v) + b(u, w)$, and
> * $b(a u + v, w) = a b(u, w) + b(v, w)$
>
> for all $u, v, w \in V$ and $a \in \mathbb{R}$.
>
> $b(\cdot, \cdot)$ is **symmetric** if
>
> * $b(u, v) = b(v, u)$ for all $u, v \in V$
>
> An **inner product**, here denoted $(\cdot, \cdot)$, is a symmetric bilinear form satisfying
>
> * $(v, v) > 0$ for all $0 \neq v \in V$
>     * (this property is called **positive definiteness**)

> **_EXAMPLES:_** 
> 
> * $\mathbb{R}^n$ with the usual Euclidean inner product, $$ (x, y) = \sum_{i=1}^n x_i y_i $$
> * $L^2(\Omega)$ with the $L^2$ inner product $$ (u, v)_{L^2(\Omega)} = \int_\Omega u(x) v(x) \, dx $$
> * $W^k_2(\Omega)$ with inner product $$ (u, v)_k = \sum_{|\alpha| \leq k} (D^\alpha u, D^\alpha v)_{L^2(\Omega)} $$
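
The three examples can be made concrete with a small numerical sketch. Everything here is an illustrative assumption rather than part of the lecture: the function names `euclidean_inner`, `l2_inner`, and `h1_inner` are hypothetical, $\Omega$ is taken to be the interval $(0, 1)$, the integrals are approximated by the midpoint rule, and derivatives are supplied by hand.

```python
import math


def euclidean_inner(x, y):
    """Euclidean inner product (x, y) = sum_i x_i y_i on R^n."""
    return sum(xi * yi for xi, yi in zip(x, y))


def l2_inner(u, v, a=0.0, b=1.0, n=1000):
    """Midpoint-rule approximation of the L^2(a, b) inner product of u and v."""
    h = (b - a) / n
    return h * sum(u(a + (i + 0.5) * h) * v(a + (i + 0.5) * h) for i in range(n))


def h1_inner(u, du, v, dv, a=0.0, b=1.0, n=1000):
    """Approximate (u, v)_1 = (u, v)_{L^2} + (u', v')_{L^2} in one dimension."""
    return l2_inner(u, v, a, b, n) + l2_inner(du, dv, a, b, n)


def s1(x):
    return math.sin(math.pi * x)


def ds1(x):
    return math.pi * math.cos(math.pi * x)


def s2(x):
    return math.sin(2 * math.pi * x)


print(euclidean_inner([1, 2, 3], [4, 5, 6]))  # 1*4 + 2*5 + 3*6 = 32
print(l2_inner(s1, s1))                       # close to 1/2
print(l2_inner(s1, s2))                       # close to 0: s1 and s2 are orthogonal
print(h1_inner(s1, ds1, s1, ds1))             # close to 1/2 + pi^2/2
```

Each function is symmetric in its two arguments and linear in each one separately, which mirrors the definition above; positive definiteness of the discrete `l2_inner` only holds approximately, since the quadrature samples finitely many points.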

> **_NOTATION:_** The spaces $W^k_2(\Omega)$ will be denoted $H^k(\Omega)$. The reason for the letter $H$ will soon be clear.

> **_THEOREM (Cauchy-Schwarz Inequality):_**
> Given an inner-product space $V$,
> $$
>  | (u, v) | \leq (u, u)^{1/2} (v, v)^{1/2}
> $$
> Equality holds iff $u$ and $v$ are linearly dependent.

> **_Proof._**
> For any $t \in \mathbb{R}$
> $$
>  0 \leq (u - tv, u - tv) = (u, u) - 2t(u, v) + t^2 (v, v).
> $$
>
> If either $u$ or $v$ is zero, then both sides of the inequality are zero.
> So, we may assume WLOG that $u$ and $v$ are both nonzero, and in particular, $(v, v) > 0$.
>
> Let $t = (u, v) / (v, v)$ and insert this into the above, obtaining
> $$
>  0 \leq (u,u) - 2 \frac{(u,v)^2}{(v,v)} + \frac{(u,v)^2}{(v,v)^2}(v,v) = (u,u) - (u,v)^2 / (v,v)
> $$
> Rearranging, we obtain
> $$
>  (u, v)^2 \leq (u, u) (v, v),
> $$
> and taking square roots of both sides proves the inequality.
>
> It remains to show that equality holds iff $u$ and $v$ are linearly dependent.
>
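> If $u$ and $v$ are linearly dependent (and nonzero) then $u = t v$ for some $t \in \mathbb{R}$, and taking the inner product with $v$ shows that $t = (u, v) / (v, v)$, the value chosen above.
> Then, we have $(u - tv, u - tv) = 0$, so every inequality in the argument above becomes an equality.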
>
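> Finally, suppose that equality holds and that $v \neq 0$ (if $v = 0$, then $u$ and $v$ are trivially linearly dependent).
> Following the argument above in reverse with $t = (u, v) / (v, v)$, this shows that
> $$ (u - tv, u - tv) = 0 $$
> which by positive definiteness of the inner product means that $u - tv = 0$, and so $u = tv$, completing the proof.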
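
The quantities in the proof are easy to check numerically. The sketch below assumes the Euclidean inner product on $\mathbb{R}^n$; the helper names `cauchy_schwarz_gap` and `residual` are hypothetical. It evaluates the gap $(u,u)(v,v) - (u,v)^2$ and the residual $(u - tv, u - tv)$ with $t = (u,v)/(v,v)$, which the proof shows are related by $(v,v)\,(u - tv, u - tv) = (u,u)(v,v) - (u,v)^2$.

```python
import random


def inner(x, y):
    """Euclidean inner product on R^n (an assumption for this sketch)."""
    return sum(xi * yi for xi, yi in zip(x, y))


def cauchy_schwarz_gap(u, v):
    """Return (u,u)(v,v) - (u,v)^2, which the theorem says is nonnegative."""
    return inner(u, u) * inner(v, v) - inner(u, v) ** 2


def residual(u, v):
    """Return (u - t v, u - t v) with t = (u,v)/(v,v); requires v != 0."""
    t = inner(u, v) / inner(v, v)
    r = [ui - t * vi for ui, vi in zip(u, v)]
    return inner(r, r)


# Random spot check of the inequality (allowing for rounding error).
random.seed(0)
for _ in range(1000):
    u = [random.uniform(-1, 1) for _ in range(5)]
    v = [random.uniform(-1, 1) for _ in range(5)]
    assert cauchy_schwarz_gap(u, v) >= -1e-12

print(cauchy_schwarz_gap([1, 2, 3], [2, 4, 6]))  # 0: v = 2u, linearly dependent
print(residual([1, 2, 3], [2, 4, 6]))            # 0.0: u - t v vanishes
print(cauchy_schwarz_gap([1, 0], [1, 1]))        # 1: strict inequality
```

A random spot check is of course no substitute for the proof; it only illustrates that the gap vanishes exactly in the linearly dependent case.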

> **_DEFINITION:_** The **norm** $\| u \|$ **induced by the inner product** $(\cdot, \cdot)$ is defined by
> $$ \| u \| = \sqrt{(u, u)} $$

> **_PROPOSITION:_** The induced norm $\| u \|$ is indeed a norm, i.e. it satisfies the properties
>
> * $\| u \| \geq 0$ for all $u \in V$
> * $\| u \| = 0$ iff $ u = 0$
> * $\| a u \| = |a| \| u \|$ for all $a \in \mathbb{R}$
> * $\| u + v \| \leq \|u \| + \| v \|$ (triangle inequality)
>

> **_Proof._** 
> The first two properties follow immediately from positive definiteness of the inner product.
> We also see immediately by bilinearity,
> $$ (a u, a u)^{1/2} = ( a^2 (u, u) )^{1/2} = |a| \| u \| $$
> It remains to prove the triangle inequality.
>
> $$
> \begin{aligned}
>  \| u + v \|^2
>     &= (u+v, u+v) \\
>     &= (u,u) + 2(u,v) + (v,v) \\
>     &\leq (u,u) + 2(u,u)^{1/2} (v,v)^{1/2} + (v,v) \\
>     &= ( \| u \| + \| v \| )^2
> \end{aligned}
> $$
> Taking the square root of both sides proves the result.

#### Hilbert Spaces

> **_DEFINITION:_** An inner-product space $V$ is a **Hilbert space** if it is complete with respect to the norm induced by the inner product.

All of the examples of inner product spaces shown above are also examples of Hilbert spaces.

The norm induced by the inner product $(\cdot, \cdot)_k$ defined on $W^k_2(\Omega) = H^k(\Omega)$ is the **same** as the Sobolev norm $\| \cdot \|_{W^k_2(\Omega)}$ considered previously.
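
As a small worked example (with the illustrative choices $\Omega = (0, 1)$, $k = 1$, and $u(x) = x$, none of which come from the lecture), the only multi-indices with $|\alpha| \leq 1$ in one dimension are $\alpha = 0$ and $\alpha = 1$, so

$$
\begin{aligned}
 \| u \|_{H^1(\Omega)}^2 = (u, u)_1
    &= (u, u)_{L^2(\Omega)} + (u', u')_{L^2(\Omega)} \\
    &= \int_0^1 x^2 \, dx + \int_0^1 1^2 \, dx \\
    &= \tfrac{1}{3} + 1 = \tfrac{4}{3},
\end{aligned}
$$

and therefore $\| u \|_{H^1(\Omega)} = 2 / \sqrt{3}$.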

> **_DEFINITION:_** A linear subset $S \subseteq H$ of a Hilbert space $H$ that is closed in $H$ is called a **subspace**. $S$ is also a Hilbert space. (Why?)

Here are some examples of subspaces:

* $H$ and $\{ 0 \}$ are the extreme examples.
* Let $T : H \to K$ be a continuous linear map from $H$ into some other normed linear space $K$. Then the kernel of $T$ is a subspace.
* Let $x \in H$ and let $x^\perp$ denote the set of all $v \in H$ **orthogonal to** $x$, i.e.
   $$ x^\perp = \{ v \in H : (v, x) = 0 \} $$
   Then $x^\perp$ is a subspace.
   Proof. Let $L_x : H \to \mathbb{R}$ be defined by $L_x(v) = (v, x)$. Then $L_x$ is linear and $x^\perp = \operatorname{ker}(L_x)$. If $L_x$ is continuous, then the result follows from the previous example. To see this, note that by the Cauchy-Schwarz inequality
   $$ | L_x(v) | = | (v, x) | \leq \| x \| \| v \| $$
   so $L_x$ is bounded, hence continuous, and the result holds.
* For any subset $M \subseteq H$, the set $M^\perp = \{ v \in H : (v, x) = 0 \text{ for all } x \in M \}$ is a subspace of $H$, since it is the intersection of the subspaces $x^\perp$ over all $x \in M$.

> **_PROPOSITION:_** Let $H$ be a Hilbert space.
> 1. For any subsets $M, N \subseteq H$, $M \subseteq N$ implies that $N^\perp \subseteq M^\perp$
> 2. For any subset $M \subseteq H$ with $0 \in M$, we have $M \cap M^\perp = \{ 0 \}$
> 3. $\{ 0 \}^\perp = H$
> 4. $H^\perp = \{ 0 \}$

> **_Proof._** 
>
> 1. Suppose $M \subseteq N$. Let $v \in N^\perp$. Take any $x \in M$. Then, $x \in N$ since $M \subseteq N$. So $(x, v) = 0$. This implies that $v \in M^\perp$, so $N^\perp \subseteq M^\perp$.
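> 2. Suppose that $0 \in M$. Then $0 \in M \cap M^\perp$, since $(0, x) = 0$ for all $x \in M$. Conversely, let $v \in M \cap M^\perp$. Then, $(v, v) = 0$ and so $v = 0$, hence $M \cap M^\perp = \{ 0 \}$.
> 3. Let $v \in H$ be arbitrary. Then $(v, 0) = 0$, so $v \in \{ 0 \}^\perp$. Hence $H \subseteq \{ 0 \}^\perp$, and the reverse inclusion is trivial.
> 4. Let $v \in H^\perp$. Then, since $v$ is also in $H$, we have $(v, v) = 0$, and so $v = 0$. Conversely, $0 \in H^\perp$ since $(0, x) = 0$ for all $x \in H$.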
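
Statements 1 and 2 can be illustrated in $H = \mathbb{R}^3$ with the Euclidean inner product. This is only a sketch under stated assumptions: the helper `in_perp` is hypothetical, the sets `M` and `N` are finite, and membership in $M^\perp$ is tested only for a hand-picked list of candidate vectors rather than for all of $H$.

```python
def inner(x, y):
    """Euclidean inner product on R^n (an assumption for this sketch)."""
    return sum(xi * yi for xi, yi in zip(x, y))


def in_perp(v, M, tol=1e-12):
    """True if v is orthogonal to every element of the finite set M."""
    return all(abs(inner(v, x)) <= tol for x in M)


M = [(1, 0, 0)]
N = [(1, 0, 0), (0, 1, 0)]  # M is a subset of N

# A few hand-picked vectors to test for membership in the complements.
candidates = [(0, 0, 1), (0, 1, 0), (0, 2, -3), (1, 1, 1), (0, 0, 0)]
M_perp = [v for v in candidates if in_perp(v, M)]
N_perp = [v for v in candidates if in_perp(v, N)]

print(M_perp)  # [(0, 0, 1), (0, 1, 0), (0, 2, -3), (0, 0, 0)]
print(N_perp)  # [(0, 0, 1), (0, 0, 0)]

# Statement 1: every candidate in N_perp is also in M_perp.
print(all(v in M_perp for v in N_perp))  # True

# Statement 2: if 0 is in M0, the only element of M0 orthogonal to M0 is 0.
M0 = [(0, 0, 0), (1, 0, 0)]
print([x for x in M0 if in_perp(x, M0)])  # [(0, 0, 0)]
```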

> **_THEOREM (Parallelogram Law):_**
> For all $v, w$ in an inner-product space $V$ with induced norm $\| \cdot \|$,
> $$ \| v + w \|^2 + \| v - w \|^2 = 2\left( \| v \|^2 + \| w \|^2 \right) $$

> **_Proof._** 
>
> $$ \begin{aligned}
>  \| v + w \|^2 + \| v - w \|^2
>     &= (v + w, v + w) + (v - w, v - w) \\
>     &= \| v \|^2 + 2(v, w) + \|w\|^2 + \|v\|^2 - 2(v, w) + \|w\|^2 \\
>     &= 2\left( \| v \|^2 + \| w \|^2 \right).
> \end{aligned} $$
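
A quick numerical check of the parallelogram law, as a sketch with hypothetical helper names: it evaluates the defect $\| v + w \|^2 + \| v - w \|^2 - 2(\| v \|^2 + \| w \|^2)$ for the Euclidean norm and, for contrast, for the max norm on $\mathbb{R}^2$. The nonzero defect in the second case shows that the max norm cannot be induced by any inner product.

```python
import math


def norm_euclid(x):
    """Norm induced by the Euclidean inner product."""
    return math.sqrt(sum(xi * xi for xi in x))


def norm_max(x):
    """Max norm; a norm that is not induced by an inner product."""
    return max(abs(xi) for xi in x)


def parallelogram_defect(norm, v, w):
    """Return ||v+w||^2 + ||v-w||^2 - 2(||v||^2 + ||w||^2) for the given norm."""
    s = [a + b for a, b in zip(v, w)]
    d = [a - b for a, b in zip(v, w)]
    return norm(s) ** 2 + norm(d) ** 2 - 2 * (norm(v) ** 2 + norm(w) ** 2)


v, w = (1, 0), (0, 1)
print(parallelogram_defect(norm_euclid, v, w))  # approximately 0 (rounding only)
print(parallelogram_defect(norm_max, v, w))     # -2: the law fails
```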

#### Projections

> **_PROPOSITION._** 
> Let $M \subseteq H$ be a subspace.
> Let $v \in H \setminus M$.
> Define
>  $$ \delta = \inf \{ \| v - w \| : w \in M \}. $$
> Then, there exists some $w_0 \in M$ such that
>
> 1. $\| v - w_0 \| = \delta$, i.e., there exists a point $w_0 \in M$ closest to $v$
> 2. $(v - w_0) \in M^\perp$

> **_Proof._** 
>
> We first prove statement 1.
> 
> Let $\{ w_n \} \subseteq M$ be a minimizing sequence, i.e. $\| v - w_n \| \to \delta$.
> We show that $\{ w_n \}$ is a Cauchy sequence.
>
> Consider two elements of the sequence, $w_n$ and $w_m$.
> By the parallelogram law,
> $$
>  \| (w_n - v) + (w_m - v) \|^2 + \| (w_n - v) - (w_m - v) \|^2 = 2( \| w_n - v \|^2 + \| w_m - v \|^2 ).
> $$
> Rearranging,
> $$ \begin{aligned}
>  \| (w_n - v) - (w_m - v) \|^2
>     &= \| w_n - w_m \|^2 \\
>     &= 2( \| w_n - v \|^2 + \| w_m - v \|^2 ) - \| (w_n - v) + (w_m - v) \|^2 \\
>     &= 2( \| w_n - v \|^2 + \| w_m - v \|^2 ) - 4 \| \tfrac{1}{2} (w_n + w_m) - v \|^2 \\
> \end{aligned} $$
> Since $M$ is a linear space, we have $\frac{1}{2}(w_n + w_m) \in M$, and so $\| \tfrac{1}{2} (w_n + w_m) - v \| \geq \delta$.
> Therefore,
> $$
>  2( \| w_n - v \|^2 + \| w_m - v \|^2 ) - 4 \| \tfrac{1}{2} (w_n + w_m) - v \|^2
>     \leq 2 ( \| w_n - v \|^2 + \| w_m - v \|^2 ) - 4 \delta^2.
> $$
> As $n, m \to \infty$ we have $\| w_n - v \|, \| w_m - v \| \to \delta$, and so
> $$ \| w_n - w_m \|^2 \leq 2 ( \| w_n - v \|^2 + \| w_m - v \|^2 ) - 4 \delta^2 \to 0 $$
> proving that the sequence is Cauchy.
>
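> So, since $H$ is complete, $w_n$ converges to some limit $w_0 \in H$.
> Since $M$ is a subspace, it is closed, so $w_0 \in M$. By continuity of the norm, $\| v - w_0 \| = \lim_{n \to \infty} \| v - w_n \| = \delta$, proving statement 1.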
>
> Now, we prove statement 2.
>
> Let $z = v - w_0$, so that $\| z \| = \delta$. We want to show that $z \in M^\perp$.
> Let $w \in M$ be arbitrary.
> Since $w_0$ minimizes the distance to $v$, the function
> $$ \| v - (w_0 + t w) \|^2 $$
> has a minimum at $t = 0$ (since $w_0 + t w \in M$ for $t \in \mathbb{R}$).
> Letting $z = v - w_0$, we see
> $$ \| v - (w_0 + t w) \|^2 = \| z - tw \|^2 = (z - tw, z - tw) = (z, z) - 2t(z, w) + t^2 (w, w) $$
> Since this function has a minimum at $t = 0$, we see that its derivative must vanish at $t = 0$, so
> $$ 0 = \frac{d}{dt} \| v - (w_0 + tw) \|^2 \Big|_{t=0} = - 2 (z, w) $$
> and therefore $(z, w) = 0$, proving the proposition.
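
In finite dimensions the closest point can be computed explicitly. The sketch below is an illustration under stated assumptions, not the construction used in the proof: it takes $H = \mathbb{R}^3$ with the Euclidean inner product, $M$ the span of two given vectors, and builds an orthonormal basis of $M$ by Gram-Schmidt (a swapped-in technique; the proof itself uses a minimizing sequence). The names `orthonormal_basis` and `project` are hypothetical.

```python
import math


def inner(x, y):
    """Euclidean inner product on R^n (an assumption for this sketch)."""
    return sum(xi * yi for xi, yi in zip(x, y))


def orthonormal_basis(vectors, tol=1e-12):
    """Gram-Schmidt: orthonormal basis of span(vectors), dropping dependent ones."""
    basis = []
    for a in vectors:
        r = list(a)
        for q in basis:
            c = inner(r, q)
            r = [ri - c * qi for ri, qi in zip(r, q)]
        nrm = math.sqrt(inner(r, r))
        if nrm > tol:
            basis.append([ri / nrm for ri in r])
    return basis


def project(v, spanning_vectors):
    """Closest point w0 to v in M = span(spanning_vectors)."""
    w0 = [0.0] * len(v)
    for q in orthonormal_basis(spanning_vectors):
        c = inner(v, q)
        w0 = [wi + c * qi for wi, qi in zip(w0, q)]
    return w0


M = [[1.0, 1.0, 0.0], [0.0, 1.0, 1.0]]  # spanning set of the subspace
v = [1.0, 2.0, 3.0]

w0 = project(v, M)
z = [vi - wi for vi, wi in zip(v, w0)]
delta = math.sqrt(inner(z, z))

print(w0)                         # approximately [1/3, 8/3, 7/3]
print([inner(z, x) for x in M])   # approximately [0, 0]: z lies in M-perp
print(delta)                      # approximately 2/sqrt(3)
```

Perturbing `w0` within $M$, for example to `w0 + t * M[0]` for small `t`, can only increase the distance to `v`, which is the minimality used in the proof of statement 2.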

As a consequence, let $M \subseteq H$ be a subspace.
Then, for any $v \in H$, we have the decomposition
$$ v = w_0 + w_1$$
where $w_0 \in M$ and $w_1 = v - w_0 \in M^\perp$ (if $v \in M$, simply take $w_0 = v$ and $w_1 = 0$).

This decomposition is unique.
To see this, let $v = z_0 + z_1$ be another such decomposition, with $z_0 \in M$ and $z_1 \in M^\perp$.
Then,
$$
    0 = (w_0 - z_0) + (w_1 - z_1)
$$
and so
$$
    M \ni w_0 - z_0 = z_1 - w_1 \in M^\perp
$$
and since $M \cap M^\perp = \{ 0 \}$ it follows that $w_0 = z_0$ and $w_1 = z_1$.

We can therefore define the orthogonal projections $P_M$ and $P_M^\perp$ by
$$
\begin{aligned}
    P_M &: v \mapsto w_0 \\
    P_M^\perp &: v \mapsto w_1
\end{aligned}
$$

Note that $P_M^\perp = P_{M^\perp}$. Why?

Another way of writing this is:

> $H = M \oplus M^\perp$

> **_DEFINITION:_** A linear operator $P : V \to V$ is a **projection** if $P^2 = P$, i.e. $P ( P z) = P z$ for all $z \in V$.

$P_M$ and $P_M^\perp$ are projections. Why?
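
The simplest case can be checked by hand or by machine. The sketch below assumes $H = \mathbb{R}^3$ with the Euclidean inner product and $M = \operatorname{span}\{ x \}$ for a single nonzero vector $x$, where $P_M v = \frac{(v, x)}{(x, x)} x$; the function names `proj_onto_line` and `complement` are hypothetical. It illustrates, but does not prove, that both operators are idempotent.

```python
def inner(x, y):
    """Euclidean inner product on R^n (an assumption for this sketch)."""
    return sum(xi * yi for xi, yi in zip(x, y))


def proj_onto_line(x):
    """Return P_M for M = span{x}, x != 0: P_M v = (v, x)/(x, x) x."""
    xx = inner(x, x)

    def P(v):
        c = inner(v, x) / xx
        return [c * xi for xi in x]

    return P


def complement(P):
    """Return the complementary map v -> v - P v."""
    def Q(v):
        return [vi - pi for vi, pi in zip(v, P(v))]

    return Q


x = [1.0, 2.0, 2.0]
P = proj_onto_line(x)
Q = complement(P)
v = [3.0, 0.0, 3.0]

print(P(v))            # [1.0, 2.0, 2.0]   (w_0, the component in M)
print(Q(v))            # [2.0, -2.0, 1.0]  (w_1, the component in M-perp)
print(P(P(v)) == P(v)) # True: P^2 = P on this vector
print(Q(Q(v)) == Q(v)) # True: the complementary map is idempotent too
print(inner(Q(v), x))  # 0.0: w_1 is orthogonal to M
```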