## Unit 7.5 : Inner Product Spaces

With the dot product defined in Unit 6, we were able to study the following properties of R<sup>n</sup>:

1. Length or norm. $ \lVert u \rVert = \sqrt{ u \cdot u } $
2. Distance of 2 vectors. $ \lVert u - v \rVert $
3. Orthogonality of 2 vectors. $ u \cdot v = 0 $

How can we describe the same properties of vectors in other types of vector spaces?

For example,

1. How do we define the norm of a function f(x) in C([a,b])?
2. How do we determine whether two polynomials f(x) and g(x) are orthogonal?
3. How do we calculate the distance between two matrices A and B $ \in \mathcal{M}_{4 \times 3} $?

### Definition : Inner Product

Let V be a vector space over a field F (which is either R or C).  
An **Inner Product** on V is a function that assigns a scalar in F to any pair of vectors u and v, denoted $ \langle u,v \rangle $,  
such that, for any vectors u, v and w in V and any scalar c, the following axioms hold.

**Axioms of an Inner Product** (the dot product satisfies all of them)

1. $ \langle u, u \rangle \in R $ and $ \langle u,u \rangle \gt 0 $ if $ u \ne 0 $
2. $ \langle u,v \rangle = \langle v,u \rangle^{*} $
3. $ \langle u+v, \ w \rangle = \langle u,w \rangle + \langle v,w \rangle$
4. $ \langle cu,v \rangle = c \ \langle u,v \rangle $

### Definition : Inner Product Space

A vector space endowed with a particular inner product is called an **Inner Product Space**.

### Example

$ V = C([a,b]) = \big\{ \ f \ | \ f: \ [a,b] \to R, $ f is continuous $ \big\} $ is a vector space,  
and the function $ \langle \cdot, \cdot \rangle : V \times V \ \to \ R $ defined by 

$ \langle f, g \rangle = \int_a^b \ f(t) \ g(t) \ dt $

$ \forall f, g \in V $ is an inner product on V.

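Proof.

Axiom 1.

$ f^2 $ is continuous and non-negative, so $ \langle f, f \rangle \in R $.

If $ f \ne 0 $, then $ f^2(t_0) \gt 0 $ for some $ t_0 \in [a, b] $.

By continuity, there exist $ p \gt 0 $ and an interval $ I \subseteq [a,b] $ of length $ r \gt 0 $ containing $ t_0 $ such that $ f^2(t) \gt p $ for all $ t \in I $.

Hence $ \langle f, f \rangle = \int_a^b f^2 (t) \ dt \ge \int_I f^2(t) \ dt \ge r \cdot p \gt 0 $.

Axioms 2 to 4 follow from the commutativity of multiplication and the linearity of the integral.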
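
The integral inner product on C([a,b]) can also be explored numerically. The following is a minimal sketch, assuming composite Simpson's rule as the quadrature method; the helper name `inner` and the number of subintervals `n` are illustrative choices, not part of the unit.

```python
import math

def inner(f, g, a, b, n=1000):
    """Approximate <f, g> = integral of f(t) g(t) over [a, b].

    Uses composite Simpson's rule with n subintervals (n must be even).
    """
    h = (b - a) / n
    total = f(a) * g(a) + f(b) * g(b)
    for i in range(1, n):
        t = a + i * h
        weight = 4 if i % 2 else 2      # Simpson weights 4, 2, 4, 2, ...
        total += weight * f(t) * g(t)
    return total * h / 3

# Axiom 1 for the nonzero function f(t) = t on [0, 1]: <f, f> = 1/3 > 0
norm_sq = inner(lambda t: t, lambda t: t, 0.0, 1.0)
print(norm_sq)

# Axiom 2 in the real case: <f, g> = <g, f>
print(inner(math.sin, math.exp, 0.0, 1.0), inner(math.exp, math.sin, 0.0, 1.0))
```

A numerical check like this only illustrates the axioms for particular functions; it does not replace the proof.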

### Example : Frobenius Inner Product

$ \langle A, B \rangle $ = trace $ (A\ B^T) $ is the Frobenius Inner Product on $ R^{n \times n} $.

trace : the sum of the entries on the main diagonal.

### Definition

For any vector v in an inner product space V,  
the **norm** or length of v is denoted and defined as $ \lVert v \rVert = \langle v, v \rangle^{1/2} $

The **distance** between $ u, v \in V $ is defined as $ \lVert u - v \rVert $

### Example

The Frobenius norm in $ R^{n \times n} $ is $ \lVert A \rVert = \big( \sum_{1 \le i, j \le n} (a_{ij})^2 \big)^{1/2} $,

since $ \langle A, B \rangle $ = trace $ (A\ B^T) = \sum_{1 \le i,j \le n} a_{ij} b_{ij} $
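
The identity trace $ (A\ B^T) = \sum a_{ij} b_{ij} $ can be checked on a small example. This is a sketch in plain Python with matrices stored as lists of rows; the function names and the sample matrices are assumptions made for illustration.

```python
import math

def frobenius_inner(A, B):
    """Entrywise form: sum of a_ij * b_ij over all i, j."""
    n = len(A)
    return sum(A[i][j] * B[i][j] for i in range(n) for j in range(n))

def trace_of_product(A, B):
    """trace(A B^T), computed by forming A B^T explicitly."""
    n = len(A)
    ABt = [[sum(A[i][k] * B[j][k] for k in range(n)) for j in range(n)]
           for i in range(n)]
    return sum(ABt[i][i] for i in range(n))

def frobenius_norm(A):
    """||A|| = <A, A>^(1/2)."""
    return math.sqrt(frobenius_inner(A, A))

A = [[1, 2], [3, 4]]
B = [[0, 1], [-1, 2]]

print(frobenius_inner(A, B))    # 1*0 + 2*1 + 3*(-1) + 4*2 = 7
print(trace_of_product(A, B))   # same value, 7
print(frobenius_norm(A))        # sqrt(1 + 4 + 9 + 16) = sqrt(30)
```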

### Property

Inner products and norms satisfy the elementary properties  
stated in Theorem 6.1, the Cauchy-Schwarz inequality,  
and the triangle inequality in Unit 6.1.

- ( a ) $ \vec{u} \ \cdot \ \vec{u} = \lVert u \rVert^2 $
- ( b ) $ \vec{u} \ \cdot \ \vec{u} = 0 $ iff. $ \vec{u} = \vec{0} $
- ( c ) $ \vec{u} \ \cdot \ \vec{v} = \vec{v} \ \cdot \ \vec{u} $
- ( d ) $ \vec{u} \ \cdot \ ( \vec{v} + \vec{w} ) = \vec{u} \cdot \vec{v} + \vec{u} \cdot \vec{w} $
- ( e ) $ ( \vec{v} + \vec{w} ) \ \cdot \ \vec{u} = \vec{v} \cdot \vec{u} + \vec{w} \cdot \vec{u} $
- ( f ) $ ( c \vec{u} ) \ \cdot \ \vec{v} =  c ( \vec{u} \cdot \vec{v} ) = \vec{u} \cdot ( c \vec{v} ) $
- ( g ) $ \lVert c u \rVert = \lvert c \rvert \ \lVert u \rVert $

#### Pythagorean Theorem

u and v are orthogonal iff

$ \lVert u + v \rVert^2 = \lVert u \rVert^2 + \lVert v \rVert^2 $

#### Cauchy-Schwarz Inequality

$ \lvert \langle u, v \rangle \rvert \le \lVert u \rVert \ \cdot \ \lVert v \rVert $

#### Triangle Inequality

$ \lVert u + v \rVert \le \lVert u \rVert + \lVert v \rVert $

### Example : The Cauchy-Schwarz inequality in C([a,b])

$ \big( \int_a^b f(t) g(t) \ dt \big)^2 \le \big( \int_a^b f^2(t) \ dt \big) \ \big( \int_a^b g^2(t) \ dt \big) $
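
As a quick sanity check of this inequality, take $ f(t) = t $ and $ g(t) = 1 $ on $ [0, 1] $ (an illustrative choice, not taken from the unit):

$$ \Big( \int_0^1 t \ dt \Big)^2 = \frac{1}{4} \ \le \ \frac{1}{3} = \Big( \int_0^1 t^2 \ dt \Big) \Big( \int_0^1 1 \ dt \Big) $$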

### Definitions

In an inner product space V, the vectors u,v are called **orthogonal** if $ \langle u, v \rangle = 0 $,

a vector u is called a **unit vector** if $ \lVert u \rVert = 1 $,

a subset S is called orthogonal if $ \langle u, v \rangle = 0$ for all distinct $ u,v \in S $,

and S is called **orthonormal** if S is orthogonal and $ \lVert u \rVert = 1, \forall \ u \in S $

### Properties

1. Every nonzero vector v in an inner product space can be scaled to a unit vector, the **normalized vector** : $ \frac{v}{\lVert v \rVert} $
2. An orthogonal set of nonzero vectors is linearly independent, whether the set is finite or infinite.

### Example

Note: product-to-sum identity  
$ \sin(a+b) = \sin(a) \cos(b) + \cos(a) \sin(b) $  
$ \sin(a-b) = \sin(a) \cos(b) - \cos(a) \sin(b) $  
$ \to 2 \times \sin(a) \cos(b) = \sin(a+b) + \sin(a-b) $  

In the inner product space of $ C([0, 2\pi]) $, the vectors: 

$ \Big \{ f(t) = \sin(3t), g(t) = \cos(2t) \Big \} $ are orthogonal, since,

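$$ \langle f, g \rangle = \int_0^{2\pi} \sin(3t) \cos(2t) \ dt = \frac{1}{2} \int_0^{2\pi} \big[ \sin(5t) + \sin(t) \big] \ dt = 0 $$

Note: $ \int_0^{2\pi} \sin(x) \ dx = 0 $ because, over one full period ($ 2 \pi $), the positive and negative parts of the sine integral cancel. The same holds for $ \sin(5t) $, which completes five full periods on $ [0, 2\pi] $.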

### Example

In the vector space of trigonometric polynomials

$ \mathcal{T}[0, 2 \pi] $ = 
Span $ \begin{Bmatrix} 1,& \cos t,& \sin t,& \cos 2t,& \sin 2t, & \cdots,& \cos nt,& \sin nt, \cdots \end{Bmatrix} $  
= Span S

S is orthogonal, since:

$$ \langle \cos nt, \sin mt \rangle = \int_0^{2\pi} \big[ \cos nt \times \sin mt \big] \ dt = 0, \forall n,m \ge 0 $$
$$ \langle \cos nt, \cos mt \rangle = \int_0^{2\pi} \big[ \cos nt \times \cos mt \big] \ dt = 0, \forall n \ne m $$
$$ \langle \sin nt, \sin mt \rangle = \int_0^{2\pi} \big[ \sin nt \times \sin mt \big] \ dt = 0, \forall n \ne m $$

Since S is an orthogonal set of nonzero vectors, S is linearly independent. It also spans $ \mathcal{T}[0, 2 \pi] $ by definition, so S is a basis of $ \mathcal{T}[0, 2 \pi] $.
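
The orthogonality relations for S can be checked numerically for the first few frequencies. The sketch below assumes an equally spaced Riemann sum over one full period, which is very accurate for low-frequency trigonometric integrands; the helper names `inner`, `cos_k`, and `sin_k` are illustrative.

```python
import math

def inner(f, g, n=1024):
    """Riemann-sum approximation of the integral of f(t) g(t) over [0, 2*pi]."""
    h = 2 * math.pi / n
    return h * sum(f(i * h) * g(i * h) for i in range(n))

def cos_k(k):
    return lambda t: math.cos(k * t)

def sin_k(k):
    return lambda t: math.sin(k * t)

# S truncated at frequency 3: 1, cos t, sin t, cos 2t, sin 2t, cos 3t, sin 3t
S = [cos_k(0)] + [f for k in range(1, 4) for f in (cos_k(k), sin_k(k))]

# Gram matrix of all pairwise inner products
gram = [[inner(f, g) for g in S] for f in S]

for row in gram:
    print(["%.3f" % x for x in row])
```

The off-diagonal entries come out as (numerically) zero. The diagonal entries are the squared norms: $ 2\pi $ for the constant function 1 and $ \pi $ for every other element, so S is orthogonal but not orthonormal.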

### Gram-Schmidt Process

Construct an orthogonal basis from a given basis (a linearly independent spanning set).

Let { $ u_1, u_2, \cdots, u_k $ } be a basis for an inner product space V. Define

$ v_1 = u_1 $,

$ v_2 = u_2 - \frac{ \langle u_2,v_1 \rangle }{\lVert v_1 \rVert^2} \vec{v_1} $

...

$ v_k = u_k - \frac{ \langle u_k,v_1 \rangle }{\lVert v_1 \rVert^2} \vec{v_1} - 
\frac{ \langle u_k,v_2 \rangle }{\lVert v_2 \rVert^2} \vec{v_2} - 
\cdots - 
\frac{ \langle u_k,v_{k-1} \rangle }{\lVert v_{k-1} \rVert^2} \vec{v_{k-1}} $

Then { $ v_1, v_2, \cdots, v_i $ } is an orthogonal set of nonzero vectors such that

Span { $ v_1, v_2, \cdots, v_i $ } = Span { $ u_1, u_2, \cdots, u_i $ }

for each $ i = 1, \cdots, k $. So { $ v_1, v_2, \cdots, v_k $ } is an orthogonal basis for V.
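
The process can be sketched in a few lines of code. This is a minimal illustration, assuming vectors in R<sup>3</sup> stored as lists and the dot product as the inner product; the function names and the sample basis are hypothetical, and exact fractions are used so that orthogonality can be tested without rounding error.

```python
from fractions import Fraction

def dot(u, v):
    """Dot product, used here as the inner product."""
    return sum(a * b for a, b in zip(u, v))

def gram_schmidt(basis, inner=dot):
    """Return orthogonal vectors v_1..v_k with the same span as u_1..u_k."""
    ortho = []
    for u in basis:
        v = list(u)
        for w in ortho:
            coeff = inner(u, w) / inner(w, w)    # <u_k, v_j> / ||v_j||^2
            v = [vi - coeff * wi for vi, wi in zip(v, w)]
        ortho.append(v)
    return ortho

basis = [
    [Fraction(1), Fraction(1), Fraction(0)],
    [Fraction(1), Fraction(0), Fraction(1)],
    [Fraction(0), Fraction(1), Fraction(1)],
]
vs = gram_schmidt(basis)
for v in vs:
    print([str(x) for x in v])
```

Passing a different `inner` function (for example an integral inner product on polynomials) reuses the same loop, since the process only depends on the inner product.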

### Corollary

Every finite-dimensional inner product space has an orthonormal basis.

### Example

$ \mathcal{P}_2 $ is an inner product space with the following inner product:

$$ \langle f,g \rangle = \int_{-1}^1 \big[ f(x) \ g(x) \big] \ dx $$

for all $ f,g \in \mathcal{P}_2 $. Find an orthogonal basis { $ v_1, v_2, v_3 $ } from the basis B = { $ 1, x, x^2 $ } = { $ u_1, u_2, u_3 $ }.

> By the Gram-Schmidt Process

$ v_1 = u_1 = 1 $

$$ v_2 
= u_2 - \frac{\langle u_2, v_1 \rangle}{\lVert v_1 \rVert^2} v_1 
= x - \frac{ \int_{-1}^1 [ x \times 1 ] \ dx }{ \int_{-1}^1  1^2 \ dx } \cdot 1
= x - 0 = x
$$

$$ v_3 
= u_3 - \frac{\langle u_3, v_1 \rangle}{\lVert v_1 \rVert^2} v_1 - \frac{\langle u_3, v_2 \rangle}{\lVert v_2 \rVert^2} v_2 
= x^2 - \frac{ \int_{-1}^1 [ x^2 \times 1 ] \ dx }{ \int_{-1}^1  1^2 \ dx } \cdot 1 - \frac{ \int_{-1}^1 [ x^2 \times x ] \ dx }{ \int_{-1}^1  x^2 \ dx } \cdot x
= x^2 - \frac{2/3}{2} - \frac{0}{2/3} x = x^2 - \frac{1}{3}
$$

So an orthogonal basis is $ \Big \{ 1, \ \ x, \ \ x^2 - \frac{1}{3} \Big \} $.

To find an orthonormal basis, divide each vector by its norm:

$$ \lVert v_1 \rVert = \sqrt{ \int_{-1}^1 \big[ 1 \times 1 \big] \ dx } = \sqrt{2} $$

$$ \lVert v_2 \rVert = \sqrt{ \int_{-1}^1 \big[ x \times x \big] \ dx } = \sqrt{\frac{2}{3}} $$

$$ \lVert v_3 \rVert = \sqrt{ \int_{-1}^1 \big( x^2 - \frac{1}{3} \big)^2 \ dx } = \sqrt{\frac{8}{45}} $$

So an orthonormal basis is:

$$ \begin{Bmatrix} \frac{1}{\sqrt{2}}, & \sqrt{\frac{3}{2}} x, & \sqrt{\frac{45}{8}} (x^2 - \frac{1}{3}) \end{Bmatrix} $$
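
The orthogonal basis and the squared norms in this example can be verified with exact arithmetic. The sketch below assumes polynomials are stored as coefficient lists `[c0, c1, c2, ...]` and uses the fact that $ \int_{-1}^1 x^n \ dx $ equals $ \frac{2}{n+1} $ for even n and 0 for odd n; the name `poly_inner` is illustrative.

```python
from fractions import Fraction

def poly_inner(p, q):
    """Exact integral over [-1, 1] of p(x) q(x) for coefficient lists p, q."""
    total = Fraction(0)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            if (i + j) % 2 == 0:     # odd powers integrate to 0 on [-1, 1]
                total += Fraction(a) * Fraction(b) * Fraction(2, i + j + 1)
    return total

v1 = [1]                        # 1
v2 = [0, 1]                     # x
v3 = [Fraction(-1, 3), 0, 1]    # x^2 - 1/3

# Pairwise inner products are all zero, so the set is orthogonal
print(poly_inner(v1, v2), poly_inner(v1, v3), poly_inner(v2, v3))

# Squared norms: 2, 2/3, 8/45
print(poly_inner(v1, v1), poly_inner(v2, v2), poly_inner(v3, v3))
```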

### Normalized Legendre Polynomials

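For $ \mathcal{P} $, the space of all polynomials, with the same inner product and the basis B = { $ 1, x, x^2, \cdots $ },  
the same procedure may be applied to obtain an orthonormal basis { $ p_0(x), p_1(x), p_2(x), \cdots $ },  
called the **Normalized Legendre Polynomials**.

The orthonormal basis found in the example above consists of the first three of them.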

### Proposition : Orthogonal Projection of v onto W

Suppose that V is an inner product space and W is a finite-dimensional subspace of V.  
For every v in V, there exist unique $ w \in W $ and $ z \in W^{\perp} $ such that v = w + z.

The vector w is called the orthogonal projection of v onto W, and we have

$$ w = \langle v, v_1 \rangle v_1 + \langle v, v_2 \rangle v_2 + \cdots + \langle v, v_n \rangle v_n $$

if { $ v_1, v_2, \cdots, v_n $ } is an orthonormal basis of W.
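
The projection formula can be illustrated in R<sup>3</sup> with the dot product. This is a small sketch under assumed data: the plane W, its orthonormal basis `onb`, and the vector `v` are made up for the example.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def project(v, onb, inner=dot):
    """Orthogonal projection of v onto Span(onb); onb must be orthonormal."""
    w = [0.0] * len(v)
    for b in onb:
        c = inner(v, b)                          # coefficient <v, v_i>
        w = [wi + c * bi for wi, bi in zip(w, b)]
    return w

s = 1 / math.sqrt(2)
onb = [[s, s, 0.0], [0.0, 0.0, 1.0]]   # orthonormal basis of a plane W in R^3
v = [3.0, 1.0, 2.0]

w = project(v, onb)                     # approximately [2, 2, 2]
z = [vi - wi for vi, wi in zip(v, w)]   # approximately [1, -1, 0]

print(w, z)
print([dot(z, b) for b in onb])         # z is orthogonal to every basis vector of W
```

Here v = w + z with w in W and z orthogonal to W, which is the decomposition stated in the Proposition.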

### Corollary

With the notation of the above Proposition, among all vectors in W, the vector closest to v is w.

### Least-Squares Approximation

Since closeness is measured by the distance,  
which involves the sum (integral) of the square of the difference vector (function),  
the closest vector is called the **Least-Squares Approximation**.

### Example

$ \mathcal{P}_2 $ with the inner product $ \langle f,g \rangle = \int_{-1}^1 f(x) g(x) \ dx, \forall f, g \in \mathcal{P}_2 $ is a finite-dimensional subspace of C([-1, 1]).  

For $ v = f(x) = \sqrt[3]{x} \in C([-1, 1]) $, the least-squares approximation by a polynomial of degree &le; 2 is  
the orthogonal projection of f onto $ \mathcal{P}_2 $.

Thus take the **orthonormal basis** { $ v_1, v_2, v_3 $ } = $ \begin{Bmatrix} \frac{1}{\sqrt{2}}, & \sqrt{\frac{3}{2}} x, & \sqrt{\frac{45}{8}} (x^2 - \frac{1}{3}) \end{Bmatrix} $

Note:  
even functions : $ v_1, v_3 $  
odd functions : $ v, v_2 $

The integral of an odd function over [-1, 1] is 0. The product $ v \cdot v_1 $ is an odd function, so $ \langle v, v_1 \rangle = 0 $. Likewise, $ \langle v, v_3 \rangle = 0 $.

Then we get

$ w = \langle v, v_1 \rangle v_1 + \langle v, v_2 \rangle v_2 + \langle v, v_3 \rangle v_3 $

$$ = \Big( \int_{-1}^1 \sqrt[3]{x} \cdot \sqrt{\frac{3}{2}} x \ dx \Big) \sqrt{\frac{3}{2}} x = \frac{3}{2} \Big( \int_{-1}^1 x^{4/3} \ dx \Big) x = \frac{3}{2} \cdot \frac{6}{7} x = \frac{9}{7} x $$
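
The result $ \frac{9}{7} x $ can be cross-checked numerically. The sketch below assumes a midpoint-rule approximation of the integral over [-1, 1]; the helper names `cbrt` and `inner` and the number of subintervals are illustrative choices.

```python
import math

def cbrt(x):
    """Real cube root, valid for negative x as well."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)

def inner(f, g, n=20000):
    """Midpoint-rule approximation of the integral of f(x) g(x) over [-1, 1]."""
    h = 2.0 / n
    total = 0.0
    for i in range(n):
        x = -1.0 + (i + 0.5) * h
        total += f(x) * g(x)
    return total * h

# Orthonormal basis of P_2 from the earlier example
basis = [
    lambda x: 1 / math.sqrt(2),
    lambda x: math.sqrt(3 / 2) * x,
    lambda x: math.sqrt(45 / 8) * (x * x - 1 / 3),
]

coeffs = [inner(cbrt, b) for b in basis]    # <v, v_1>, <v, v_2>, <v, v_3>
slope = coeffs[1] * math.sqrt(3 / 2)        # coefficient of x in the projection w

print(coeffs)
print(slope, 9 / 7)
```

The first and third coefficients are numerically zero, as predicted by the even/odd argument, and the slope agrees with $ \frac{9}{7} $ up to the quadrature error.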