# **4.4 Orthonormal Bases and Gram-Schmidt**

- The vectors $q_1, \dots, q_n$ are **orthogonal** when the dot product of every pair of distinct vectors is zero: $q_i^T q_j = 0$ whenever $i \neq j$.  
  Dividing each vector by its length turns it into a unit vector. Orthogonal unit vectors are **orthonormal**, and together they form an **orthonormal basis** for the space they span.

**Definition:** The vectors $q_1, \dots, q_n$ are orthonormal if  
$$
q_i^T q_j =
\begin{cases}
0 & \text{if } i \neq j \text{ (orthogonal)} \\
1 & \text{if } i = j \text{ (unit length)}
\end{cases}
$$

- A matrix with orthonormal columns is denoted $Q$.
- $Q$ is easy to work with because  
$$
Q^T Q = I
$$  
  The $(i, j)$ entry of $Q^T Q$ is $q_i^T q_j$, so this equation restates the definition of orthonormal columns. $Q$ does **not** have to be square.

- If $Q$ is square, $Q^T Q = I$ implies $Q^T = Q^{-1}$. A square matrix with orthonormal columns is called an **orthogonal matrix**.

- **Reflections and Rotations:**  
  Rotations, reflections, and permutations are all orthogonal matrices. Multiplication by any $Q$ with orthonormal columns preserves lengths and angles:  
  $$
  \|Qx\| = \|x\|, \quad (Qx)^T (Qy) = x^T y
  $$
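
As a quick numerical check of these properties, the sketch below builds a $2 \times 2$ rotation matrix and verifies $Q^T Q = I$, $\|Qx\| = \|x\|$, and $(Qx)^T(Qy) = x^T y$. The angle `theta`, the test vectors, and the helper names (`dot`, `norm`, `matvec`) are illustrative choices, not part of the text; plain Python lists are used so the example has no dependencies.

```python
import math

def dot(x, y):
    """Dot product x^T y of two vectors stored as lists."""
    return sum(a * b for a, b in zip(x, y))

def norm(x):
    """Euclidean length ||x||."""
    return math.sqrt(dot(x, x))

def matvec(M, x):
    """Multiply a matrix M (stored as a list of rows) by a vector x."""
    return [dot(row, x) for row in M]

theta = 0.7  # arbitrary angle, chosen only for illustration
Q = [[math.cos(theta), -math.sin(theta)],
     [math.sin(theta),  math.cos(theta)]]

# Columns q_1, q_2 of Q; entry (i, j) of Q^T Q is q_i . q_j
cols = [list(c) for c in zip(*Q)]
QtQ = [[dot(ci, cj) for cj in cols] for ci in cols]

x = [3.0, 4.0]
y = [-1.0, 2.0]
Qx, Qy = matvec(Q, x), matvec(Q, y)

print("Q^T Q =", QtQ)                       # identity, up to rounding
print("lengths:", norm(x), norm(Qx))        # equal, up to rounding
print("dot products:", dot(x, y), dot(Qx, Qy))
```

Any other angle should give the same agreement, since every rotation matrix has orthonormal columns.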

### **Projections Using Orthonormal Bases**

- When the columns of $Q$ are orthonormal, the least squares formulas for $Qx = b$ simplify because $Q^T Q = I$:  
$$
\hat{x} = Q^T b, \quad p = Q \hat{x}, \quad P = QQ^T
$$

- No matrices need to be inverted. The projection is a sum of one-dimensional projections, each requiring only a dot product:  
$$
p = q_1 (q_1^T b) + \dots + q_n (q_n^T b)
$$

- If $Q$ is square ($m=n$), then $QQ^T = I$, and the projection of $b$ onto the whole space is $b$ itself:  
$$
b = QQ^T b = q_1(q_1^T b) + \dots + q_n(q_n^T b)
$$

- This is the foundation for Fourier series and other transforms: vectors/functions are decomposed into perpendicular components and can be recombined.
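
The projection formulas above can be sketched in a few lines. The three orthonormal vectors and the vector `b` below are illustrative values chosen for this example, and `project` is a hypothetical helper name. Projecting onto two of the vectors gives a point in a plane; projecting onto all three rebuilds `b`.

```python
def dot(x, y):
    """Dot product x^T y of two vectors stored as lists."""
    return sum(a * b for a, b in zip(x, y))

def project(qs, b):
    """Project b onto the span of orthonormal vectors qs:
    p = q_1 (q_1^T b) + ... + q_n (q_n^T b)."""
    p = [0.0] * len(b)
    for q in qs:
        coeff = dot(q, b)  # one dot product per basis vector
        p = [p_k + coeff * q_k for p_k, q_k in zip(p, q)]
    return p

# Three orthonormal vectors in R^3 (illustrative choice)
q1 = [-1 / 3, 2 / 3, 2 / 3]
q2 = [2 / 3, -1 / 3, 2 / 3]
q3 = [2 / 3, 2 / 3, -1 / 3]
b = [0.0, 0.0, 1.0]

p_plane = project([q1, q2], b)        # projection onto the plane of q1, q2
b_rebuilt = project([q1, q2, q3], b)  # square Q: the projection is b itself
error = [b_k - p_k for b_k, p_k in zip(b, p_plane)]

print("p =", p_plane)
print("rebuilt b =", b_rebuilt)
print("error . q1 =", dot(error, q1), " error . q2 =", dot(error, q2))
```

For these values the projection onto the plane works out by hand to $(2/9, 2/9, 8/9)$, and the error $b - p$ is perpendicular to both $q_1$ and $q_2$.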

### **The Gram-Schmidt Process**

- Converts a set of independent vectors $a, b, c$ into orthonormal vectors $q_1, q_2, q_3$:
  1. $A = a$ (first vector)
  2. $B = b - \frac{A^T b}{A^T A} A$ (subtract projection onto $A$)
  3. $C = c - \frac{A^T c}{A^T A} A - \frac{B^T c}{B^T B} B$ (subtract projections onto $A$ and $B$)
- Normalize to get orthonormal vectors:  
$$
q_1 = \frac{A}{\|A\|}, \quad q_2 = \frac{B}{\|B\|}, \quad q_3 = \frac{C}{\|C\|}
$$
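
The steps above can be sketched as a short function. This is a minimal illustration of the procedure exactly as written (subtract from each new vector its projections onto the earlier orthogonal vectors, then normalize), not a numerically robust implementation. The function name `gram_schmidt` and the input vectors are assumptions made for the example.

```python
import math

def dot(x, y):
    """Dot product x^T y of two vectors stored as lists."""
    return sum(a * b for a, b in zip(x, y))

def gram_schmidt(vectors):
    """Return (orthogonal, orthonormal) lists built from independent vectors.

    orthogonal holds A, B, C, ...; orthonormal holds q_1, q_2, q_3, ...
    """
    orthogonal = []
    for v in vectors:
        w = list(v)
        for u in orthogonal:
            # Subtract the projection of v onto the earlier vector u
            coeff = dot(u, v) / dot(u, u)
            w = [w_k - coeff * u_k for w_k, u_k in zip(w, u)]
        orthogonal.append(w)
    # Normalize each orthogonal vector to unit length
    orthonormal = [[w_k / math.sqrt(dot(w, w)) for w_k in w] for w in orthogonal]
    return orthogonal, orthonormal

# Illustrative independent vectors a, b, c
a = [1.0, -1.0, 0.0]
b = [2.0, 0.0, -2.0]
c = [3.0, -3.0, 3.0]

(A, B, C), (q1, q2, q3) = gram_schmidt([a, b, c])
print("A, B, C =", A, B, C)
print("q1, q2, q3 =", q1, q2, q3)
```

For these inputs the orthogonal vectors work out by hand to $A = (1, -1, 0)$, $B = (1, 1, -2)$, $C = (1, 1, 1)$, so the $q$'s are those vectors divided by $\sqrt{2}$, $\sqrt{6}$, and $\sqrt{3}$.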

### **The Factorization $A = QR$**

- Start with $A = [a\ b\ c]$ and end with $Q = [q_1\ q_2\ q_3]$. There exists an upper triangular matrix $R$ such that:  
$$
A = QR, \quad
R =
\begin{bmatrix}
q_1^T a & q_1^T b & q_1^T c \\
0 & q_2^T b & q_2^T c \\
0 & 0 & q_3^T c
\end{bmatrix}
$$

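- Any $m \times n$ matrix $A$ with independent columns can be factored as $A = QR$, where $Q$ has orthonormal columns and $R$ is upper triangular with a positive diagonal. The diagonal entries of $R$ are the lengths $\|A\|, \|B\|, \|C\|$ from Gram-Schmidt.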

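- **Least Squares Application:** Substitute $A = QR$ into the normal equations and use $Q^T Q = I$:  
$$
A^T A \hat{x} = A^T b \quad \Rightarrow \quad R^T Q^T Q R \hat{x} = R^T Q^T b \quad \Rightarrow \quad R^T R \hat{x} = R^T Q^T b \quad \Rightarrow \quad R \hat{x} = Q^T b
$$  
  Since $R$ is upper triangular, $\hat{x} = R^{-1} Q^T b$ is computed by back substitution on $R \hat{x} = Q^T b$. This is fast and avoids forming $R^{-1}$ explicitly.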
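
A small sketch of the least squares route through $A = QR$: orthonormalize the columns of $A$, form $R$ from the dot products $q_i^T a_j$, and solve $R \hat{x} = Q^T b$ by back substitution. The helper names (`orthonormalize`, `back_substitute`) and the data (fitting a line through three points) are illustrative assumptions. The projections here are taken onto the already normalized $q$'s, which is algebraically the same as the formulas with $A^T A$ and $B^T B$ in the denominators.

```python
import math

def dot(x, y):
    """Dot product x^T y of two vectors stored as lists."""
    return sum(a * b for a, b in zip(x, y))

def orthonormalize(columns):
    """Gram-Schmidt on the columns of A, returning q_1, ..., q_n."""
    qs = []
    for a in columns:
        w = list(a)
        for q in qs:
            coeff = dot(q, a)  # q is a unit vector, so no division is needed
            w = [w_k - coeff * q_k for w_k, q_k in zip(w, q)]
        length = math.sqrt(dot(w, w))
        qs.append([w_k / length for w_k in w])
    return qs

def back_substitute(R, y):
    """Solve R x = y for upper triangular R, starting from the last row."""
    n = len(y)
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        known = sum(R[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (y[i] - known) / R[i][i]
    return x

# Columns of A for fitting C + D t at t = 0, 1, 2 (illustrative data)
columns = [[1.0, 1.0, 1.0], [0.0, 1.0, 2.0]]
b = [6.0, 0.0, 0.0]

qs = orthonormalize(columns)
n = len(columns)
# R[i][j] = q_i^T a_j on and above the diagonal, zero below it
R = [[dot(qs[i], columns[j]) if j >= i else 0.0 for j in range(n)]
     for i in range(n)]
Qtb = [dot(q, b) for q in qs]
x_hat = back_substitute(R, Qtb)

print("R =", R)
print("x_hat =", x_hat)
```

For this data the normal equations $A^T A \hat{x} = A^T b$ solved by hand give $\hat{x} = (5, -3)$, which the back substitution should reproduce up to rounding.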

**Key Ideas**

1. For orthonormal $q_1, \dots, q_n$ forming $Q$:  
   $$
   Q^T Q = I
   $$

2. If $Q$ is square (orthogonal), $Q^T = Q^{-1}$.

3. Length preservation: $\|Qx\| = \|x\|$.

4. Projection onto the column space of $Q$: $P = QQ^T$.

5. If $Q$ is square:  
$$
b = QQ^T b = q_1 (q_1^T b) + \dots + q_n (q_n^T b)
$$

6. Gram-Schmidt produces orthonormal vectors and the factorization $A = QR = (Q \text{ with orthonormal columns})(\text{upper triangular } R)$.
