## Singular Value Decomposition (SVD)

Notation

| Symbol        | Meaning                                                                             |
| ------------- | ----------------------------------------------------------------------------------- |
| $\delta_{ij}$ | Kronecker delta, that is $\delta_{ij}=1$ if $i=j$, and $\delta_{ij}=0$ if $i\neq j$ |


**Summary** (Case I: $m\ge n$) 

- For any $m$-by-$n$ matrix $A$ ($m\ge n$), we can choose 
  - orthonormal vectors $\left\{u_1, \ldots, u_m\right\}$ in $R^m$ (left singular vectors),
  - orthonormal vectors $\left\{v_1, \ldots, v_n\right\}$ in $R^n$ (right singular vectors), and
  - $s_1 \geq \cdots \geq s_{n} \geq 0$ (singular values) satisfying
$$
\begin{gathered}
A v_1=s_1 u_1 \\
A v_2=s_2 u_2 \\
\vdots \\
A v_n=s_n u_n .
\end{gathered}
$$


**Geometric Intuition** (Sauer (2017) p. 579)

The $v_i$'s form the basis of a rectangular coordinate system on which $A$ acts in a simple way: it produces the basis vectors of a new coordinate system, the $u_i$'s, with some stretching quantified by the scalars $s_i$. The image of the unit circle under $A$ is an ellipse, and the stretched basis vectors $s_i u_i$ are its semi-axes.

![SVD geometry](https://blogs.sas.com/content/iml/files/2017/08/svd1.png)

Figure: Rick Wicklin, SAS blog (Geometry of 2-by-2 SVD)
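
The picture can be checked numerically. The sketch below uses plain Python; the matrix `A` and all names are assumptions chosen for illustration, not taken from Sauer. It samples the unit circle, maps each point through $A$, and compares the longest and shortest image vectors with the singular values, computed here as square roots of the eigenvalues of $A^T A$ (the connection established in the Fundamentals section).

```python
from math import cos, sin, sqrt, pi, hypot

# Assumed example matrix (2-by-2), stored as rows
A = [[1.0, 0.3],
     [0.45, 1.2]]

def apply(M, x):
    """Matrix-vector product for a 2-by-2 matrix M and a 2-vector x."""
    return (M[0][0] * x[0] + M[0][1] * x[1],
            M[1][0] * x[0] + M[1][1] * x[1])

# Lengths of the images of 3600 points on the unit circle
n_points = 3600
lengths = [hypot(*apply(A, (cos(2 * pi * k / n_points), sin(2 * pi * k / n_points))))
           for k in range(n_points)]
longest, shortest = max(lengths), min(lengths)

# Singular values from the eigenvalues of the symmetric 2-by-2 matrix A^T A
a = A[0][0] ** 2 + A[1][0] ** 2            # (A^T A)[0][0]
b = A[0][0] * A[0][1] + A[1][0] * A[1][1]  # (A^T A)[0][1]
d = A[0][1] ** 2 + A[1][1] ** 2            # (A^T A)[1][1]
mean, radius = (a + d) / 2, sqrt(((a - d) / 2) ** 2 + b ** 2)
s1, s2 = sqrt(mean + radius), sqrt(mean - radius)

print(f"longest  image vector {longest:.4f}  vs  s1 = {s1:.4f}")
print(f"shortest image vector {shortest:.4f}  vs  s2 = {s2:.4f}")
```

The sampled extremes agree with $s_1$ and $s_2$ up to the sampling resolution, which is the sense in which the $s_i u_i$ are the semi-axes of the ellipse.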

**Example** (Building SVD easy case; Sauer (2017) p. 580)

(Step 1)

Find the singular values and singular vectors for $A=\begin{bmatrix}3 & 0 \\ 0 & 1/2 \end{bmatrix}$. 


###### hide/show


$$
\begin{aligned} & A\left[\begin{array}{l}1 \\ 0\end{array}\right]=3\left[\begin{array}{l}1 \\ 0\end{array}\right] \\ & A\left[\begin{array}{l}0 \\ 1\end{array}\right]=\frac{1}{2}\left[\begin{array}{l}0 \\ 1\end{array}\right]\end{aligned}
$$


(Step 2)

Find the singular values and singular vectors for $A=\begin{bmatrix}0 & -1/2 \\ 3 & 0 \\ 0 & 0 \end{bmatrix}$.


###### hide/show



$$
\begin{aligned}
& A\left[\begin{array}{l}
1 \\
0
\end{array}\right]=3\left[\begin{array}{l}
0 \\
1 \\
0
\end{array}\right] \\
& A\left[\begin{array}{l}
0 \\
1
\end{array}\right]=\frac{1}{2}\left[\begin{array}{r}
-1 \\
0 \\
0
\end{array}\right]
\end{aligned}
$$

Even for this simple matrix, guessing the singular vectors is not that easy, especially since the left singular vectors are required to be orthonormal.

(Step 3) 

Can you do the same for $A=\begin{bmatrix}2 & -1/2 \\ 3 & 1 \\ -2 & 5 \end{bmatrix}$?

We need a more systematic approach.
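
For a matrix like the one in Step 3, a library routine gives the answer directly. This is a minimal sketch, assuming NumPy is installed (NumPy and the variable names are choices made here, not part of Sauer's text). It computes a full SVD and checks the defining relations $A v_i = s_i u_i$.

```python
import numpy as np

A = np.array([[2.0, -0.5],
              [3.0, 1.0],
              [-2.0, 5.0]])

# full_matrices=True returns U (3x3), the singular values s (length 2), and V^T (2x2)
U, s, Vt = np.linalg.svd(A, full_matrices=True)
V = Vt.T

# Residuals of the defining relations A v_i = s_i u_i for i = 1, 2
residuals = [np.linalg.norm(A @ V[:, i] - s[i] * U[:, i]) for i in range(2)]
print("singular values:", s)
print("residuals:", residuals)
```

The residuals should be at round-off level. The sections below explain where such a factorization comes from.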


### Fundamentals of SVD

**Theorem** (Spectral theorem for real symmetric matrix; Rephrase of Horn and Johnson (2013) Matrix analysis 2ed. Theorem 4.1.5. p. 229)

If $A$ is a real symmetric $n$-by-$n$ matrix, then there exists an orthonormal basis of $R^n$ consisting of eigenvectors of $A$. Each eigenvalue of $A$ is real.


**Lemma** 

Let $A$ be an $m \times n$ matrix. The eigenvalues of $A^T A$ are nonnegative.



Proof

Since $A^T A$ is symmetric, its eigenvalues are real. Let $v$ be a unit eigenvector of $A^T A$ with eigenvalue $\lambda$, that is, $A^T A v=\lambda v$. Then
$$
0 \leq\|A v\|^2=v^T A^T A v=\lambda v^T v=\lambda .
$$

**Theorem** (Sauer (2017) p. 581)

Let $A$ be an $m \times n$ matrix where $m \geq n$. Then there exist two orthonormal bases $\left\{v_1, \ldots, v_n\right\}$ of $R^n$, and $\left\{u_1, \ldots, u_m\right\}$ of $R^m$, and real numbers $s_1 \geq \cdots \geq s_n \geq 0$ such that $A v_i=s_i u_i$ for $1 \leq i \leq n$. The columns of $V=\left[v_1|\ldots| v_n\right]$, the right singular vectors, are orthonormal eigenvectors of $A^T A$; and the columns of $U=\left[u_1|\ldots| u_m\right]$, the left singular vectors, are orthonormal eigenvectors of $A A^T$. In matrix form, $AV=US$, that is, $A=USV^T$, where $S$ is the $m \times n$ matrix with diagonal entries $S_{ii}=s_i$ and zeros elsewhere.

Constructive version (Human-friendly; Sauer (2017) p. 581)

1. $s_i$'s (singular values): Find the eigenvalues of $A^T A$ ($n$-by-$n$), which are nonnegative by the Lemma, and list them in decreasing order $s_1^2 \ge s_2^2 \ge \cdots \ge s_n^2 \ge 0$. Each $s_i$ is the nonnegative square root.
1. $v_i$'s (right singular vectors): Take corresponding orthonormal eigenvectors $v_i$ ($i=1,2,\cdots, n$).
1. $u_i$'s (left singular vectors): If $s_i \neq 0$, define $u_i$ by the equation $s_i u_i=A v_i$. Choose each remaining $u_i$ as an arbitrary unit vector orthogonal to $u_1, \ldots, u_{i-1}$, until $u_1, \ldots, u_m$ are all defined.
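
The three steps above can be sketched in NumPy as follows. This is an illustration under stated assumptions, not a production algorithm: the function name `svd_via_eig`, the relative tolerance `rel_tol`, and the choice of standard basis vectors as Gram-Schmidt candidates are all introduced here for the example.

```python
import numpy as np

def svd_via_eig(A, rel_tol=1e-6):
    """Full SVD A = U S V^T of an m-by-n array (m >= n) via the eigenpairs of A^T A."""
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    # Steps 1-2: eigenpairs of the symmetric matrix A^T A, reordered to be decreasing
    eigvals, V = np.linalg.eigh(A.T @ A)
    order = np.argsort(eigvals)[::-1]
    eigvals, V = eigvals[order], V[:, order]
    s = np.sqrt(np.clip(eigvals, 0.0, None))   # clip tiny negative round-off
    s[s <= rel_tol * s[0]] = 0.0               # treat relatively tiny values as zero
    r = int(np.count_nonzero(s))
    # Step 3a: u_i = A v_i / s_i for the nonzero singular values
    U = np.zeros((m, m))
    U[:, :r] = (A @ V[:, :r]) / s[:r]
    # Step 3b: complete U by Gram-Schmidt on the standard basis vectors e_1, ..., e_m,
    # skipping candidates that are (numerically) in the span of the columns so far
    k = r
    for j in range(m):
        if k == m:
            break
        w = np.zeros(m)
        w[j] = 1.0
        w -= U[:, :k] @ (U[:, :k].T @ w)
        norm = np.linalg.norm(w)
        if norm > 1e-8:
            U[:, k] = w / norm
            k += 1
    S = np.zeros((m, n))
    S[:n, :n] = np.diag(s)
    return U, S, V

# Usage on a 4-by-2 matrix
A = np.array([[3.0, 3.0], [-3.0, -3.0], [-1.0, 1.0], [1.0, -1.0]])
U, S, V = svd_via_eig(A)
print(np.diag(S))
print(np.allclose(U @ S @ V.T, A))
```

Forming $A^T A$ squares the condition number of $A$, so this route loses accuracy for ill-conditioned matrices. It mirrors the hand computation rather than what numerical libraries do internally.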

**Remark** 

- The SVD is not unique. 
  - Replacing $v_1$ by $-v_1$ and $u_1$ by $-u_1$ does not change the equality, but changes the matrices $U$ and $V$.
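
The sign ambiguity is easy to see numerically. A small sketch, assuming NumPy; the example matrix and the names `U_flipped` and `Vt_flipped` are introduced here for illustration.

```python
import numpy as np

A = np.array([[2.0, -0.5],
              [3.0, 1.0],
              [-2.0, 5.0]])
U, s, Vt = np.linalg.svd(A)          # full SVD: U is 3x3, Vt is 2x2
S = np.zeros(A.shape)
S[:2, :2] = np.diag(s)

# Flip the signs of u_1 (column 0 of U) and v_1 (row 0 of V^T) together
U_flipped, Vt_flipped = U.copy(), Vt.copy()
U_flipped[:, 0] *= -1
Vt_flipped[0, :] *= -1

same_product = np.allclose(U_flipped @ S @ Vt_flipped, A)
different_factors = not np.allclose(U_flipped, U)
print(same_product, different_factors)
```

For this reason, two correct programs (or a program and a hand computation) may report singular vectors that differ by sign.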

**Example** (Sauer (2017) p. 581)

Find the singular value decomposition of the $4 \times 2$ matrix
$$
A=\left[\begin{array}{rr}
3 & 3 \\
-3 & -3 \\
-1 & 1 \\
1 & -1
\end{array}\right] .
$$

0. Preliminary

$$
A^T A=\left[\begin{array}{ll}
20 & 16 \\
16 & 20
\end{array}\right]
$$

1. Eigenvectors and eigenvalues 

$$
v_1=\begin{bmatrix}1 / \sqrt{2} \\ 1 / \sqrt{2}\end{bmatrix}, 
\quad 
v_2=\begin{bmatrix}1 / \sqrt{2} \\ -1 / \sqrt{2}\end{bmatrix},
\quad
\begin{array}{l}
s_1^2=36 \\ 
s_2^2=4
\end{array}
$$



2. Singular values

$$
\begin{array}{l}
s_1=6 \\ 
s_2=2
\end{array}
$$

3. Right singular vectors

$v_1, v_2$ (same as eigenvectors of $A^T A$)

4. Left singular vectors

From 

$$
6 u_1=A v_1=\left[\begin{array}{r}
3 \sqrt{2} \\
-3 \sqrt{2} \\
0 \\
0
\end{array}\right] \quad 2 u_2=A v_2=\left[\begin{array}{r}
0 \\
0 \\
-\sqrt{2} \\
\sqrt{2}
\end{array}\right]
$$

we have

$$
u_1=\left[\begin{array}{r}
\frac{1}{\sqrt{2}} \\
-\frac{1}{\sqrt{2}} \\
0 \\
0
\end{array}\right] \quad u_2=\left[\begin{array}{r}
0 \\
0 \\
-\frac{1}{\sqrt{2}} \\
\frac{1}{\sqrt{2}}
\end{array}\right] .
$$

For $i = 3, 4$, choose

$$
u_3=\left[\begin{array}{c}
\frac{1}{\sqrt{2}} \\
\frac{1}{\sqrt{2}} \\
0 \\
0
\end{array}\right] \quad u_4=\left[\begin{array}{c}
0 \\
0 \\
\frac{1}{\sqrt{2}} \\
\frac{1}{\sqrt{2}}
\end{array}\right]
$$

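If such vectors are not easy to guess, we can apply Gram-Schmidt to a linearly independent set that extends $\{u_1, u_2\}$ by standard basis vectors, for example $\{u_1, u_2, e_1, e_3\}$, where $e_i = [\delta_{ij}]_{1\le j \le 4}^T$ and $\delta_{ij}$ is the Kronecker delta. (The set $\{u_1, u_2, e_3, e_4\}$ would not work here, because $u_2$ lies in the span of $e_3$ and $e_4$.)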



5. SVD

$$
A=\left[\begin{array}{rr}
3 & 3 \\
-3 & -3 \\
-1 & 1 \\
1 & -1
\end{array}\right]=U S V^T=\left[\begin{array}{rrrr}
\frac{1}{\sqrt{2}} & 0 & \frac{1}{\sqrt{2}} & 0 \\
-\frac{1}{\sqrt{2}} & 0 & \frac{1}{\sqrt{2}} & 0 \\
0 & -\frac{1}{\sqrt{2}} & 0 & \frac{1}{\sqrt{2}} \\
0 & \frac{1}{\sqrt{2}} & 0 & \frac{1}{\sqrt{2}}
\end{array}\right]\left[\begin{array}{ll}
6 & 0 \\
0 & 2 \\
0 & 0 \\
0 & 0
\end{array}\right]\left[\begin{array}{cc}
\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \\
\frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}}
\end{array}\right] .
$$
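
A hand-computed factorization is worth double-checking. The sketch below uses plain Python; the helper names `matmul`, `transpose`, and `max_abs_diff` are introduced here. It multiplies the three factors above and also confirms that the columns of $U$ are orthonormal.

```python
from math import sqrt

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def transpose(X):
    return [list(col) for col in zip(*X)]

def max_abs_diff(X, Y):
    """Largest entrywise difference between two matrices of the same size."""
    return max(abs(x - y) for xrow, yrow in zip(X, Y) for x, y in zip(xrow, yrow))

h = 1 / sqrt(2)
A = [[3, 3], [-3, -3], [-1, 1], [1, -1]]
U = [[h, 0, h, 0],
     [-h, 0, h, 0],
     [0, -h, 0, h],
     [0, h, 0, h]]
S = [[6, 0], [0, 2], [0, 0], [0, 0]]
V = [[h, h], [h, -h]]
I4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

reconstruction_error = max_abs_diff(matmul(matmul(U, S), transpose(V)), A)
orthogonality_error = max_abs_diff(matmul(transpose(U), U), I4)
print(reconstruction_error, orthogonality_error)
```

Both numbers should be at round-off level, confirming $A = USV^T$ and $U^T U = I$.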

**Reduced/Economic SVD**

- The lower block of zeros in $S$ and the corresponding left singular vectors (here $u_3$ and $u_4$) do not contribute to $A$.
- Removing them gives the *reduced SVD* or *economic SVD*.


Reduced SVD

$$
A=\left[\begin{array}{rr}
3 & 3 \\
-3 & -3 \\
-1 & 1 \\
1 & -1
\end{array}\right]=\hat U \hat S V^T=\left[\begin{array}{rr}
\frac{1}{\sqrt{2}} & 0 \\
-\frac{1}{\sqrt{2}} & 0 \\
0 & -\frac{1}{\sqrt{2}}\\
0 & \frac{1}{\sqrt{2}} 
\end{array}\right]\left[\begin{array}{ll}
6 & 0 \\
0 & 2
\end{array}\right]\left[\begin{array}{cc}
\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \\
\frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}}
\end{array}\right] .
$$
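
In NumPy, `np.linalg.svd` returns either form depending on its `full_matrices` flag. A minimal sketch on the same matrix, assuming NumPy is available (variable names are choices made here):

```python
import numpy as np

A = np.array([[3.0, 3.0], [-3.0, -3.0], [-1.0, 1.0], [1.0, -1.0]])

U_full, s, Vt = np.linalg.svd(A, full_matrices=True)       # U is 4x4
U_hat, s_hat, Vt_hat = np.linalg.svd(A, full_matrices=False)  # U_hat is 4x2

print(U_full.shape, U_hat.shape)
# The reduced factors alone reproduce A
A_rebuilt = U_hat @ np.diag(s_hat) @ Vt_hat
print(np.allclose(A_rebuilt, A))
```

The singular vectors returned by the library may differ in sign from the hand computation, which is consistent with the non-uniqueness of the SVD.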

**Summary** (Case II: $m\le n$)

- Find the SVD of $A^T$, which has at least as many rows as columns, to get $A^T=U S V^T$. 
- Then, $A=\left(U S V^T\right)^T=V S^T U^T$ is the SVD of $A$. [Sauer (2017) p. 582]

**Example** (Sauer (2017) p. 583)

Find the singular value decomposition of the $2 \times 3$ matrix
$$
A=\left[\begin{array}{rrr}
-1 & 3 & 7 \\
7 & 4 & 1
\end{array}\right] .
$$



Plan: Find the SVD of $A^T$ and then transpose the result. 

0. Preliminary

$$
A A^T=\left[\begin{array}{ll}
59 & 12 \\
12 & 66
\end{array}\right],
$$

1. Eigenvectors and eigenvalues 

$$
v_1=\begin{bmatrix}3 / 5 \\ 4 / 5\end{bmatrix}, 
\quad 
v_2=\begin{bmatrix}-4 / 5 \\ 3 / 5\end{bmatrix},
\quad
\begin{array}{l}
s_1^2=75 \\ 
s_2^2=50
\end{array}
$$


2. Singular values

$$
\begin{array}{l}
s_1=5 \sqrt{3} \\ 
s_2=5 \sqrt{2}
\end{array}
$$

3. Right singular vectors of $A^T$

$v_1, v_2$ (same as eigenvectors of $AA^T$)

4. Left singular vectors of $A^T$

From 

$$
5 \sqrt{3} u_1=A^T v_1=\left[\begin{array}{l}
5 \\
5 \\
5
\end{array}\right] \quad 5 \sqrt{2} u_2=A^T v_2=\left[\begin{array}{r}
5 \\
0 \\
-5
\end{array}\right]
$$

we have

$$
u_1=\left[\begin{array}{c}
\frac{1}{\sqrt{3}} \\
\frac{1}{\sqrt{3}} \\
\frac{1}{\sqrt{3}}
\end{array}\right] \quad u_2=\left[\begin{array}{c}
\frac{1}{\sqrt{2}} \\
0 \\
-\frac{1}{\sqrt{2}}
\end{array}\right] .
$$

For $i = 3$, choose

$$
u_3=\left[\begin{array}{c} \frac{1}{\sqrt{6}} \\ -\frac{2}{\sqrt{6}} \\ \frac{1}{\sqrt{6}}
\end{array}\right]
$$ 


5. SVD of $A^T$

$$
A^T=\left[\begin{array}{rr}
-1 & 7 \\
3 & 4 \\
7 & 1
\end{array}\right]=\tilde U \tilde S \tilde V^T=\left[\begin{array}{rrr}
\frac{1}{\sqrt{3}} & \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{6}} \\
\frac{1}{\sqrt{3}} & 0 & -\frac{2}{\sqrt{6}} \\
\frac{1}{\sqrt{3}} & -\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{6}}
\end{array}\right]\left[\begin{array}{rr}
5 \sqrt{3} & 0 \\
0 & 5 \sqrt{2} \\
0 & 0
\end{array}\right]\left[\begin{array}{rr}
3 / 5 & 4 / 5 \\
-4 / 5 & 3 / 5
\end{array}\right],
$$

6. SVD of $A$ is

$$
A= \left[\begin{array}{rrr}
-1 & 3 & 7 \\
7 & 4 & 1
\end{array}\right]
=USV^T
=\left[\begin{array}{rr}
3 / 5 & -4 / 5 \\
4 / 5 & 3 / 5
\end{array}\right]\left[\begin{array}{rrr}
5 \sqrt{3} & 0 & 0 \\
0 & 5 \sqrt{2} & 0
\end{array}\right]\left[\begin{array}{rrr}
\frac{1}{\sqrt{3}} & \frac{1}{\sqrt{3}} & \frac{1}{\sqrt{3}} \\
\frac{1}{\sqrt{2}} & 0 & -\frac{1}{\sqrt{2}} \\
\frac{1}{\sqrt{6}} & -\frac{2}{\sqrt{6}} & \frac{1}{\sqrt{6}}
\end{array}\right] .
$$

Here, we note $USV^T=\tilde V \tilde S^T \tilde U^T$ with equality holding factor by factor: $U=\tilde V$, $S=\tilde S^T$, and $V=\tilde U$.
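
The transposition step can be verified from the factors $\tilde U$, $\tilde S$, $\tilde V$ of step 5. A sketch assuming NumPy; the names `U_tilde`, `S_tilde`, and `V_tilde` are introduced here, and `V_tilde` holds $v_1, v_2$ as columns.

```python
import numpy as np

r2, r3, r6 = np.sqrt(2), np.sqrt(3), np.sqrt(6)
A = np.array([[-1.0, 3.0, 7.0],
              [7.0, 4.0, 1.0]])

# Factors of A^T from step 5 (columns of U_tilde are u_1, u_2, u_3)
U_tilde = np.array([[1 / r3, 1 / r2, 1 / r6],
                    [1 / r3, 0.0, -2 / r6],
                    [1 / r3, -1 / r2, 1 / r6]])
S_tilde = np.array([[5 * r3, 0.0],
                    [0.0, 5 * r2],
                    [0.0, 0.0]])
V_tilde = np.array([[3 / 5, -4 / 5],
                    [4 / 5, 3 / 5]])

# SVD of A^T, then transpose factor by factor to get the SVD of A
ok_transpose = np.allclose(U_tilde @ S_tilde @ V_tilde.T, A.T)
U, S, V = V_tilde, S_tilde.T, U_tilde
ok_original = np.allclose(U @ S @ V.T, A)
print(ok_transpose, ok_original)
print(U)
```

The printed `U` is $\tilde V$ itself (columns $v_1, v_2$), not $\tilde V^T$.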


7. Reduced SVD of $A$

$$
A=\left[\begin{array}{rrr}
-1 & 3 & 7 \\
7 & 4 & 1
\end{array}\right]
=U\hat S \hat V^T 
=\left[\begin{array}{rr}
3 / 5 & -4 / 5 \\
4 / 5 & 3 / 5
\end{array}\right]\left[\begin{array}{rr}
5 \sqrt{3} & 0 \\
0 & 5 \sqrt{2}
\end{array}\right]\left[\begin{array}{rrr}
\frac{1}{\sqrt{3}} & \frac{1}{\sqrt{3}} & \frac{1}{\sqrt{3}} \\
\frac{1}{\sqrt{2}} & 0 & -\frac{1}{\sqrt{2}}
\end{array}\right] .
$$

**Remark** (Sizes of matrices; Sauer (2017) p. 582)

- SVD of $m$-by-$n$ matrix $A=USV^T$
  - $S$ has the same size as $A$.
- Reduced SVD of $m$-by-$n$ matrix $A=\hat U \hat S V^T$ with $m\ge n$
  - $\hat U$ has the same size as $A$.
- Reduced SVD of $m$-by-$n$ matrix $A=U \hat S \hat V^T$ with $m\le n$
  - $\hat V^T$ has the same size as $A$.
  - Warning: Here, $A=U \hat S \hat V^T$ is a reduced SVD of $A$ itself, not of the $A^T$ used in the intermediate steps. 

### Properties of SVD

**Property 1** (Sauer (2017) p. 585)

The rank of the matrix $A=U S V^T$ is the number of nonzero entries in $S$.



Proof. Since $U$ and $V^T$ are invertible matrices, $\operatorname{rank}(A)=\operatorname{rank}(S)$, and the latter is the number of nonzero diagonal entries.
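
In floating-point arithmetic, "nonzero" has to be read as "above a tolerance". A sketch assuming NumPy; the example matrix and the relative tolerance `1e-10` are assumptions made here.

```python
import numpy as np

# Assumed example: the second column is twice the first, so the rank is 1
A = np.array([[1.0, 2.0],
              [2.0, 4.0],
              [3.0, 6.0]])
s = np.linalg.svd(A, compute_uv=False)   # singular values only, in decreasing order

# Count singular values that are not negligible relative to the largest one
tol = 1e-10 * s[0]
rank = int(np.sum(s > tol))
print(s, rank)
```

The second computed singular value is typically a tiny round-off number rather than an exact zero, which is why the threshold is needed.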



**Property 2** (Sauer (2017) p. 585)

If $A$ is an $n \times n$ matrix, $|\operatorname{det}(A)|=s_1 \cdots s_n$.

Proof. Since $U^T U=I$ and $V^T V=I$, and the determinant of a product equals the product of the determinants, $\operatorname{det}(U)$ and $\operatorname{det}(V^T)$ are each $1$ or $-1$. Property 2 follows from the factorization $A=U S V^T$ together with $\operatorname{det}(S)=s_1 \cdots s_n$.
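
A quick numerical check of Property 2, assuming NumPy; the $3 \times 3$ matrix is an assumed example whose determinant is $25$ by cofactor expansion.

```python
import numpy as np

A = np.array([[2.0, 0.0, 1.0],
              [1.0, 3.0, 0.0],
              [0.0, 1.0, 4.0]])
s = np.linalg.svd(A, compute_uv=False)

product_of_singular_values = np.prod(s)
abs_det = abs(np.linalg.det(A))
print(product_of_singular_values, abs_det)
```

The two printed values should agree up to round-off.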


**Property 3** (Sauer (2017) p. 585)

If $A$ is an invertible $m \times m$ matrix, then $A^{-1}=V S^{-1} U^T$.

Proof. By Property 1, $S$ is invertible, meaning all $s_i>0$. Property 3 now follows from the facts that $\left(A_1 A_2 A_3\right)^{-1}=A_3^{-1} A_2^{-1} A_1^{-1}$ for invertible matrices $A_1, A_2, A_3$, and that $U$ and $V$ are orthogonal, so $U^{-1}=U^T$ and $(V^T)^{-1}=V$.

**Remark**

- Property 3 says that obtaining $A^{-1}$ is simple once the SVD of $A$ is known.
  - The orthogonal factors are just transposed, and $S^{-1}=\mathrm{diag}(s_i^{-1})$.
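
Property 3 translates into a few lines of NumPy. This is a sketch on an assumed invertible matrix; `A_inv` is a name introduced here.

```python
import numpy as np

A = np.array([[2.0, 0.0, 1.0],
              [1.0, 3.0, 0.0],
              [0.0, 1.0, 4.0]])
U, s, Vt = np.linalg.svd(A)

# A^{-1} = V S^{-1} U^T: transpose the orthogonal factors, invert the diagonal
A_inv = Vt.T @ np.diag(1.0 / s) @ U.T
print(np.allclose(A_inv @ A, np.eye(3)))
```

No linear system is solved here. The only divisions are by the singular values, which is also where trouble appears if some $s_i$ is close to zero.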

**Property 4** (Sauer (2017) p. 586)

The $m \times n$ matrix $A$ can be written as the sum of rank-one matrices

$$
A=\sum_{i=1}^r s_i u_i v_i^T,
$$

where $r$ is the rank of $A$, and $u_i$ and $v_i$ are the $i$th columns of $U$ and $V$, respectively.

Proof. (sketch)

Given $A=USV^T$, 

1. Split $S$ into a sum of $r$ matrices, each with a single nonzero entry $s_i$ in position $(i, i)$.
2. Expand the result and carry out block multiplication.

**Remark**

- Each summand $s_i u_i v_i^T$ in Property 4 is a *rank-one* matrix.
  - Each of its columns is a scalar multiple of $u_i$.
  - If you haven't, write it out.
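
Writing it out can also be done numerically. The sketch below assumes NumPy and uses an assumed $2 \times 3$ example; it builds each summand with `np.outer`, checks that it has rank one, and adds the summands back up.

```python
import numpy as np

A = np.array([[-1.0, 3.0, 7.0],
              [7.0, 4.0, 1.0]])
U, s, Vt = np.linalg.svd(A)              # U is 2x2, Vt is 3x3
r = int(np.sum(s > 1e-10 * s[0]))        # numerical rank (assumed tolerance)

# One rank-one matrix s_i * u_i * v_i^T per nonzero singular value
terms = [s[i] * np.outer(U[:, i], Vt[i, :]) for i in range(r)]
term_ranks = [int(np.linalg.matrix_rank(T, tol=1e-10)) for T in terms]
A_sum = sum(terms)

print(term_ranks)
print(np.allclose(A_sum, A))
```

Row `i` of `Vt` is $v_i^T$, so `np.outer(U[:, i], Vt[i, :])` is exactly $u_i v_i^T$.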

### Applications of SVD

#### Dimension reduction

#### Low-rank approximation

### Penrose Pseudo-inverse

### Appendix

#### Raw citations

**Theorem** (Horn and Johnson (2013) Matrix analysis 2ed. Theorem 4.1.5. p. 229) 

A matrix $A \in M_n$ is Hermitian if and only if there is a unitary $U \in M_n$ and a real diagonal $\Lambda \in M_n$ such that $A=U \Lambda U^*$, where $M_n$ is the set of $n$-by-$n$ complex matrices. Moreover, $A$ is real and Hermitian (that is, real symmetric) if and only if there is a real orthogonal $P \in M_n$ and a real diagonal $\Lambda \in M_n$ such that $A=P \Lambda P^T$.

**Remark**

- Observe the subtlety of the statement: If $A$ is symmetric as a complex matrix, then the conclusion is different. (See e.g., [Wikipedia - Complex symmetric matrices](https://en.wikipedia.org/wiki/Symmetric_matrix#Complex_symmetric_matrices))

#### Additional examples

**Example** (Visualization of SVD)

- $x=\left[\begin{array}{cccc}-10 & -10 & 20 & 20 \\ -10 & 20 & 20 & -10\end{array}\right]$ 
- $A=\left[\begin{array}{cc}1 & 0.3 \\ 0.45 & 1.2\end{array}\right]$.
- $A=USV^T = \begin{bmatrix} -0.5819 & -0.8133 \\ -0.8133 & 0.5819 \end{bmatrix} \begin{bmatrix} 1.4907 & 0 \\ 0 & 0.7144 \end{bmatrix} \begin{bmatrix} -0.6359 & -0.7718 \\ -0.7718 & 0.6359 \end{bmatrix}$.

Example and figures: Alyssa Quek ([SVD visualization](https://alyssaq.github.io/2015/singular-value-decomposition-visualisation/))

- In the main text, this example is replaced by the picture of a circle transformed into an ellipse.
  - That choice favors simplicity and focuses only on the intuition.
  - This example is kept in the appendix because it lets you track numerical values through each stage of the transformation.

| | |
|---|---|
| $$Ax$$ <br> ![Figure 1](https://alyssaq.github.io/blog/images/eigens-transformation_matrix.png) | | 
| $$V^Tx$$ <br> ![Figure 2](https://alyssaq.github.io/blog/images/svd_Vx.png) | $$SV^Tx$$ <br> ![Figure 3](https://alyssaq.github.io/blog/images/svd_SVx.png) | 
| $$USV^Tx$$ <br> ![Figure 4](https://alyssaq.github.io/blog/images/svd_USVx.png) | |
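
The stages in the figures can be reproduced numerically. A sketch assuming NumPy, using the four-decimal factors printed above, so the final stage matches $Ax$ only up to that rounding; the names `stage1` to `stage3` are introduced here.

```python
import numpy as np

# Corners of a square, one point per column
x = np.array([[-10.0, -10.0, 20.0, 20.0],
              [-10.0, 20.0, 20.0, -10.0]])
A = np.array([[1.0, 0.3],
              [0.45, 1.2]])

# Factors rounded to four decimals, as printed above
U = np.array([[-0.5819, -0.8133], [-0.8133, 0.5819]])
S = np.diag([1.4907, 0.7144])
Vt = np.array([[-0.6359, -0.7718], [-0.7718, 0.6359]])

stage1 = Vt @ x          # V^T x: orthogonal change of coordinates
stage2 = S @ stage1      # S V^T x: stretch along the coordinate axes
stage3 = U @ stage2      # U S V^T x: second orthogonal transformation

print(np.round(stage3, 2))
print(A @ x)
```

Comparing the two printed arrays shows that applying the three factors in sequence has the same effect as applying $A$ once.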
