# Lecture 30: Singular Value Decomposition

Reference    
Lecture video: https://www.youtube.com/watch?v=TX_vooSnhm8             
Chinese note: https://nbviewer.jupyter.org/github/zlotus/notes-linear-algebra/blob/master/chapter30.ipynb 

This lecture covers the singular value decomposition (SVD): $A=U \Sigma V^{T}$, where $U$ and $V$ are orthogonal matrices and $\Sigma$ is a diagonal matrix. Unlike the decompositions we have seen before, the two orthogonal matrices are usually different, and the decomposition exists for any matrix. 
* A symmetric positive definite matrix can be decomposed as $A=Q \Lambda Q^{T}$. This is a special case of the SVD, with $U=V=Q$. 
* A diagonalizable matrix can be decomposed as $A=S \Lambda S^{-1}$, but $S$ is generally not an orthogonal matrix. 

What we want is an orthonormal basis $v_{1}, v_{2}, \cdots, v_{r}$ of the row space of $A$ that $A$ maps to multiples of an orthonormal basis $u_{1}, u_{2}, \cdots, u_{r}$ of the column space of $A$. 

$$A\left[\begin{array}{cccc}v_{1} & v_{2} & \cdots & v_{r}\end{array}\right]=\left[\begin{array}{cccc}\sigma_{1} u_{1} & \sigma_{2} u_{2} & \cdots & \sigma_{r} u_{r}\end{array}\right]=\left[\begin{array}{cccc}u_{1} & u_{2} & \cdots & u_{r}\end{array}\right]\left[\begin{array}{cccc}\sigma_{1} & & & \\ & \sigma_{2} & & \\ & & \ddots & \\ & & & \sigma_{r}\end{array}\right]$$

This says exactly that $A v_{1}=\sigma_{1} u_{1}, A v_{2}=\sigma_{2} u_{2}, \cdots, A v_{r}=\sigma_{r} u_{r}$, where each $\sigma_{i}$ is a scaling factor (a singular value). Extending the bases to include the null space and the left null space adds zeros to the diagonal of $\Sigma$. 

$$
A\left[\begin{array}{cccccc}
v_{1} & \cdots & v_{r} & v_{r+1} & \cdots & v_{n}
\end{array}\right]=\left[\begin{array}{cccccc}
u_{1} & \cdots & u_{r} & u_{r+1} & \cdots & u_{m}
\end{array}\right]\left[\begin{array}{ccc|c}
\sigma_{1} & & & \\
& \ddots & & \\
& & \sigma_{r} & \\
\hline & & & 0
\end{array}\right]
$$

The matrix form is:

$$
A V=U \Sigma
$$

$U$ is $m\times m$, $\Sigma$ is $m\times n$, and $V^T$ is $n\times n$. Multiplying $AV=U\Sigma$ on the right by $V^{-1}$ gives $A=U \Sigma V^{-1}$. Since $V$ is an orthogonal matrix, $V^{-1}=V^{T}$, so $A=U \Sigma V^{T}$. If $A$ is symmetric positive definite, $U$ and $V$ both become $Q$. 


Next we solve an example: Given $A=\left[\begin{array}{cc}4 & 4 \\ -3 & 3\end{array}\right]$, we want to find
* Orthonormal basis of row space $\mathbb{R}^2$: $v_{1}, v_{2}$
* Orthonormal basis of column space $\mathbb{R}^2$: $u_{1}, u_{2}$
* $\sigma_{1}>0, \sigma_{2}>0$

There are two unknown orthogonal matrices in $A = U \Sigma V^{T}$. How can we find them one at a time? 

The trick is to first compute $A^{T}A$, which gives $A^{T} A=V \Sigma^{T} U^{T} U \Sigma V^{T}$. Because $U^{T} U=I$, this reduces to $V \Sigma^{T} \Sigma V^{T}$, where $\Sigma^{T} \Sigma$ is a diagonal matrix whose entries are the squared singular values. Thus we have:
$$A^{T} A=V\left[\begin{array}{cccc}\sigma_{1}^{2} & & & \\ & \sigma_{2}^{2} & & \\ & & \ddots & \\ & & & \sigma_{n}^{2}\end{array}\right] V^{T}$$

Because $A^{T}A$ is symmetric positive semidefinite, this is its eigendecomposition: $V$ is the eigenvector matrix and $\Sigma^{T}\Sigma$ is the eigenvalue matrix, with eigenvalues $\sigma_{i}^{2}$. 

Similarly, we can find $U$ by computing $AA^{T}$, which eliminates $V$. 

Now we carry out the computation for the example $A=\left[\begin{array}{cc}4 & 4 \\ -3 & 3\end{array}\right]$. 

$A^{T} A=\left[\begin{array}{cc}4 & -3 \\ 4 & 3\end{array}\right]\left[\begin{array}{cc}4 & 4 \\ -3 & 3\end{array}\right]=\left[\begin{array}{cc}25 & 7 \\ 7 & 25\end{array}\right]$, $A^{T} A\left[\begin{array}{l}1 \\ 1\end{array}\right]=32\left[\begin{array}{l}1 \\ 1\end{array}\right], A^{T} A\left[\begin{array}{c}1 \\ -1\end{array}\right]=18\left[\begin{array}{c}1 \\ -1\end{array}\right]$

Thus its eigenvectors are $\left[\begin{array}{l}1 \\ 1\end{array}\right], \left[\begin{array}{c}1 \\ -1\end{array}\right]$, with eigenvalues 32 and 18. The unit eigenvectors are $\left[\begin{array}{c}\frac{1}{\sqrt{2}} \\ \frac{1}{\sqrt{2}}\end{array}\right], \left[\begin{array}{c}\frac{1}{\sqrt{2}} \\ -\frac{1}{\sqrt{2}}\end{array}\right]$. 
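The eigenvalues above can be checked numerically. The following is a minimal sketch using only the Python standard library; the helper names (`transpose`, `matmul`, `symmetric_2x2_eigenvalues`) are introduced here for illustration, and the closed-form eigenvalue formula assumes a symmetric $2\times 2$ matrix.

```python
import math

def transpose(M):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*M)]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def symmetric_2x2_eigenvalues(S):
    """Eigenvalues of a symmetric 2x2 matrix from its trace and determinant."""
    trace = S[0][0] + S[1][1]
    det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    disc = math.sqrt(trace * trace - 4 * det)  # real for symmetric matrices
    return (trace + disc) / 2, (trace - disc) / 2

A = [[4, 4], [-3, 3]]
AtA = matmul(transpose(A), A)
lam1, lam2 = symmetric_2x2_eigenvalues(AtA)

# Singular values are the square roots of the eigenvalues of A^T A.
sigma1, sigma2 = math.sqrt(lam1), math.sqrt(lam2)

print(AtA)         # [[25, 7], [7, 25]]
print(lam1, lam2)  # 32.0 18.0
```

The singular values $\sigma_1=\sqrt{32}$ and $\sigma_2=\sqrt{18}$ are the square roots of these eigenvalues.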

So far we have: $\left[\begin{array}{cc}4 & 4 \\ -3 & 3\end{array}\right]=\left[\begin{array}{cc}u_{?} & u_{?} \\ u_{?} & u_{?}\end{array}\right]\left[\begin{array}{cc}\sqrt{32} & 0 \\ 0 & \sqrt{18}\end{array}\right]\left[\begin{array}{cc}\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \\ \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}}\end{array}\right]$. Next, we compute $U$. 

In general $A A^{T}=U \Sigma V^{T} V \Sigma^{T} U^{T}=U \Sigma \Sigma^{T} U^{T}$, and for this example $A A^{T}=\left[\begin{array}{cc}32 & 0 \\ 0 & 18\end{array}\right]$. It is easy to read off a set of eigenvectors: $A A^{T}\left[\begin{array}{l}1 \\ 0\end{array}\right]=32\left[\begin{array}{l}1 \\ 0\end{array}\right], A A^{T}\left[\begin{array}{l}0 \\ 1\end{array}\right]=18\left[\begin{array}{l}0 \\ 1\end{array}\right]$. But we cannot use $\left[\begin{array}{l}1 \\ 0\end{array}\right], \left[\begin{array}{l}0 \\ 1\end{array}\right]$ directly, because $A A^{T}$ determines each eigenvector only up to sign, while the columns of $U$ must also satisfy $A v_{i}=\sigma_{i} u_{i}$. Here $A v_{1}=\left[\begin{array}{c}\sqrt{32} \\ 0\end{array}\right]=\sigma_{1}\left[\begin{array}{c}1 \\ 0\end{array}\right]$ and $A v_{2}=\left[\begin{array}{c}0 \\ -\sqrt{18}\end{array}\right]=\sigma_{2}\left[\begin{array}{c}0 \\ -1\end{array}\right]$, so $u_{2}=\left[\begin{array}{c}0 \\ -1\end{array}\right]$. Thus $U = \left[\begin{array}{rr}1 & 0 \\ 0 & -1\end{array}\right]$. 

Finally we get: 

$\left[\begin{array}{cc}4 & 4 \\ -3 & 3\end{array}\right]=\left[\begin{array}{cc}1 & 0 \\ 0 & -1\end{array}\right]\left[\begin{array}{cc}\sqrt{32} & 0 \\ 0 & \sqrt{18}\end{array}\right]\left[\begin{array}{cc}\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \\ \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}}\end{array}\right]$
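One way to avoid the sign ambiguity is to compute each $u_i$ directly as $A v_i / \sigma_i$ and then multiply the factors back together. The sketch below does this for the example, using only the standard library; the helper functions are illustrative, and it assumes all singular values are nonzero (true here because $A$ is invertible).

```python
import math

def transpose(M):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*M)]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

A = [[4, 4], [-3, 3]]
s = 1 / math.sqrt(2)
V = [[s, s], [s, -s]]                    # columns are v1 and v2
sigmas = [math.sqrt(32), math.sqrt(18)]  # square roots of the eigenvalues of A^T A

# Column j of A V is A v_j; dividing by sigma_j gives u_j with the correct sign.
AV = matmul(A, V)
U = [[AV[i][j] / sigmas[j] for j in range(2)] for i in range(2)]

Sigma = [[sigmas[0], 0.0], [0.0, sigmas[1]]]
reconstructed = matmul(matmul(U, Sigma), transpose(V))

print([[round(x, 6) for x in row] for row in U])
print([[round(x, 6) for x in row] for row in reconstructed])
```

Up to rounding, `U` comes out as the diagonal matrix with entries $1$ and $-1$, and `reconstructed` matches $A$.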

Remark:
* $AB$ and $BA$ have the same nonzero eigenvalues. If $A B v=\lambda v$, then multiplying by $B$ gives $(B A)(B v)=B(A B v)=\lambda B v$. The eigenvalue is the same, but the eigenvector changes from $v$ to $Bv$. 
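The remark can be checked on a small case. In this sketch the matrices `A` and `B` are arbitrary illustrative choices, not taken from the lecture, and the eigenvalue helper assumes a $2\times 2$ matrix with real eigenvalues.

```python
import math

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def matvec(M, x):
    """Multiply a matrix by a vector."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def eigenvalues_2x2(M):
    """Eigenvalues of a 2x2 matrix from trace and determinant (assumes they are real)."""
    trace = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    disc = math.sqrt(trace * trace - 4 * det)
    return (trace + disc) / 2, (trace - disc) / 2

# Arbitrary illustrative matrices (an assumption of this sketch).
A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 1]]
AB, BA = matmul(A, B), matmul(B, A)

eig_AB = eigenvalues_2x2(AB)
eig_BA = eigenvalues_2x2(BA)

# An eigenvector v of AB for the larger eigenvalue, read off the first row of AB - lam*I.
lam = eig_AB[0]
v = [AB[0][1], lam - AB[0][0]]

# Bv should then be an eigenvector of BA with the same eigenvalue.
w = matvec(B, v)
lhs = matvec(BA, w)
rhs = [lam * wi for wi in w]

print(eig_AB, eig_BA)
print(lhs, rhs)
```

The two eigenvalue pairs agree, and `lhs` equals `rhs` up to rounding, which is the statement $(BA)(Bv)=\lambda Bv$.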

Let's see an example $A=\left[\begin{array}{ll}4 & 3 \\ 8 & 6\end{array}\right]$ whose rank is $1$. 
* Unit basis vectors of row and column spaces are: $v_{1}=\left[\begin{array}{l}0.8 \\ 0.6\end{array}\right], u_{1}=\frac{1}{\sqrt{5}}\left[\begin{array}{l}1 \\ 2\end{array}\right]$
* $A^{T} A=\left[\begin{array}{ll}4 & 8 \\ 3 & 6\end{array}\right]\left[\begin{array}{ll}4 & 3 \\ 8 & 6\end{array}\right]=\left[\begin{array}{ll}80 & 60 \\ 60 & 45\end{array}\right]$ is not full rank, so one of its eigenvalues is 0. From the trace, the other eigenvalue is 125, so $\sigma_{1}=\sqrt{125}$. 
* The unit basis vectors for the null space and the left null space are $v_{2}=\left[\begin{array}{c}0.6 \\ -0.8\end{array}\right], u_{2}=\frac{1}{\sqrt{5}}\left[\begin{array}{c}2 \\ -1\end{array}\right]$

Finally we get: $\left[\begin{array}{ll}4 & 3 \\ 8 & 6\end{array}\right]=\frac{1}{\sqrt{5}}\left[\begin{array}{cc}1 & \underline{2} \\ 2 & \underline{-1}\end{array}\right]\left[\begin{array}{cc}\sqrt{125} & 0 \\ 0 & \underline{0}\end{array}\right]\left[\begin{array}{cc}0.8 & 0.6 \\ \underline{0.6} & \underline{-0.8}\end{array}\right]$. The underlined entries are associated with the null space and the left null space. 
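As a sanity check on the rank-one example, the sketch below multiplies the three factors, including the $\frac{1}{\sqrt{5}}$ normalization of $u_1$ and $u_2$, and confirms that $v_2$ and $u_2$ lie in the null space and the left null space. It uses only the standard library, and the helper functions are illustrative.

```python
import math

def transpose(M):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*M)]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def matvec(M, x):
    """Multiply a matrix by a vector."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

A = [[4, 3], [8, 6]]
c = 1 / math.sqrt(5)

U = [[c, 2 * c], [2 * c, -c]]                # columns are u1 and u2
Sigma = [[math.sqrt(125), 0.0], [0.0, 0.0]]  # only one nonzero singular value
Vt = [[0.8, 0.6], [0.6, -0.8]]               # rows are v1 and v2

product = matmul(matmul(U, Sigma), Vt)
null_check = matvec(A, [0.6, -0.8])                  # A v2, expected to be ~0
left_null_check = matvec(transpose(A), [2 * c, -c])  # A^T u2, expected to be ~0

print([[round(x, 6) for x in row] for row in product])
print(null_check, left_null_check)
```

Because $\sigma_2=0$, the second column of $U$ and the second row of $V^T$ do not affect the product; they only complete the orthonormal bases.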

* $v_{1}, \cdots, v_{r}$ is an orthonormal basis for the row space 
* $u_{1}, \cdots, u_{r}$ is an orthonormal basis for the column space
* $v_{r+1}, \cdots, v_{n}$ is an orthonormal basis for the null space
* $u_{r+1}, \cdots, u_{m}$ is an orthonormal basis for the left null space 

These bases are linked by $A v_{i}=\sigma_{i} u_{i}$ for $i=1, \cdots, r$, and $A v_{i}=0$ for $i>r$. 