#### Diagonalization of matrices

One application of eigenvalues and eigenvectors is the diagonalization of a matrix, specifically

A matrix $A\in \mathbf{R}^{n \times n}$ is diagonalizable `if and only if` it has a set of $n$ linearly independent eigenvectors

To see this

(1) Suppose $v_1, \cdots, v_n$ is an `independent set of eigenvectors` of $A\in \mathbf{R}^{n \times n}$, so that

$$Av_i=\lambda_i v_i$$

We can express these as

$$A\begin{bmatrix}v_1 & \cdots & v_n\end{bmatrix}=\begin{bmatrix}v_1 & \cdots & v_n\end{bmatrix}\begin{bmatrix}\lambda_1 &  & \\ & \ddots & \\ & & \lambda_n\end{bmatrix}$$

Let $v_1, \cdots, v_n$ be the `columns` of a matrix $T$, and let $\Lambda=\text{diag} (\lambda_1, \cdots, \lambda_n)$, then we have

$$AT=T\Lambda$$

Since the eigenvectors are assumed to be independent, $T$ has full rank and is therefore `invertible`. As a result, we can `diagonalize` $A$ as

$$T^{-1}AT=\Lambda$$

(2) Assume that $A$ is `diagonalizable`, then by definition, there is an invertible matrix $P$ and a diagonal matrix $D$ such that

$$P^{-1}AP=D$$

or

$$\begin{bmatrix}p_1 & \cdots & p_n\end{bmatrix}^{-1}A\begin{bmatrix}p_1 & \cdots & p_n\end{bmatrix}=\begin{bmatrix}d_1 &  & \\ & \ddots & \\ & & d_n\end{bmatrix}$$

Left multiply each side by $\begin{bmatrix}p_1 & \cdots & p_n\end{bmatrix}$ to get

$$A\begin{bmatrix}p_1 & \cdots & p_n\end{bmatrix}=\begin{bmatrix}p_1 & \cdots & p_n\end{bmatrix}\begin{bmatrix}d_1 &  & \\ & \ddots & \\ & & d_n\end{bmatrix}$$

Reading this equation column by column gives $Ap_i=d_ip_i$

Since $\begin{bmatrix}p_1 & \cdots & p_n\end{bmatrix}$ is `invertible`, its columns are nonzero and form an `independent set`. Then, by `definition`, $p_1, \cdots, p_n$ are $n$ independent eigenvectors of $A$ with eigenvalues $d_1, \cdots, d_n$
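As a numerical illustration of both directions of the argument, here is a minimal sketch using NumPy. The matrix `A` is a hypothetical example chosen for this note (it is not from the text), and `np.linalg.eig` is assumed to return the eigenvectors as the columns of `T`

```python
import numpy as np

# Hypothetical example matrix (an assumption for illustration); eigenvalues are 1 and 3.
A = np.array([[2.0, 1.0],
              [1.0, 2.0]])

# np.linalg.eig returns the eigenvalues and a matrix whose columns are eigenvectors.
eigvals, T = np.linalg.eig(A)
Lam = np.diag(eigvals)

# A T = T Lambda holds column by column, since A v_i = lambda_i v_i.
AT_equals_TLam = np.allclose(A @ T, T @ Lam)

# Independent eigenvectors mean T has full rank, so T is invertible.
rank_T = np.linalg.matrix_rank(T)

# T^{-1} A T should then reproduce the diagonal matrix Lambda.
D = np.linalg.inv(T) @ A @ T
is_diagonalized = np.allclose(D, Lam)

print(AT_equals_TLam, rank_T, is_diagonalized)
```

The order of the eigenvalues returned by `np.linalg.eig` is not guaranteed, which is why the check compares against `np.diag(eigvals)` rather than a hand-written diagonal matrix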

#### Not all square matrices are diagonalizable

For example

$$A=\begin{bmatrix}0 & 1 \\ 0 & 0\end{bmatrix}$$

has $0$ as its only eigenvalue, since its characteristic polynomial is $\lambda^2$

So its eigenvectors satisfy

$$\begin{bmatrix}0 & 1 \\ 0 & 0\end{bmatrix}v=0$$

Any two of its eigenvectors are `not independent`, as they are all of the form $v=\begin{bmatrix}b\\ 0\end{bmatrix}$, where $b\neq 0$. So $A$ does not have two independent eigenvectors and is not diagonalizable
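The same conclusion can be checked numerically by counting independent eigenvectors. The sketch below is an illustration under the assumption that the number of independent eigenvectors for an eigenvalue $\lambda$ equals $n-\text{rank}(A-\lambda I)$ (the dimension of the null space); the variable names are chosen here and are not from the text

```python
import numpy as np

A = np.array([[0.0, 1.0],
              [0.0, 0.0]])
n = A.shape[0]

# A is upper triangular, so its eigenvalues are its diagonal entries: 0 and 0.
lam = 0.0

# Eigenvectors for lam span the null space of (A - lam I).
# Its dimension is n - rank(A - lam I).
num_independent_eigvecs = n - np.linalg.matrix_rank(A - lam * np.eye(n))

# Diagonalizable only if there are n independent eigenvectors in total.
is_diagonalizable = (num_independent_eigvecs == n)

print(num_independent_eigvecs, is_diagonalizable)
```

The rank is computed on $A-\lambda I$ directly rather than on the eigenvector matrix returned by `np.linalg.eig`, because for a matrix like this one the returned eigenvectors are nearly parallel and a rank test on them depends on floating-point tolerance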

#### Matrices with distinct eigenvalues are diagonalizable

Let $\lambda_1,\cdots, \lambda_n$ be distinct eigenvalues and $v_1,\cdots,v_n$ be the corresponding eigenvectors of matrix $A$, that is

$$Av_i=\lambda_i v_i$$

Assume that the eigenvectors are `not independent` (we want to derive a `contradiction`, thereby proving that the eigenvectors must be independent, which leads to diagonalization)

Then, we can always reindex the eigenvectors such that $v_1,\cdots, v_j$ is a smallest `dependent` set, meaning that `no proper subset of it is dependent`; in particular, $v_1,\cdots,v_{j-1}$ are independent

(such a set must contain at least two eigenvectors, that is $j\geq 2$, because a single eigenvector is nonzero and hence independent on its own)

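Because $v_1,\cdots,v_j$ are dependent while $v_1,\cdots,v_{j-1}$ are independent, $v_j$ is a combination of the others: there exist coefficients $a_1,\cdots, a_{j-1}$, `not all zero` since $v_j\neq 0$, such that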

$$\boxed{v_j=a_1 v_1+ \cdots + a_{j-1} v_{j-1}}$$

Left multiply both sides of the boxed equation by $A$ and use $Av_i=\lambda_i v_i$

$$\lambda_j v_j=a_1 \lambda_1 v_1+ \cdots + a_{j-1} \lambda_{j-1} v_{j-1}$$

Then, we multiply both sides of the boxed equation by $\lambda_j$

$$\lambda_j v_j=a_1 \lambda_j v_1+ \cdots + a_{j-1} \lambda_j v_{j-1}$$

Subtracting the second equation from the first, we have

$$0=a_1(\lambda_1-\lambda_j)v_1+\cdots+ a_{j-1}(\lambda_{j-1}-\lambda_j)v_{j-1}$$

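Since all eigenvalues are `distinct`, $\lambda_i-\lambda_j\neq 0$ for every $i<j$, and since the $a_i$'s are `not all zero`, the coefficients $a_i(\lambda_i-\lambda_j)$ are not all zero either. The equation then says that $v_1,\cdots,v_{j-1}$ are `dependent`, which contradicts the choice of $v_1,\cdots,v_j$ as a smallest dependent set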

It follows that eigenvectors must be independent, and therefore, the matrix is diagonalizable

The converse is `not true`, for example, $I$ is diagonalizable, but it has `repeated` eigenvalues
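Both the statement and the failure of its converse can be illustrated numerically. This is a minimal sketch with NumPy; the triangular matrix `A` is a hypothetical example (not from the text) whose eigenvalues $1, 2, 3$ can be read off its diagonal

```python
import numpy as np

# Hypothetical non-symmetric example with distinct eigenvalues 1, 2, 3.
A = np.array([[1.0, 1.0, 0.0],
              [0.0, 2.0, 1.0],
              [0.0, 0.0, 3.0]])
lam, T = np.linalg.eig(A)

# Distinct eigenvalues: three different values after rounding away float noise.
distinct = len(np.unique(np.round(lam, 8))) == 3

# The eigenvectors are then independent, so T has full rank and diagonalizes A.
rank_T = np.linalg.matrix_rank(T)
diag_ok = np.allclose(np.linalg.inv(T) @ A @ T, np.diag(lam))

# The converse fails: the identity has the repeated eigenvalue 1,
# yet it still has a full set of independent eigenvectors.
lam_I, T_I = np.linalg.eig(np.eye(3))
repeated = np.allclose(lam_I, 1.0)
rank_T_I = np.linalg.matrix_rank(T_I)

print(distinct, rank_T, diag_ok, repeated, rank_T_I)
```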

#### Diagonalization simplifies many matrix expressions

With $T^{-1}AT=\Lambda$, or equivalently $A=T\Lambda T^{-1}$, many functions of $A$ reduce to scalar functions of its eigenvalues

For `resolvent`, we have

$$\begin{align*}
(sI-A)^{-1} & = \left(TsIT^{-1}-T\Lambda T^{-1}\right)^{-1} \\
&=\left(T(sI-\Lambda)T^{-1}\right)^{-1} \\
&=T(sI-\Lambda)^{-1}T^{-1} \\
&=T \text{diag}\left(\frac{1}{s-\lambda_1},\cdots,\frac{1}{s-\lambda_n}\right)T^{-1}
\end{align*}$$

For `powers`

$$\begin{align*}
A^k &= (T\Lambda T^{-1})^k \\
&= (T\Lambda T^{-1})(T\Lambda T^{-1})\cdots (T\Lambda T^{-1})(T\Lambda T^{-1}) \\
&= T\Lambda (T^{-1}T)\Lambda (T^{-1}T)\cdots (T^{-1}T)\Lambda T^{-1} \\
&=T\Lambda^kT^{-1}\\
&=T\text{diag}\left(\lambda_1^k,\cdots,\lambda_n^k\right)T^{-1}
\end{align*}$$

(this also holds for $k<0$, but only if $A$ is invertible, i.e., all $\lambda_i\neq 0$)

For `exponential`

$$\begin{align*}
e^A&= I + A + A^2/2!+\cdots \\
&= I + T\Lambda T^{-1}+ T\Lambda^2T^{-1}/2!+\cdots \\
&=T(I+\Lambda+\Lambda^2/2!+\cdots)T^{-1}\\
&=Te^{\Lambda}T^{-1}\\
&=T\text{diag}(e^{\lambda_1},\cdots,e^{\lambda_n})T^{-1}
\end{align*}$$
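The three identities above share one pattern, $f(A)=T\,\text{diag}(f(\lambda_1),\cdots,f(\lambda_n))\,T^{-1}$, which the following sketch checks numerically. The matrix `A`, the point `s`, the power `k` and the helper `via_eig` are all assumptions made for this illustration; the exponential is compared against a truncated power series rather than a library routine

```python
import numpy as np

# Hypothetical example matrix with eigenvalues -1 and -2.
A = np.array([[0.0, 1.0],
              [-2.0, -3.0]])
lam, T = np.linalg.eig(A)
T_inv = np.linalg.inv(T)

def via_eig(f):
    """Return T diag(f(lambda_1), ..., f(lambda_n)) T^{-1}."""
    return T @ np.diag(f(lam)) @ T_inv

# Resolvent at a point s that is not an eigenvalue of A.
s = 1.0
resolvent_eig = via_eig(lambda x: 1.0 / (s - x))
resolvent_direct = np.linalg.inv(s * np.eye(2) - A)

# Integer power.
k = 5
power_eig = via_eig(lambda x: x ** k)
power_direct = np.linalg.matrix_power(A, k)

# Exponential, compared with the series I + A + A^2/2! + ... (first 30 terms).
exp_eig = via_eig(np.exp)
exp_series = np.zeros_like(A)
term = np.eye(2)
for m in range(1, 31):
    exp_series += term       # adds A^(m-1) / (m-1)!
    term = term @ A / m      # becomes A^m / m!

print(np.allclose(resolvent_eig, resolvent_direct),
      np.allclose(power_eig, power_direct),
      np.allclose(exp_eig, exp_series))
```

In floating point this route is only as accurate as $T$ is well conditioned, so it is best read as a check of the algebra rather than as a recommended way to compute these quantities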

#### Diagonalization simplifies linear relation

Assume matrix $A$ is diagonalizable, with $A=T\Lambda T^{-1}$

If we have two vectors $x, y$ such that $y=Ax$, then we can write

$$TT^{-1}y=ATT^{-1}x$$

Denote the coefficients of the `expansion` of the two vectors in the `eigenvectors` of $A$ as $\hat{x}=T^{-1}x$ and $\hat{y}=T^{-1}y$ (so that $x=T\hat{x}$ and $y=T\hat{y}$), we have

$$T\hat{y}=AT\hat{x}\Longrightarrow \hat{y}=T^{-1}AT\hat{x}=\Lambda \hat{x}$$

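That is, in the basis of eigenvectors of $A$, the two vectors are connected via the diagonal matrix of the eigenvalues of $A$, so $\hat{y}_i=\lambda_i\hat{x}_i$ for each $i$. Each entry of $\hat{y}$ depends only on the corresponding entry of $\hat{x}$, and when $\lambda_i\neq 0$, the entry $\hat{x}_i$ can be determined independently from $\hat{y}_i$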
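The decoupling can be seen on a small example. This is a sketch under assumptions: `A` and `x` are hypothetical values chosen for this note, and the coordinates are obtained with `np.linalg.solve` instead of forming $T^{-1}$ explicitly

```python
import numpy as np

# Hypothetical example data; A has eigenvalues 2 and 5.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])
x = np.array([3.0, 0.0])
y = A @ x

lam, T = np.linalg.eig(A)

# Coordinates in the eigenvector basis: solve T x_hat = x and T y_hat = y.
x_hat = np.linalg.solve(T, x)
y_hat = np.linalg.solve(T, y)

# y_hat = Lambda x_hat: each coordinate is scaled by its own eigenvalue.
decoupled = np.allclose(y_hat, lam * x_hat)

# No eigenvalue is zero here, so x can be recovered one coordinate at a time.
x_recovered = T @ (y_hat / lam)

print(decoupled, np.allclose(x_recovered, x))
```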

#### Diagonalization and left eigenvectors

Rewrite

$$T^{-1}AT=\Lambda$$

as

$$T^{-1}A=\Lambda T^{-1}$$

or

$$\begin{bmatrix}w_1^T \\ \vdots \\ w_n^T\end{bmatrix}A=\Lambda \begin{bmatrix}w_1^T \\ \vdots \\ w_n^T\end{bmatrix}$$

Thus

$$w_i^TA=\lambda_iw_i^T$$

That is, the `rows` of $T^{-1}$ are linearly independent `left eigenvectors` of $A$

As a result, the `left and right eigenvectors` chosen this way form `dual bases`; the normalization $w_i^Tv_i=1$ is already built in, since $T^{-1}T=I$

$$w_i^Tv_j = \begin{cases}0 & i\neq j \\ 1 & i=j \end{cases}$$
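A short numerical check of the left-eigenvector relations, as a sketch only: the matrix `A` is a hypothetical example, and `W` is a name introduced here for $T^{-1}$, whose $i$-th row plays the role of $w_i^T$

```python
import numpy as np

# Hypothetical example matrix with eigenvalues 2 and 5.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])
lam, T = np.linalg.eig(A)   # columns of T are the right eigenvectors v_j
W = np.linalg.inv(T)        # row i of W is w_i^T

# w_i^T A = lambda_i w_i^T for every row, i.e. W A = Lambda W.
left_ok = np.allclose(W @ A, np.diag(lam) @ W)

# Duality: entry (i, j) of W T is w_i^T v_j, which should be 1 if i == j else 0.
dual_ok = np.allclose(W @ T, np.eye(2))

# Equivalently, the columns of W^T are right eigenvectors of A^T.
transpose_ok = np.allclose(A.T @ W.T, W.T @ np.diag(lam))

print(left_ok, dual_ok, transpose_ok)
```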