# Matrices Revisited

toc: true
badges: false

## Norm

A norm is a function that measures the size of a **vector**, i.e. its distance from the origin. A function $f$ is a norm if it has the following properties:

$$f(x) = 0 \implies x = 0 \tag{1}$$

$$f(x+y) \leq f(x) + f(y) \cdots \text{ Triangle Inequality} \tag{2}$$

$$f(\alpha x) = |\alpha| f(x) \text{  } \forall \alpha \in \mathbb{R} \tag{3}$$
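As a quick numerical sanity check, the three properties can be tested on sample vectors. The sketch below assumes NumPy and uses `np.linalg.norm` (the Euclidean norm, introduced below) as one possible choice of $f$; the vectors and the scalar `alpha` are arbitrary illustrative values, so this is a spot check rather than a proof.

```python
import numpy as np

# Arbitrary sample data (illustrative assumptions, not from the text)
rng = np.random.default_rng(0)
x = rng.normal(size=5)
y = rng.normal(size=5)
alpha = -3.0

f = np.linalg.norm  # for a 1-D array this is the Euclidean norm by default

# Property (1): the zero vector has norm 0
zero_norm = f(np.zeros(5))

# Property (2): triangle inequality
triangle_holds = f(x + y) <= f(x) + f(y)

# Property (3): absolute homogeneity
homogeneous = np.isclose(f(alpha * x), abs(alpha) * f(x))

print(zero_norm, triangle_holds, homogeneous)
```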

A common family of examples is the $L^p$ norm, defined for $p \geq 1$ as

$$||x||_p = \left(\sum_i |x_i|^p\right)^\frac{1}{p} \tag{$L^p$ Norm}$$

When $p = 2$, we have the $L^2$ norm, also known as the Euclidean norm. For practical purposes we often use the squared $L^2$ norm.
$$||x||_2^2 = \sum_i |x_i|^2$$

However, when we are working with values near $0$, a change of $\epsilon$ away from $0$ changes the squared $L^2$ norm by only $\epsilon^2$, which is very small. The $L^1$ norm, which grows linearly, can be used in those cases.

$$||x||_1 = \sum_i |x_i| \tag{$L^1$ Norm}$$
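The norms above can be computed directly. The following is a minimal sketch assuming NumPy; the helper `lp_norm` and the sample vector are illustrative names and values, not part of any library. The last lines show why the squared $L^2$ norm is insensitive near $0$ compared with the $L^1$ norm.

```python
import numpy as np

def lp_norm(x, p):
    """L^p norm written out from the definition (illustrative helper)."""
    return np.sum(np.abs(x) ** p) ** (1.0 / p)

x = np.array([3.0, -4.0])

l1 = np.linalg.norm(x, ord=1)   # |3| + |-4|
l2 = np.linalg.norm(x, ord=2)   # sqrt(3^2 + 4^2)
l2_squared = x @ x              # 3^2 + 4^2, no square root needed

print(l1, l2, l2_squared)
print(lp_norm(x, 1), lp_norm(x, 2))

# Near zero: moving a component from 0 to eps
eps = 1e-3
tiny = np.array([eps])
tiny_l1 = np.linalg.norm(tiny, ord=1)   # grows like eps
tiny_l2_squared = tiny @ tiny           # grows like eps**2
print(tiny_l1, tiny_l2_squared)
```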

## Orthogonal matrices

Properties of orthogonal matrices
- They must be square
- Rows are mutually orthonormal
- Columns are mutually orthonormal

> Two vectors $x$ and $y$ are **orthonormal** if their dot product is 0 and they have unit norm.

It follows from this definition that any orthogonal matrix $A$ satisfies

$$A^T A = A A^T = I$$

$$A^{-1} = A^T$$
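These identities are easy to check numerically. The sketch below assumes NumPy and builds an orthogonal matrix from the QR factorization of a random square matrix, which is one convenient way to obtain one; the variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# The Q factor of a QR factorization of a square matrix is orthogonal
Q, _ = np.linalg.qr(rng.normal(size=(4, 4)))
identity = np.eye(4)

rows_orthonormal = np.allclose(Q @ Q.T, identity)
cols_orthonormal = np.allclose(Q.T @ Q, identity)
inverse_is_transpose = np.allclose(np.linalg.inv(Q), Q.T)

print(rows_orthonormal, cols_orthonormal, inverse_is_transpose)
```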

### Multiplying with orthogonal matrices leads to rotation

To show that multiplying by an orthogonal matrix acts as a rotation (possibly combined with a reflection), we need to prove two things:
- The length/norm of a vector is preserved by the matrix multiplication.
- The angles between vectors are preserved by the matrix multiplication.

#### Length is preserved

Let $C$ be an orthogonal matrix. Then the squared norm of the transformed vector is
$$||C \vec{x}||^2$$

$$= C \vec{x} \cdot C \vec{x}$$

$$= (C \vec{x})^T C \vec{x}$$

$$= \vec{x}^T C^T C \vec{x}$$

$$= \vec{x}^T I \vec{x}$$

$$= \vec{x}^T \vec{x}$$

$$= \vec{x} \cdot \vec{x}$$

$$= || \vec{x} ||^2$$
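The derivation can be spot-checked with a concrete case. The sketch below assumes NumPy and uses a 2D rotation matrix and a 2D reflection matrix as two example orthogonal matrices; the angle and the vector are arbitrary illustrative choices.

```python
import numpy as np

theta = np.pi / 3  # arbitrary rotation angle

# Rotation by theta (orthogonal, determinant +1)
rotation = np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]])

# Reflection across the first axis (orthogonal, determinant -1)
reflection = np.array([[1.0,  0.0],
                       [0.0, -1.0]])

x = np.array([3.0, 4.0])

norm_before = np.linalg.norm(x)
norm_rotated = np.linalg.norm(rotation @ x)
norm_reflected = np.linalg.norm(reflection @ x)

print(norm_before, norm_rotated, norm_reflected)
```

Both matrices leave the length unchanged, which matches the fact that the proof only uses the identity that the transpose times the matrix is $I$.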

#### Angle between vectors is preserved

Let $\vec{v}$ and $\vec{w}$ be two vectors and let $\theta$ be the angle between them.

$$\therefore \text{ } \cos \theta = \frac{\vec{v} \cdot \vec{w}}{||\vec{v}||\text{ }||\vec{w}||}$$

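Let $C$ be an orthogonal matrix and let $\theta_c$ be the angle between the transformed vectors $C\vec{v}$ and $C\vec{w}$. Since $C$ preserves lengths, $||C\vec{v}|| = ||\vec{v}||$ and $||C\vec{w}|| = ||\vec{w}||$.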

$$\therefore \text{ } \cos \theta_c = \frac{C\vec{v} \cdot C\vec{w}}{||C\vec{v}||\text{ }||C\vec{w}||}$$

$$ = \frac{(C\vec{w})^T C\vec{v}}{||\vec{v}||\text{ }||\vec{w}||}$$

$$ = \frac{\vec{w}^T C^T C\vec{v}}{||\vec{v}||\text{ }||\vec{w}||}$$

$$ = \frac{\vec{w}^T I \vec{v}}{||\vec{v}||\text{ }||\vec{w}||}$$

$$ = \frac{\vec{w}^T \vec{v}}{||\vec{v}||\text{ }||\vec{w}||}$$

$$ = \frac{\vec{v} \cdot \vec{w}}{||\vec{v}||\text{ }||\vec{w}||}$$

$$ \cos \theta_c = \cos \theta $$
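The same check can be done for angles. This is a minimal sketch assuming NumPy; `cos_angle` is an illustrative helper, and the orthogonal matrix is again taken from a QR factorization of a random matrix.

```python
import numpy as np

def cos_angle(v, w):
    """Cosine of the angle between v and w (illustrative helper)."""
    return (v @ w) / (np.linalg.norm(v) * np.linalg.norm(w))

rng = np.random.default_rng(1)
C, _ = np.linalg.qr(rng.normal(size=(3, 3)))  # an orthogonal 3x3 matrix

v = np.array([1.0, 2.0, 3.0])
w = np.array([-1.0, 0.5, 2.0])

cos_before = cos_angle(v, w)
cos_after = cos_angle(C @ v, C @ w)

print(cos_before, cos_after)
```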

## Eigen vectors and Eigen values

For a square matrix $A$, a nonzero vector $x$ and a scalar $\lambda$, we can have the following equation:

$$A x = \lambda x$$

Here, $x$ is an eigen vector of $A$ and $\lambda$ is its corresponding eigen value.
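To make the definition concrete, the sketch below (assuming NumPy) computes the eigen values and eigen vectors of a small example matrix with `np.linalg.eig` and verifies the equation for each pair. The matrix is an arbitrary illustrative choice; `np.linalg.eig` returns the eigen vectors as the columns of its second output.

```python
import numpy as np

A = np.array([[4.0, 1.0],
              [2.0, 3.0]])

eigenvalues, eigenvectors = np.linalg.eig(A)

# Each column of `eigenvectors` pairs with the eigen value at the same index
checks = []
for lam, x in zip(eigenvalues, eigenvectors.T):
    holds = np.allclose(A @ x, lam * x)
    checks.append(holds)
    print(lam, x, holds)
```

Multiplying an eigen vector by $A$ only rescales it, so both sides agree up to floating-point error.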

To get an intuition for this equation, think of multiplying a vector by a matrix as applying a function that transforms the space. The space in which the transformation takes place has a few axes (imagine $x$, $y$ and $z$, although more than three are possible). The transformation by the matrix $A$ can be decomposed into sub-transformations along these axes, each of which scales vectors along its axis by some factor. That scaling factor $\lambda$ is called the eigen value.

Let us assume a vector $\vec{v}$ along the axis $x$ (the $x$ from the equation above). The dot product of $\vec{v}$ with every axis other than $x$ cancels out, so the scaling (transformation) happens only in the direction of the $x$ axis.

> "Cancels out" is a crude term that holds only for matrices whose eigen vectors are orthogonal (see below). Suppose the transformation is quantified by an array in which each value measures the change along one of the axes other than $x$. We have chosen the case where this array is, say, $[0, 0, 0, 0, 0]$; it could just as well be $[-1, 2, -3, 4, -2]$, since the sum of each array is $0$, but the former is easier to visualize.

### Eigen vectors of a symmetric matrix are orthogonal 

Let $A$ be a symmetric matrix, and let $x$ and $y$ be two of its eigen vectors with distinct corresponding eigen values $\lambda_1$ and $\lambda_2$.

$$\therefore \text{ } A x = \lambda_1 x  \tag{1}$$
Multiplying both sides on the left by $y^T$,
$$\therefore \text{ } y^T A x = \lambda_1 y^T x$$
Taking transpose on both sides,
$$\therefore \text{ } \left(y^T A x\right)^T = \lambda_1 \left(y^T x\right)^T$$

$$\therefore \text{ } x^T A^T y = \lambda_1 x^T y$$
Since $A$ is symmetric, we have $A = A^T$
$$\therefore \text{ } x^T A y = \lambda_1 x^T y \tag{2}$$

Also,
$$\therefore \text{ } A y = \lambda_2 y  \tag{3}$$
Multiplying both sides on the left by $x^T$,
$$\therefore \text{ } x^T A y = \lambda_2 x^T y \tag{4}$$
Subtracting $(4)$ from $(2)$ we get,
$$x^T y \cdot (\lambda_1 - \lambda_2) = 0$$
Since $\lambda_1 \neq \lambda_2$ , we have,
$$x^T y = 0$$

This implies that $y \cdot x = 0$ or that $x$ and $y$ are orthogonal vectors.
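This result can be observed numerically. The sketch below assumes NumPy and uses `np.linalg.eigh`, which is intended for symmetric (Hermitian) matrices; the matrix is an arbitrary symmetric example whose eigen values happen to be distinct.

```python
import numpy as np

# A symmetric example matrix (illustrative choice)
A = np.array([[2.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 4.0]])

# eigh returns eigen values in ascending order and eigen vectors as columns
eigenvalues, V = np.linalg.eigh(A)

# Dot products between every pair of eigen vectors
gram = V.T @ V

print(eigenvalues)
print(np.round(gram, 10))
```

The off-diagonal entries of `gram` are the dot products $x^T y$ between different eigen vectors, and they vanish up to floating-point error.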

## Eigen Decomposition

## Singular Value Decomposition

## Spectral norm