## 3b. Matrices and Linear Transformations

### 3.1. What can we do with matrices?

Let's say you have a $3\times 4$ matrix $M$ and a $4\times 1$ vector $v$. You then carry out the following multiplication

$$ u = Mv $$

What's the dimension of $u$? It is a $3\times 1$ vector.

So you can view the above multiplication as a *transformation*: $M$ has *transformed* the 4-dimensional vector $v$ into the 3-dimensional vector $u$. In fact, if you give me any 4-dimensional vector $v_i$, I can feed that vector into $M$ (i.e. perform the multiplication $Mv_i$) and get an output $u_i$, which is a 3-dimensional vector. So we can think of $M$ as a machine that takes an input in 4-dimensional space and gives an output in 3-dimensional space.

We say $M$ is implementing a *linear transformation*. It is transforming input vectors to output vectors.
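
To make the "machine" picture concrete, here is a minimal NumPy sketch. The entries of `M` and `v` are made-up values chosen only for illustration; any $3\times 4$ matrix and $4\times 1$ vector would behave the same way as far as shapes go.

```python
import numpy as np

# Hypothetical values: any 3x4 matrix and 4x1 vector would do.
M = np.arange(12, dtype=float).reshape(3, 4)   # 3x4 matrix
v = np.array([[1.0], [2.0], [3.0], [4.0]])     # 4x1 column vector

u = M @ v          # matrix-vector multiplication
print(u.shape)     # (3, 1): the 4-dimensional input became a 3-dimensional output
```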





### 3.2. Some examples

**Example 1**: Let's take the matrix 

$$
M_1 = \begin{bmatrix}
	5 & 0  \\
	0 & 5
\end{bmatrix}
$$

and see what it does to its inputs.


$$ \begin{bmatrix}
	5 & 0  \\
	0 & 5
\end{bmatrix} \begin{bmatrix}
	3 \\
    2
\end{bmatrix}  = \begin{bmatrix}
	15 \\
    10
\end{bmatrix}$$

$$ \begin{bmatrix}
	5 & 0  \\
	0 & 5
\end{bmatrix} \begin{bmatrix}
	-4 \\
    1
\end{bmatrix}  = \begin{bmatrix}
	-20 \\
    5
\end{bmatrix}$$

$$ \begin{bmatrix}
	5 & 0  \\
	0 & 5
\end{bmatrix} \begin{bmatrix}
	0 \\
    -1
\end{bmatrix}  = \begin{bmatrix}
	0 \\
    -5
\end{bmatrix}$$

What do you notice? Each output vector is the input vector scaled by a factor of 5. So $M_1$ transforms its inputs by scaling them by a factor of 5.

**Example 2**: What about this one 

$$
M_2 = \begin{bmatrix}
	-2 & 0  \\
	0 & 3
\end{bmatrix}
$$ 

Again, let's try feeding it a few inputs

$$ \begin{bmatrix}
	-2 & 0  \\
	0 & 3
\end{bmatrix} \begin{bmatrix}
	3 \\
    2
\end{bmatrix}  = \begin{bmatrix}
	-6 \\
    6
\end{bmatrix}$$

$$ \begin{bmatrix}
	-2 & 0  \\
	0 & 3
\end{bmatrix} \begin{bmatrix}
	-4 \\
    1
\end{bmatrix}  = \begin{bmatrix}
	8 \\
    3
\end{bmatrix}$$

$$ \begin{bmatrix}
	-2 & 0  \\
	0 & 3
\end{bmatrix} \begin{bmatrix}
	0 \\
    -1
\end{bmatrix}  = \begin{bmatrix}
	0 \\
    -3
\end{bmatrix}$$



The effect of $M_2$ is scaling the first dimension by -2 and the second dimension by 3.

From these two examples, you can see that:

**Diagonal matrices have a scaling effect on their inputs. In other words, they implement a *scaling* transformation on the inputs they get.**
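
If you want to check the scaling effect numerically, the sketch below rebuilds $M_1$ and $M_2$ with `np.diag` and feeds them the same three inputs used above. The variable names (`inputs`, `out1`, `out2`) are just illustrative choices.

```python
import numpy as np

M1 = np.diag([5.0, 5.0])     # scales both dimensions by 5
M2 = np.diag([-2.0, 3.0])    # scales dimension 1 by -2 and dimension 2 by 3

# The three example inputs, stored one per row.
inputs = np.array([[3.0, 2.0],
                   [-4.0, 1.0],
                   [0.0, -1.0]])

# Row i of (inputs @ M.T) equals M @ inputs[i].
out1 = inputs @ M1.T
out2 = inputs @ M2.T

# Multiplying by a diagonal matrix is the same as scaling each coordinate
# by the matching diagonal entry.
print(np.array_equal(out1, 5.0 * inputs))
print(np.array_equal(out2, inputs * np.array([-2.0, 3.0])))
```

Both comparisons should agree, because a diagonal matrix never mixes one coordinate into another.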


### 3.3. More examples: Orthonormal matrices

OK, let's see what orthonormal matrices do to their inputs.

Consider this matrix

$$
M_3 = \begin{bmatrix}
	0 & 1  \\
	-1 & 0 
\end{bmatrix}
$$


Make sure you know why this is an orthonormal matrix (hint: its columns have unit length and are perpendicular to each other). Let's see what $M_3$ does.

$$ \begin{bmatrix}
	0 & 1  \\
	-1 & 0
\end{bmatrix} \begin{bmatrix}
	3 \\
    2
\end{bmatrix}  = \begin{bmatrix}
	2 \\
    -3
\end{bmatrix}$$

$$ \begin{bmatrix}
	0 & 1  \\
	-1 & 0
\end{bmatrix} \begin{bmatrix}
	-4 \\
    1
\end{bmatrix}  = \begin{bmatrix}
	1 \\
    4
\end{bmatrix}$$

$$ \begin{bmatrix}
	0 & 1  \\
	-1 & 0
\end{bmatrix} \begin{bmatrix}
	0 \\
    -1
\end{bmatrix}  = \begin{bmatrix}
	-1 \\
    0
\end{bmatrix}$$

The effect might not be obvious from the numbers alone. But if you draw the input and output vectors, you will see that each output vector is the input vector *rotated by 90 degrees clockwise*. For example, $(3, 2)$ in the first quadrant is sent to $(2, -3)$ in the fourth quadrant.
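
If you would rather not draw the vectors, one way to check the direction of the rotation is to compare $M_3$ with the standard 2-D rotation matrix, which rotates counter-clockwise when the angle is positive. The helper name `rotation` below is an illustrative choice, not something defined elsewhere in these notes.

```python
import numpy as np

def rotation(theta_deg):
    """Standard 2-D rotation matrix; positive angles rotate counter-clockwise."""
    t = np.deg2rad(theta_deg)
    return np.array([[np.cos(t), -np.sin(t)],
                     [np.sin(t),  np.cos(t)]])

M3 = np.array([[0.0, 1.0],
               [-1.0, 0.0]])

# M3 matches a rotation by -90 degrees, i.e. a quarter turn clockwise.
matches_clockwise = np.allclose(M3, rotation(-90))
matches_counter_clockwise = np.allclose(M3, rotation(90))
print(matches_clockwise, matches_counter_clockwise)  # True False
```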

Now, a more involved example.

$$
M_4 = \begin{bmatrix}
	\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}}  \\
	\frac{1}{\sqrt{2}} & \frac{-1}{\sqrt{2}} 
\end{bmatrix}
$$

Let's see what it does.

$$ \begin{bmatrix}
	\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}}  \\
	\frac{1}{\sqrt{2}} & \frac{-1}{\sqrt{2}}
\end{bmatrix} \begin{bmatrix}
	0 \\
    -1
\end{bmatrix}  = \begin{bmatrix}
	\frac{-1}{\sqrt{2}} \\
    \frac{1}{\sqrt{2}}
\end{bmatrix}$$

$$ \begin{bmatrix}
	\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}}  \\
	\frac{1}{\sqrt{2}} & \frac{-1}{\sqrt{2}}
\end{bmatrix} \begin{bmatrix}
	1 \\
    0
\end{bmatrix}  = \begin{bmatrix}
	\frac{1}{\sqrt{2}} \\
    \frac{1}{\sqrt{2}}
\end{bmatrix}$$

$$ \begin{bmatrix}
	\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}}  \\
	\frac{1}{\sqrt{2}} & \frac{-1}{\sqrt{2}}
\end{bmatrix} \begin{bmatrix}
	-3 \\
    3
\end{bmatrix}  = \begin{bmatrix}
	0 \\
    -3\sqrt{2}
\end{bmatrix}$$

This is a *reflection* about the straight line that makes an angle of 22.5 degrees with the x-axis.
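
Where does 22.5 degrees come from? A reflection about the line through the origin at angle $\theta$ to the x-axis has the matrix $\begin{bmatrix} \cos 2\theta & \sin 2\theta \\ \sin 2\theta & -\cos 2\theta \end{bmatrix}$, and $M_4$ has this form with $2\theta = 45$ degrees. The sketch below checks this numerically; `reflection`, `on_line` and `perp` are illustrative names.

```python
import numpy as np

def reflection(theta_deg):
    """Reflection about the line through the origin at angle theta to the x-axis."""
    t = np.deg2rad(theta_deg)
    return np.array([[np.cos(2 * t),  np.sin(2 * t)],
                     [np.sin(2 * t), -np.cos(2 * t)]])

M4 = np.array([[1.0, 1.0],
               [1.0, -1.0]]) / np.sqrt(2)

t = np.deg2rad(22.5)
on_line = np.array([np.cos(t), np.sin(t)])   # unit vector along the mirror line
perp = np.array([-np.sin(t), np.cos(t)])     # unit vector perpendicular to it

print(np.allclose(M4, reflection(22.5)))     # True
print(np.allclose(M4 @ on_line, on_line))    # True: vectors on the line stay put
print(np.allclose(M4 @ perp, -perp))         # True: perpendicular vectors flip
```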


So we see that 

**Orthonormal matrices implement transformations that do not change the lengths of their inputs**.
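
Here is a small numerical check of that claim, using the example inputs from this section. It also tests the condition $M^T M = I$, which is one standard way to verify that a matrix is orthonormal. The diagonal matrix $M_2$ is included for contrast; the helper names are illustrative.

```python
import numpy as np

M2 = np.diag([-2.0, 3.0])                                # diagonal, not orthonormal
M3 = np.array([[0.0, 1.0], [-1.0, 0.0]])                 # rotation
M4 = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2)    # reflection

# Example inputs, one vector per column.
X = np.array([[3.0, -4.0, 0.0],
              [2.0,  1.0, -1.0]])

def is_orthonormal(M):
    """True if the columns of M are unit length and mutually perpendicular."""
    return np.allclose(M.T @ M, np.eye(M.shape[1]))

def preserves_lengths(M, X):
    """True if every column of X has the same length before and after applying M."""
    return np.allclose(np.linalg.norm(M @ X, axis=0), np.linalg.norm(X, axis=0))

for name, M in [("M2", M2), ("M3", M3), ("M4", M4)]:
    print(name, is_orthonormal(M), preserves_lengths(M, X))
```

Only $M_3$ and $M_4$ should pass both checks; $M_2$ stretches its inputs, so it fails them.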

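Orthonormal matrices (more commonly called *orthogonal matrices*) are sometimes loosely called *rotation matrices*, since we can think of a reflection as a "special" kind of rotation. Strictly speaking, though, a rotation has determinant $+1$ and a reflection has determinant $-1$.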


### 3.4. Properties of linear transformations

(Feel free to skip this section)

Here are some interesting properties of linear transformations. Can you prove them?

Let $M$ be a matrix.

* If $M$ transforms $x$ to $y$, then it will transform $ax$ to $ay$, where $a$ is a constant scalar.

* If $M$ transforms $x_1$ to $y_1$ and $x_2$ to $y_2$, then it will transform $x_1 + x_2$ to $y_1+y_2$.

*Proof*:

The proof follows directly from the definition:

* If $Mx = y$, then $M(ax) = aMx = ay$
* If $Mx_1 = y_1$ and $Mx_2 = y_2$, then $M(x_1+x_2) = Mx_1+Mx_2 = y_1+y_2$.
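
A numerical spot check is not a proof, but it can help build intuition for the two properties. The sketch below uses a randomly generated matrix and vectors; the seed, the shapes and the scalar `a` are arbitrary choices made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)      # fixed seed so the run is repeatable
M = rng.normal(size=(3, 4))         # arbitrary 3x4 matrix
x1 = rng.normal(size=4)             # arbitrary 4-dimensional inputs
x2 = rng.normal(size=4)
a = 2.5                             # arbitrary scalar

# Property 1: scaling the input scales the output by the same factor.
scaling_holds = np.allclose(M @ (a * x1), a * (M @ x1))

# Property 2: the output of a sum is the sum of the outputs.
addition_holds = np.allclose(M @ (x1 + x2), M @ x1 + M @ x2)

print(scaling_holds, addition_holds)  # True True
```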

Here's another interesting fact, which is the converse of the previous statement.

*Any transformation $T$ that satisfies the following two properties is a linear transformation (can be implemented by a matrix $M$)*

1. If $T$ transforms $x$ to $y$, then $T$ transforms $ax$ to $ay$, where $a$ is a constant scalar.

2. If $T$ transforms $x_1$ to $y_1$ and $x_2$ to $y_2$, then $T$ transforms $x_1 + x_2$ to $y_1+y_2$.
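
For transformations between finite-dimensional spaces, the matrix can be found directly: column $j$ of $M$ is the output of $T$ on the $j$-th standard basis vector. The sketch below illustrates this with a made-up transformation `T` from 2 to 3 dimensions; both `T` and the helper `matrix_of` are hypothetical names introduced only for this example.

```python
import numpy as np

def T(x):
    """Hypothetical transformation from 2-D to 3-D, written without a matrix."""
    return np.array([2 * x[0] + x[1],
                     x[0] - 3 * x[1],
                     4 * x[1]])

def matrix_of(T, n):
    """Build the matrix of T by stacking T(e_1), ..., T(e_n) as columns."""
    basis = np.eye(n)   # row j of the identity is the j-th standard basis vector
    return np.column_stack([T(e) for e in basis])

M = matrix_of(T, 2)
print(M)

x = np.array([3.0, 2.0])
print(np.allclose(M @ x, T(x)))  # True: M reproduces T on this input
```

This works because any input is a weighted sum of the basis vectors, so the two properties above determine the output of $T$ everywhere once its outputs on the basis vectors are known.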
