## Matrix theory

### Introduction.

Let $V$, $W$ be finite dimensional vector spaces. Let $T$ be a linear transformation from $V$ into $W$. 

$$ T:V \rightarrow W$$

Let $B_V=\{v_1,v_2,\ldots,v_n\}$ be an ordered basis of $V$, so that $\dim V=n$, and let $B_W=\{w_1,w_2,\ldots,w_m\}$ be an ordered basis of $W$, so that $\dim W=m$. We now define the matrix of the linear transformation $T$ with respect to these bases.

A linear transformation is completely determined by its action on the basis vectors. If we know $Tv_1, Tv_2,\ldots,Tv_n$ it is enough to completely determine $T$. 

Each of the vectors $Tv_1, Tv_2, \ldots, Tv_n$ is in turn determined by $m$ scalars, namely its coordinates with respect to $B_W$:

$\begin{aligned}
Tv_1&=a_{11} w_1 + a_{21}w_2 + \ldots + a_{m1}w_m\\
Tv_2&=a_{12} w_1 + a_{22}w_2 + \ldots + a_{m2}w_m\\
&\;\;\vdots\\
Tv_n&=a_{1n} w_1 + a_{2n}w_2 + \ldots + a_{mn}w_m
\end{aligned}$

That is,

$$Tv_j = \sum_{i=1}^{m}a_{ij}w_{i}$$

On the right-hand side, $i$ is the running (summation) index and $j$ is the free index, matching the $j$ in $Tv_j$. The matrix of the vector $Tv_j$ relative to the basis $B_W$ is the column vector whose entries are the coordinates of $Tv_j$ with respect to $B_W$.

$[Tv_j]_{B_W}=\begin{bmatrix}
a_{1j}\\
a_{2j}\\
\vdots\\
a_{mj}
\end{bmatrix}$

The matrix of the linear transformation $T$, which sends $x \in V$ with coordinates $(x_1,x_2,\ldots,x_n)$ with respect to $B_V$ to $T(x) \in W$ expressed in the basis $B_W$, is defined as the $m \times n$ matrix whose $j$th column is $[Tv_j]_{B_W}$:

$A = (a_{ij})=[T]_{B_V}^{B_W}=
\begin{bmatrix}
a_{11} & a_{12} & \ldots & a_{1n} \\
a_{21} & a_{22} & \ldots & a_{2n} \\
\vdots & \vdots & & \vdots\\
a_{m1} & a_{m2} & \ldots & a_{mn} \\
\end{bmatrix}
$

As an aid to remembering how $[T]_{B_V}^{B_W}$ is constructed from $T$, you might write the basis vectors $v_1,v_2,\ldots,v_n$ of the domain across the top and the basis vectors $w_1,w_2,\ldots,w_m$ of the target space along the right. In the matrix above, the $j$th column of $[T]_{B_V}^{B_W}$ consists of the scalars needed to write $Tv_j$ as a linear combination of the $w$'s. This picture reminds you that $Tv_j$ is recovered by multiplying each entry in the $j$th column by the corresponding $w$ from the right and then adding up the resulting vectors.

This is in conformity with the usual notation for writing a matrix.
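The column-by-column construction can be sketched in a few lines of Python. This is only an illustration under assumptions that are not part of the text above: $T$ is taken to be differentiation from polynomials of degree at most 2 to polynomials of degree at most 1, with the monomial bases $\{1, x, x^2\}$ and $\{1, x\}$, and the helper names `differentiate` and `matrix_of` are hypothetical.

```python
def differentiate(coeffs):
    """Derivative of c0 + c1*x + c2*x^2 + ..., given as the list [c0, c1, c2, ...]."""
    return [k * coeffs[k] for k in range(1, len(coeffs))]

def matrix_of(T, n, m):
    """Matrix of T w.r.t. the monomial bases: column j holds the coordinates of T(v_j)."""
    columns = []
    for j in range(n):
        v_j = [1 if k == j else 0 for k in range(n)]  # coordinates of the j-th basis vector
        image = T(v_j)                                # coordinates of T(v_j) in the target basis
        assert len(image) == m
        columns.append(image)
    # Turn the list of columns into a list of rows (m rows, n columns).
    return [[columns[j][i] for j in range(n)] for i in range(m)]

A = matrix_of(differentiate, n=3, m=2)
print(A)  # [[0, 1, 0], [0, 0, 2]]
```

Each column of `A` is the coordinate vector of the derivative of one basis polynomial: $1 \mapsto 0$, $x \mapsto 1$, $x^2 \mapsto 2x$.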

### The three important matrices.

Let $V$ and $W$ be finite dimensional vector spaces with ordered bases $B_V=\{v_1,v_2,\ldots,v_n\}$ and $B_W=\{w_1,w_2,\ldots,w_m\}$, so that $\dim V=n$ and $\dim W=m$. Suppose $T$ is a linear transformation:

$$T:V \rightarrow W$$

Strictly speaking, we have defined three matrices.

1) The matrix of the vector $x$ with respect to the basis $B_V$.

$$[x]_{B_V}$$

2) The matrix of the vector $T(x)$ with respect to the basis $B_W$.

$$[T(x)]_{B_W}$$

3) The matrix of the linear transformation $T$, with respect to the bases $B_V$ and $B_W$.

$$[T]_{B_V}^{B_W}$$

How are these matrices related? We will derive a relationship between the three. That relationship explains the statement that "any linear transformation between finite dimensional vector spaces is like multiplying the vector $x$ by the matrix $A$".
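For orientation, the standard form of this relationship is $[T(x)]_{B_W}=[T]_{B_V}^{B_W}\,[x]_{B_V}$. The sketch below checks it numerically on one example. The example itself is an assumption made for illustration: $T$ is differentiation on polynomials of degree at most 2 with the monomial bases, and the helper names `differentiate` and `mat_vec` are hypothetical.

```python
def differentiate(coeffs):
    """Derivative of c0 + c1*x + c2*x^2, given as the coordinate list [c0, c1, c2]."""
    return [k * coeffs[k] for k in range(1, len(coeffs))]

def mat_vec(A, x):
    """Ordinary matrix-vector product, with A stored as a list of rows."""
    return [sum(a * x_k for a, x_k in zip(row, x)) for row in A]

# Matrix of differentiation w.r.t. the bases {1, x, x^2} and {1, x}:
# its columns are the coordinates of the derivatives of 1, x and x^2.
A = [[0, 1, 0],
     [0, 0, 2]]

x_coords = [3, 5, -2]               # [x]_{B_V} for the polynomial 3 + 5x - 2x^2
direct = differentiate(x_coords)    # [T(x)]_{B_W}, computed from T itself
via_matrix = mat_vec(A, x_coords)   # [T]_{B_V}^{B_W} [x]_{B_V}

print(direct)      # [5, -4]
print(via_matrix)  # [5, -4]
```

Both routes give the coordinates of $5 - 4x$, the derivative of $3 + 5x - 2x^2$.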

### Addition of matrices.

Theorem 1. Let $S,T\in\mathcal{L}(V,W)$ be two linear transformations from $V$ into $W$. Then,

$$[S+T]_{B_V}^{B_W}=[S]_{B_V}^{B_W}+[T]_{B_V}^{B_W}$$

Proof.

Let $S(v_j)=\sum_{i=1}^{m}a_{ij}w_i$ and $T(v_j)=\sum_{i=1}^{m}b_{ij}w_i$ for $j=1,2,\ldots,n$.

Addition in $\mathcal{L}(V,W)$ is defined pointwise, so

$(S+T)(v_j)=S(v_j)+T(v_j)$

Further,

$\begin{aligned}
S(v_j)+T(v_j) &= \sum_{i=1}^{m}a_{ij}w_i + \sum_{i=1}^{m}b_{ij}w_i \\
              &= \sum_{i=1}^{m}(a_{ij}+b_{ij})w_i
\end{aligned}$

So,

$[(S+T)(v_j)]_{B_W}=
\begin{bmatrix}
a_{1j}+b_{1j}\\
a_{2j}+b_{2j}\\
\vdots\\
a_{mj}+b_{mj}
\end{bmatrix}=
\begin{bmatrix}
a_{1j}\\
a_{2j}\\
\vdots\\
a_{mj}
\end{bmatrix}+
\begin{bmatrix}
b_{1j}\\
b_{2j}\\
\vdots\\
b_{mj}
\end{bmatrix}=
[S(v_j)]_{B_W}+[T(v_j)]_{B_W}
$

This holds for every $j=1,2,\ldots,n$, and the $j$th column of $[S+T]_{B_V}^{B_W}$ is $[(S+T)(v_j)]_{B_W}$. It follows that:

$$[S+T]_{B_V}^{B_W}=[S]_{B_V}^{B_W}+[T]_{B_V}^{B_W}$$
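As a sanity check of Theorem 1, the sketch below builds the matrix of a pointwise sum and compares it with the entrywise sum of the two matrices. The maps `S` and `T` on $\mathbb{R}^2$, the use of the standard bases, and the helper `matrix_of` are all assumptions chosen for illustration.

```python
def matrix_of(T, n):
    """Matrix of T: R^n -> R^m w.r.t. the standard bases; column j is T(e_j)."""
    columns = [T([1 if k == j else 0 for k in range(n)]) for j in range(n)]
    m = len(columns[0])
    return [[columns[j][i] for j in range(n)] for i in range(m)]

def S(v):
    x, y = v
    return [x + 2 * y, 3 * y]

def T(v):
    x, y = v
    return [-x, x + y]

def S_plus_T(v):
    # Pointwise sum: (S + T)(v) = S(v) + T(v)
    return [s + t for s, t in zip(S(v), T(v))]

lhs = matrix_of(S_plus_T, 2)
rhs = [[a + b for a, b in zip(row_s, row_t)]
       for row_s, row_t in zip(matrix_of(S, 2), matrix_of(T, 2))]

print(lhs)         # [[0, 2], [1, 4]]
print(lhs == rhs)  # True
```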

### Scalar Multiplication.

Theorem 2. Let $T\in \mathcal{L}(V,W)$ be a linear transformation from $V$ into $W$, and let $c$ be a scalar. Then,

$$[cT]_{B_V}^{B_W}=c[T]_{B_V}^{B_W}$$

Proof.

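Let $T(v_j)=\sum_{i=1}^{m}a_{ij}w_i$. Scalar multiplication in $\mathcal{L}(V,W)$ is defined pointwise, so $(cT)(v_j)=c\,T(v_j)$ and

$\begin{aligned}
cT(v_j)&=c\sum_{i=1}^{m}a_{ij}w_i\\
       &=\sum_{i=1}^{m}(ca_{ij})w_i
\end{aligned}$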

Therefore,

$[cT(v_j)]_{B_W}=
\begin{bmatrix}
ca_{1j}\\
ca_{2j}\\
\vdots\\
ca_{mj}
\end{bmatrix}=
c\begin{bmatrix}
a_{1j}\\
a_{2j}\\
\vdots\\
a_{mj}
\end{bmatrix}=
c[T(v_j)]_{B_W}
$

Since this holds for every column $j$, we have:

$$[cT]_{B_V}^{B_W}=c[T]_{B_V}^{B_W}$$
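Theorem 2 can be checked the same way. In this sketch the map `T` from $\mathbb{R}^3$ to $\mathbb{R}^2$, the scalar `c`, the standard bases, and the helper `matrix_of` are assumptions made only for the example.

```python
def matrix_of(T, n):
    """Matrix of T: R^n -> R^m w.r.t. the standard bases; column j is T(e_j)."""
    columns = [T([1 if k == j else 0 for k in range(n)]) for j in range(n)]
    m = len(columns[0])
    return [[columns[j][i] for j in range(n)] for i in range(m)]

def T(v):
    x, y, z = v
    return [x - z, 2 * y]

c = -2

def c_times_T(v):
    # Pointwise scalar multiple: (cT)(v) = c * T(v)
    return [c * t for t in T(v)]

lhs = matrix_of(c_times_T, 3)
rhs = [[c * a for a in row] for row in matrix_of(T, 3)]

print(lhs)         # [[-2, 0, 2], [0, -4, 0]]
print(lhs == rhs)  # True
```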

### Product of two matrices.

Consider the linear maps $S:U \rightarrow V$ and $T:V\rightarrow W$. The composition $TS$ is a linear map from $U$ to $W$. How can the matrix $[TS]_{B_U}^{B_W}$ be computed from $[T]$