# Matrix Multiplication as Transformation

## Motivation

Vectors represent points or directions in space.<br>
In many problems, we want to transform vectors in a systematic way,
for example by scaling, rotating, or shearing them.

A matrix is not just a table of numbers.<br>
It represents a function that maps one vector to another while
preserving vector addition and scalar multiplication.

Understanding matrix multiplication as the composition of such
transformations explains why the multiplication rule is defined
the way it is.


## Key Idea: Matrices as Functions on Vectors

A matrix acts on a vector to produce a new vector.
This action represents a linear transformation of space.

Instead of asking how to multiply matrices,<br>
we first ask: what should a transformation do to vectors?


## What Should a Linear Transformation Do?

A transformation that acts on vectors should satisfy two natural properties:

1. **Additivity**  
   Transforming the sum of two vectors should be the same as summing their individual transformations.

2. **Homogeneity (scaling)**  
   Scaling a vector before transforming it should be the same as scaling the transformed vector.

Mathematically, a transformation T must satisfy, for all vectors $\mathbf{u}$, $\mathbf{v}$ and every scalar $c$:

$$
T(\mathbf{u} + \mathbf{v}) = T(\mathbf{u}) + T(\mathbf{v}), \quad
T(c\,\mathbf{v}) = c\,T(\mathbf{v})
$$

Any transformation satisfying these properties is called **linear**.
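
The two properties can be spot-checked numerically on sample vectors. The sketch below is an illustration rather than a proof, since it tests only one choice of vectors and scalar. The maps `shear` and `translate`, and the helpers `add`, `scale`, and `looks_linear`, are hypothetical names introduced here; vectors are plain Python tuples.

```python
# Vectors are 2-tuples; all names below are illustrative assumptions.
def add(u, v):
    return (u[0] + v[0], u[1] + v[1])

def scale(c, v):
    return (c * v[0], c * v[1])

def shear(v):
    # Horizontal shear: (x, y) -> (x + y, y). This map is linear.
    x, y = v
    return (x + y, y)

def translate(v):
    # Shift by (1, 0): (x, y) -> (x + 1, y). This map is not linear.
    x, y = v
    return (x + 1, y)

def looks_linear(T, u, v, c):
    """Check additivity and homogeneity for one choice of u, v, and c."""
    additive = T(add(u, v)) == add(T(u), T(v))
    homogeneous = T(scale(c, v)) == scale(c, T(v))
    return additive and homogeneous

u, v, c = (1, 2), (3, -1), 4
print(looks_linear(shear, u, v, c))      # True
print(looks_linear(translate, u, v, c))  # False
```

The translation fails the check because it moves the origin, and a linear transformation must send the zero vector to itself.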


## Why Basis Vectors Are Enough

Any vector in two-dimensional space can be written as a linear combination of the standard basis vectors:

$$
\mathbf{v} = x\,\mathbf{e}_1 + y\,\mathbf{e}_2
$$

where

$$
\mathbf{e}_1 = \begin{bmatrix}1 \\ 0\end{bmatrix}, \quad
\mathbf{e}_2 = \begin{bmatrix}0 \\ 1\end{bmatrix}
$$

If we know how a linear transformation acts on these basis vectors, then its action on every other vector is completely determined: by additivity and homogeneity, $T(\mathbf{v}) = x\,T(\mathbf{e}_1) + y\,T(\mathbf{e}_2)$.
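
To see this concretely, the sketch below takes a hypothetical linear map `T` (a 90° counterclockwise rotation, chosen here purely for illustration), records only its values on the two basis vectors, and rebuilds its action on another vector from those two values. The helper name `from_basis_images` is an assumption made for this example.

```python
def T(v):
    # Hypothetical example map: rotate 90 degrees counterclockwise,
    # (x, y) -> (-y, x).
    x, y = v
    return (-y, x)

# Images of the standard basis vectors.
Te1 = T((1, 0))  # (0, 1)
Te2 = T((0, 1))  # (-1, 0)

def from_basis_images(x, y):
    """Rebuild T(x*e1 + y*e2) as x*T(e1) + y*T(e2)."""
    return (x * Te1[0] + y * Te2[0], x * Te1[1] + y * Te2[1])

v = (3, 2)
print(T(v))                   # (-2, 3)
print(from_basis_images(*v))  # (-2, 3)
```

Both lines print the same vector: once `Te1` and `Te2` are known, the original function `T` is no longer needed to evaluate the transformation.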


## From Transformations to Matrices

Suppose a linear transformation T sends the basis vectors to:

$$
T(\mathbf{e}_1) = \mathbf{a}, \quad
T(\mathbf{e}_2) = \mathbf{b}
$$

We can represent this transformation by placing these images as columns
of a matrix:

$$
A = \begin{bmatrix} \mathbf{a} & \mathbf{b} \end{bmatrix}
$$

Applying the transformation to any vector
$\mathbf{v} = x\,\mathbf{e}_1 + y\,\mathbf{e}_2$ gives:

$$
T(\mathbf{v}) = x\,\mathbf{a} + y\,\mathbf{b}
$$

This is exactly what matrix–vector multiplication computes.
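
As a small worked example, the sketch below stores a 2×2 matrix as its two columns and computes the product in two ways: as the column combination $x\,\mathbf{a} + y\,\mathbf{b}$, and with the usual row-by-row formula. The helper names `matvec_columns` and `matvec_rows` and the sample numbers are assumptions made for this illustration.

```python
def matvec_columns(a, b, v):
    """Compute A v as x*a + y*b, where a and b are the columns of A."""
    x, y = v
    return (x * a[0] + y * b[0], x * a[1] + y * b[1])

def matvec_rows(a, b, v):
    """Compute A v entry by entry: each entry is a row of A dotted with v."""
    x, y = v
    row1 = (a[0], b[0])
    row2 = (a[1], b[1])
    return (row1[0] * x + row1[1] * y, row2[0] * x + row2[1] * y)

a = (2, 1)   # image of e1 (first column of A)
b = (-1, 3)  # image of e2 (second column of A)
v = (4, 5)

print(matvec_columns(a, b, v))  # (3, 19)
print(matvec_rows(a, b, v))     # (3, 19)
```

The two functions perform the same arithmetic in a different order, which is why the column view and the row-by-row rule always agree.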
