In [1]:
%matplotlib inline
import numpy as np
from IPython.display import HTML
HTML('<link href="https://fonts.googleapis.com/css?family=Cabin|Quicksand" rel="stylesheet"><style>.container{width:90% !important; font-family: "Cabin", sans-serif;}em{color: red !important;}</style><style>.output_png {display: table-cell;text-align: center;vertical-align: middle;}</style>')

In [1]:
from sympy import init_printing 
from sympy import Matrix
init_printing(use_latex=True)
def out(mat, n=2): return Matrix(np.round(mat, decimals=n))

# Change of Basis: intuition
  

<center><img src="img/change_of_basis.png" width=500></center>



# Change of Basis: intuition
  
- (blue) basis 1 = $\{u_1, u_2\}$
- coordinates of vector **x** in basis 1 = $(x_1, x_2)$
- meaning: **x** = $x_1 \cdot u_1 + x_2 \cdot u_2$

- (red) basis 2 = $\{v_1, v_2\}$
- coordinates of vector **x** in basis 2 = $(x'_1, x'_2)$
- meaning: **x** = $x'_1 \cdot v_1 + x'_2 \cdot v_2$

- Question: given the coordinates of **x** in basis 1 $(x_1, x_2)$, how can we get the coordinates of **x** in basis 2?



# Change of Basis: basis
  

- let $V$ be a vector space 
- let $S=\{v_1, v_2, ..., v_n\}$ be a set of vectors in $V$
- $S$ forms a basis for $V$ if:
  - $S$ is linearly independent
  - $S$ spans $V$
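
For $n$ vectors in $\mathbb{R}^n$, both conditions hold exactly when the matrix with those vectors as columns has rank $n$. The cell below is a minimal sketch of that check; the helper name `is_basis` and the example vectors are illustrative assumptions, not part of the original material.

```python
import numpy as np

def is_basis(vectors, dim):
    """Return True if `vectors` form a basis of R^dim (hypothetical helper)."""
    A = np.column_stack(vectors)  # one vector per column
    # A basis of R^dim needs exactly `dim` vectors and full rank:
    # rank == dim means the columns are independent and span R^dim.
    # matrix_rank uses a numerical tolerance, so this is a floating-point test.
    return bool(A.shape == (dim, dim) and np.linalg.matrix_rank(A) == dim)

print(is_basis([np.array([1.0, 0.0]), np.array([1.0, 1.0])], 2))  # independent pair
print(is_basis([np.array([1.0, 2.0]), np.array([2.0, 4.0])], 2))  # collinear pair
```

The first pair is a basis of $\mathbb{R}^2$; the second is not, because the two vectors are collinear.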

# Change of Basis: coordinates
  
- if $S=\{v_1, v_2, ..., v_n\}$ is a basis for $V$
- then every vector $v \in V$ can be expressed *uniquely* 
- as a linear combination of $v_1, v_2, ... v_n$:

$$ v=c_1 v_1+c_2 v_2 + ... + c_n v_n$$

- $ \begin{bmatrix} c_1 \\ c_2 \\ \vdots \\ c_n \end{bmatrix}$ are the *coordinates* of $v$ relative to the basis $S$
- if $V$ has dimension $n$, then every set of $n$ linearly independent vectors in $V$ is a valid basis for $V$
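
Finding the coordinates $c_1, \dots, c_n$ amounts to solving a linear system whose coefficient matrix has the basis vectors as columns. The following sketch assumes a hypothetical basis $S=\{v_1, v_2\}$ of $\mathbb{R}^2$ and a vector $v$ chosen only for illustration.

```python
import numpy as np

# Hypothetical basis S = {v1, v2} of R^2, written in standard coordinates.
v1 = np.array([1.0, 1.0])
v2 = np.array([-1.0, 1.0])
S = np.column_stack([v1, v2])

v = np.array([3.0, 1.0])

# The coordinates c satisfy S @ c = v; the solution is unique because
# the columns of S are linearly independent.
c = np.linalg.solve(S, v)
print(c)

# Rebuild v from its coordinates: v = c_1 v_1 + c_2 v_2
print(c[0] * v1 + c[1] * v2)
```

Here $v = 2 v_1 - v_2$, so the coordinates of $v$ relative to $S$ are $(2, -1)$ up to floating-point error.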

# Change of Basis: coordinates
  
- let $B=\{u,w\}$ and $B'=\{u',w'\}$ be two bases for $\mathbb {R}^2$
- consider $v \in \mathbb{R}^2$
- the representation of $v$ in basis $B$ has coordinates $[v]_B$ 
- the representation of $v$ in basis $B'$ has coordinates $[v]_{B'}$

- we are going to view $B'$ as the original representation
  - and $B$ as the new representation
- we start from the coordinate representation $[v]_{B'}$ in basis $B'$ and we want to find/compute the coordinate representation $[v]_{B}$ in basis $B$

# Change of Basis: coordinates
  
- suppose the basis vectors $u'$ and $w'$ for $B'$ (original) have the following coordinates relative to the basis $B$ (new):

 $[u']_B = \begin{bmatrix} a \\ b \end{bmatrix}$  

 $[w']_B = \begin{bmatrix} c \\ d \end{bmatrix}$  


- This means that $u' = au+bw$  and $w' = cu+dw$ 
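
As a concrete illustration, the numbers $a, b, c, d$ can be computed by solving one linear system per basis vector. The bases below are hypothetical examples given in standard coordinates, and the names `u_p`, `w_p` stand for $u'$, $w'$.

```python
import numpy as np

# Hypothetical bases of R^2, given in standard coordinates.
u, w = np.array([1.0, 0.0]), np.array([1.0, 1.0])      # new basis B
u_p, w_p = np.array([2.0, 1.0]), np.array([0.0, 1.0])  # original basis B'

B = np.column_stack([u, w])

a, b = np.linalg.solve(B, u_p)  # [u']_B: solves a*u + b*w = u'
c, d = np.linalg.solve(B, w_p)  # [w']_B: solves c*u + d*w = w'

print(a, b, c, d)
```

For these bases, $u' = u + w$ and $w' = -u + w$, so $a = b = d = 1$ and $c = -1$ up to floating-point error.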

# Change of Basis:  change of coordinates matrix

- claim: the change of basis transformation $[v]_{B}=T([v]_{B'})$ is a **linear transformation** represented by a matrix $P$
  
- we define the *change of coordinates matrix* from $B'$ to $B$
$$P ={\begin{pmatrix} a & c \\ b & d \\ \end{pmatrix}}$$

- the change of basis transformation is obtained by
$$[v]_B = P [v]_{B'} = {\begin{pmatrix} a & c \\ b & d \\ \end{pmatrix}} [v]_{B'}$$

# Change of Basis:  change of coordinates matrix
  
in words:
- to change bases:
  - if we know the coordinates of $v$ relative to the (original) basis $B'$
  - we can multiply this vector by the change of coordinates matrix $P$
- this gives us the coordinates of $v$ relative to the (new) basis $B$
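
The recipe can be sketched numerically. The matrix `P` below is a hypothetical example with $a = b = d = 1$ and $c = -1$, which corresponds to the assumed bases $B = \{(1,0), (1,1)\}$ and $B' = \{(2,1), (0,1)\}$ in standard coordinates.

```python
import numpy as np

# Hypothetical bases in standard coordinates (one basis vector per column).
B = np.array([[1.0, 1.0],
              [0.0, 1.0]])   # columns: u, w
Bp = np.array([[2.0, 0.0],
               [1.0, 1.0]])  # columns: u', w'

# Change of coordinates matrix from B' to B: columns are [u']_B and [w']_B.
P = np.array([[1.0, -1.0],
              [1.0,  1.0]])

v_Bp = np.array([3.0, 2.0])  # coordinates of v relative to B'
v_B = P @ v_Bp               # coordinates of v relative to B
print(v_B)

# Both coordinate vectors describe the same point in standard coordinates.
print(Bp @ v_Bp, B @ v_B)
```

With these values $[v]_B = (1, 5)$, and both representations give the same vector $(6, 5)$ in standard coordinates.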

# Change of Basis:  change of coordinates matrix
 
$[u']_B = \begin{bmatrix} a \\ b \end{bmatrix}$ 
and $[w']_B = \begin{bmatrix} c \\ d \end{bmatrix}$  

$$P ={\begin{pmatrix} a & c \\ b & d \\ \end{pmatrix}}$$

Note that we only need the coordinates of the original basis vectors in the new basis:
- the columns of $P$ are the original basis vectors $u'$ and $w'$ expressed in the new basis $B$

# Change of Basis: explanation
  
- say $v = {\begin{bmatrix} x' \\ y' \end{bmatrix}}_{B'}$ relative to the basis $B' = \{u', w'\}$
- this means that $v=x' u' + y' w' $
- we said that vectors $u'$ and $w'$ for $B'$ have the following coordinates relative to the basis $B$:

 $[u']_B = \begin{bmatrix} a \\ b \end{bmatrix}$  and  $[w']_B = \begin{bmatrix} c \\ d \end{bmatrix}$  


- i.e. that $u' = au+bw$  and $w' = cu+dw$  
- so substituting $u',w'$ in $v=x' u' + y' w' $ we get:

# Change of Basis: explanation
  
$$v = x'(au+bw)+y'(cu+dw)$$ 

$$v =(ax'+cy')u+(bx'+dy')w $$ 

that is:

$$ [v]_B = \begin{bmatrix} ax'+cy' \\ bx'+dy' \end{bmatrix}$$

$$ [v]_B = \begin{bmatrix} a & c \\ b & d \end{bmatrix} \begin{bmatrix} x'\\ y' \end{bmatrix}$$

$$ [v]_B = \begin{bmatrix} a & c \\ b & d \end{bmatrix} [v]_{B'}$$
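
The algebra above can be double-checked symbolically with SymPy. This is only a sketch: it treats $u$ and $w$ as formal commuting symbols (an assumption made for bookkeeping) and uses the names `x_p`, `y_p` for $x'$, $y'$.

```python
from sympy import symbols, Matrix, expand

a, b, c, d, x_p, y_p, u, w = symbols("a b c d x_p y_p u w")

# v = x' u' + y' w'  with  u' = a u + b w  and  w' = c u + d w
v = expand(x_p * (a * u + b * w) + y_p * (c * u + d * w))

# Coordinates read off as the coefficients of u and w ...
coords_from_expansion = Matrix([v.coeff(u), v.coeff(w)])
# ... and coordinates obtained from the matrix product P [v]_{B'}
coords_from_matrix = Matrix([[a, c], [b, d]]) * Matrix([x_p, y_p])

difference = (coords_from_expansion - coords_from_matrix).expand()
print(difference)
```

The difference is the zero vector, so both routes give the same $[v]_B$.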

# Change of Basis: inverse transformation
  
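- the change of coordinates matrix $P$ (also called the transition matrix) is invertible: its columns are the coordinates of the linearly independent vectors $u'$ and $w'$
- if $P$ is the change of coordinates matrix from $B'$ to $B$
- then $P^{-1}$ is the change of coordinates matrix from $B$ to $B'$
$$ [v]_{B'}=P^{-1}[v]_B $$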
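
A short numerical sketch of the round trip, using a hypothetical matrix `P` chosen only for illustration:

```python
import numpy as np

# Hypothetical change of coordinates matrix from B' to B.
P = np.array([[1.0, -1.0],
              [1.0,  1.0]])
P_inv = np.linalg.inv(P)  # change of coordinates matrix from B to B'

v_B = np.array([1.0, 5.0])
v_Bp = P_inv @ v_B        # [v]_{B'} = P^{-1} [v]_B
print(v_Bp)

# Going back with P recovers the coordinates we started from.
print(P @ v_Bp)
```

When only a few vectors need converting, `np.linalg.solve(P, v_B)` gives the same result without forming the inverse explicitly.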

# Change of Basis: change of representation of a linear transformation 
  
- a linear transformation $T$ can be represented as a matrix $M$ w.r.t. the basis $B$
- how can the same linear transformation $T$ be represented as a matrix $M'$ w.r.t. the basis $B'$?

$$ M'=P^{-1} M P $$

- M and M' are **similar** matrices
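
Similar matrices represent the same transformation, so they share basis-independent quantities such as the trace, the determinant, and the eigenvalues. The sketch below checks this for hypothetical matrices `M` and `P` picked for illustration.

```python
import numpy as np

# Hypothetical example: M represents T w.r.t. B, P changes coordinates B' -> B.
M = np.array([[2.0, 1.0],
              [0.0, 3.0]])
P = np.array([[1.0, -1.0],
              [1.0,  1.0]])

M_p = np.linalg.inv(P) @ M @ P  # representation of T w.r.t. B'
print(M_p)

# Quantities that do not depend on the choice of basis.
print(np.trace(M), np.trace(M_p))
print(np.linalg.det(M), np.linalg.det(M_p))
print(np.sort(np.linalg.eigvals(M)), np.sort(np.linalg.eigvals(M_p)))
```

Here both matrices have trace 5, determinant 6, and eigenvalues 2 and 3, up to floating-point error.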

# Change of Basis: change of representation of a linear transformation 
  
$$ M'=P^{-1} M P $$

- why? 

$$ w=T(v)$$

$$ [w]_{B}=M[v]_{B} $$

- substituting $[w]_B = P[w]_{B'}$ and $[v]_B = P[v]_{B'}$:

$$ P[w]_{B'}=MP[v]_{B'} $$

$$ [w]_{B'}=P^{-1}MP[v]_{B'} $$

$$ [w]_{B'}=M'[v]_{B'} $$
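
The chain of equalities can be checked numerically: applying $T$ in basis $B'$ with $M'$ and then converting to $B$ should agree with converting first and applying $M$. The matrices and the vector below are hypothetical values used only for this check.

```python
import numpy as np

# Hypothetical example: M represents T w.r.t. B, P changes coordinates B' -> B.
M = np.array([[2.0, 0.0],
              [0.0, 1.0]])
P = np.array([[1.0, -1.0],
              [1.0,  1.0]])
M_p = np.linalg.inv(P) @ M @ P  # M' = P^{-1} M P

v_Bp = np.array([3.0, 2.0])     # [v]_{B'}

# Route 1: convert to B, then apply M.
w_B = M @ (P @ v_Bp)            # [w]_B = M P [v]_{B'}

# Route 2: apply M' in B', then convert the result to B.
w_Bp = M_p @ v_Bp               # [w]_{B'} = M' [v]_{B'}
print(w_B, P @ w_Bp)
```

Both routes give $[w]_B = (2, 5)$ for these values, which is what $P[w]_{B'} = MP[v]_{B'}$ states.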
