# Linear Transformations


The link between linear algebra and differentiation is that we view differentiation as the way to get the best linear approximation to a function at a given point.

Definition: A **linear transformation** or **linear map** is a function

$T: R^n \to R^m$ that satisfies the following conditions for all $\vec{v}, \vec{w} \in R^n$ and all scalars $c$:

- $T(\vec{v} + \vec{w}) = T(\vec{v}) + T(\vec{w})$
- $T(c\vec{v}) = cT(\vec{v})$


**Proposition: A linear map is determined entirely by its values at the standard basis vectors of $R^n$.**

Let $\vec{x} = c_1\vec{e_1} + c_2\vec{e_2} + \dots + c_n\vec{e_n}$

Then, we see that $T(\vec{x}) = c_1T(\vec{e_1}) + c_2T(\vec{e_2}) + \dots + c_nT(\vec{e_n})$

## Matrix for a Transformation

Since $T(\vec{e_j})$ is a vector in $R^m$, we can write $T(\vec{e_j}) = \begin{bmatrix} a_{1j} \\ a_{2j} \\ \dots \\ a_{mj} \end{bmatrix} \in R^m$

We can then associate a **matrix** with T, called the **standard matrix of T**

$$
\begin{align}
[T] = \begin{bmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m1} & a_{m2} & \dots & a_{mn}\end{bmatrix}
\end{align}
$$

Here $T(\vec{e_j})$ forms the jth column of this matrix.
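As a small illustration of reading the standard matrix off the images of the basis vectors, here is a sketch using SymPy. The helper `standard_matrix` and the sample map `rot90` are hypothetical names introduced only for this example.

```python
import sympy as sp

def standard_matrix(T, n):
    """Build [T] column by column: column j is T(e_j)."""
    columns = []
    for j in range(n):
        e_j = sp.zeros(n, 1)
        e_j[j] = 1
        columns.append(T(e_j))
    return sp.Matrix.hstack(*columns)

def rot90(x):
    """Sample linear map (assumed for illustration): rotation of R^2 by pi/2."""
    return sp.Matrix([-x[1], x[0]])

M = standard_matrix(rot90, 2)
print(M)
```

The result can be compared with the rotation by $\pi/2$ example in the next section.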

# Examples of Linear Maps

## Dot Product: $T:R^n \to R$ : $T(\vec{x}) = \vec{a} \cdot \vec{x}$

$\vec{a} = \begin{bmatrix}a_1 \\ a_2 \\ \dots \\ a_n\end{bmatrix}$, $[T] = \begin{bmatrix}a_1 & a_2 & \dots & a_n\end{bmatrix}$

## Rotation by $\pi/2$: $T(\vec{x}) : R^2 \to R^2$ : $T(\vec{x}) = \rho(\vec{x})$

$[T] = \begin{bmatrix} 0 & -1 \\ 1 & 0\end{bmatrix}$

$T({\begin{bmatrix} x_1 \\ x_2\end{bmatrix}}) = \begin{bmatrix} -x_2 \\ x_1\end{bmatrix}$

## Rotation by $\theta$: $T(\vec{x}) : R^2 \to R^2$ 

$A_{\theta} = [T] = \begin{bmatrix} \cos{\theta} & -\sin{\theta} \\ \sin{\theta} & \cos{\theta} \end{bmatrix}$


## Reflection along $x_2 = x_1$: $T(\vec{x}) : R^2 \to R^2$ 

$[T] = \begin{bmatrix} 0 & 1 \\ 1 & 0\end{bmatrix}$

$T({\begin{bmatrix} x_1 \\ x_2\end{bmatrix}}) = \begin{bmatrix} x_2 \\ x_1\end{bmatrix}$

## Projection along a line $x_2 = mx_1$: $T(\vec{x}) : R^2 \to R^2$ 

Take vector $\vec{p} = \begin{bmatrix} 1 \\ m\end{bmatrix}$ along the line.

Then $proj_p \begin{bmatrix} x_1 \\ x_2\end{bmatrix} = \frac{x_1 + mx_2}{1 + m^2}\begin{bmatrix} 1 \\ m\end{bmatrix}$

$[T] = \begin{bmatrix} \frac{1}{1 + m^2} & \frac{m}{1 + m^2} \\ \frac{m}{1 + m^2} & \frac{m^2}{1 + m^2}\end{bmatrix}$

## Reflection along a line $x_2 = mx_1$: $T(\vec{x}) : R^2 \to R^2$ 

Continuing from previous:

$perp_p \begin{bmatrix} x_1 \\ x_2\end{bmatrix} = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} - proj_p \begin{bmatrix} x_1 \\ x_2\end{bmatrix}$

$$
\begin{align}
T(\begin{bmatrix} x_1 \\ x_2 \end{bmatrix}) &= proj_p \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} - perp_p \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} \\
                &= 2 proj_p \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} - \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} \\
                &= \begin{bmatrix} \frac{2x_1 + 2mx_2}{1+m^2} - x_1 \\ \frac{2mx_1 + 2m^2x_2}{1 + m^2} - x_2 \end{bmatrix}
\end{align}
$$

$[T] = \begin{bmatrix} \frac{1-m^2}{1 + m^2} & \frac{2m}{1 + m^2} \\ \frac{2m}{1 + m^2} & -\frac{1 - m^2}{1 + m^2}\end{bmatrix}$
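The relation "reflection = 2 x projection - identity" used in this derivation can be checked symbolically. The following is a minimal sketch with SymPy; the variable names are chosen only for this example.

```python
import sympy as sp

m = sp.symbols('m', real=True)

# Projection onto the line x2 = m*x1
P = sp.Matrix([[1, m], [m, m**2]]) / (1 + m**2)

# Reflection across the same line: 2*proj - identity
R = sp.simplify(2 * P - sp.eye(2))

# Matrix stated in the text
expected = sp.Matrix([[1 - m**2, 2*m], [2*m, m**2 - 1]]) / (1 + m**2)

gap = sp.simplify(R - expected)               # zero matrix if they agree
twice = sp.simplify(R * R - sp.eye(2))        # reflecting twice is the identity
print(gap, twice)
```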


## Shearing : $T(\vec{x}) : R^2 \to R^2$ 

$[T] = \begin{bmatrix} 1 & c \\ 0 & 1\end{bmatrix}$

$T({\begin{bmatrix} x_1 \\ x_2\end{bmatrix}}) = \begin{bmatrix} x_1 + cx_2 \\ x_2\end{bmatrix}$

## Reflection across plane $x_3 = 0$: $T(\vec{x}) : R^3 \to R^3$ 

$[T] = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1\end{bmatrix}$



# Algebra of Linear Functions

Let $M_{m \times n}$ be the set of all $m \times n$ matrices.

The zero matrix $O$ in $M_{m \times n}$ is the matrix with all entries being zero. 

We can define *scalar multiplication* and *addition* of matrices based on the corresponding linear function definitions.

Let $A \in M_{m \times n}$. Let $a_{ij}$ be the element in the matrix A in the ith row and jth column.

Then, given a scalar c, we define $B = cA$ as the matrix in $M_{m \times n}$ where $b_{ij} = c a_{ij}$

Also, given  $A \in M_{m \times n}$ and  $B \in M_{m \times n}$:

we define $C = A + B$ as the matrix in $M_{m \times n}$ where $c_{ij} = a_{ij} + b_{ij}$

## Properties of matrices in $M_{m \times n}$

Let A,B,C $\in M_{m \times n}$ and c,d $\in R$

### Addition Rules

Commutative: A + B = B + A

Associative: A + (B + C) = (A + B) + C

Additive Identity: 0 + A = A 

Additive Inverse: There is a matrix -A such that A + (-A) = 0

Note that the additive identity is unique: if X + A = A, then X + (A + (-A)) = A + (-A), i.e. X = 0.

### Scalar Multiplication Rules

c(dA) = (cd)A

c(A + B) = cA + cB

(c + d)A = cA + dA

1A = A



# Linear Maps Composition & Matrix Multiplication

## Row and Column Vectors

Given an $m \times n$ matrix A, we can define column vectors of matrix A:

$\vec{a_j} = \begin{bmatrix} a_{1j} \\ a_{2j} \\ \dots \\ a_{mj} \end{bmatrix}$, j = 1 to n.

And row vectors of matrix A:

$\vec{A_i} = \begin{bmatrix} a_{i1} \\ a_{i2} \\ \dots \\ a_{in} \end{bmatrix}$, i = 1 to m.


## Multiplying Matrix by a vector

Given $\vec{x} = \begin{bmatrix} x_1 \\ x_2 \\ \dots \\ x_n \end{bmatrix}$ in $R^n$, 

we can write $\vec{x} = x_1\vec{e_1} + x_2\vec{e_2} + \dots + x_n\vec{e_n}$ and define the product $A\vec{x}$ as a linear combination of the column vectors of A:

$$
\begin{align}
A\vec{x} &= T(\vec{x}) \\
         &= x_1T(\vec{e_1}) + x_2T(\vec{e_2}) + \dots + x_nT(\vec{e_n}) \\
         &= x_1\vec{a_1} + x_2\vec{a_2} + \dots + x_n\vec{a_n} \\
\end{align}
$$

However, we can also represent this in terms of row vectors :

$$
\begin{align}
A\vec{x} = \begin{bmatrix} \vec{A_1} \cdot \vec{x} \\ \vec{A_2} \cdot \vec{x} \\ \dots \\ \vec{A_m} \cdot \vec{x} \end{bmatrix}
\end{align}
$$
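Both descriptions of $A\vec{x}$ can be compared on a concrete example. This is a small sketch using SymPy; the matrix and vector are arbitrary values chosen for illustration.

```python
import sympy as sp

A = sp.Matrix([[1, 2, 1], [0, 1, 2]])   # a 2 x 3 example matrix
x = sp.Matrix([3, -1, 2])               # a vector in R^3

# Column view: x_1*a_1 + x_2*a_2 + x_3*a_3
column_view = sp.zeros(A.rows, 1)
for j in range(A.cols):
    column_view = column_view + x[j] * A.col(j)

# Row view: the i-th entry is (row i of A) . x
row_view = sp.Matrix([A.row(i).dot(x) for i in range(A.rows)])

print(column_view, row_view, A * x)
```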

## Multiplying Matrices

Given that linear maps are functions, we can compose them. Since matrices represent linear maps, we define matrix multiplication so that the product of two matrices represents the composition of their maps. Given two linear maps $T: R^n \to R^m$ and $S: R^p \to R^n$, we get:

$[T][S]\vec{x} = T \circ S(\vec{x})$, where $\vec{x} \in R^p$

Here A = [T] is an $m \times n$ matrix, and B = [S] is an $n \times p$ matrix. Then,

We note that given $e_j$ in $R^p$

$S(\vec{e_j}) = b_j$, where $b_j \in R^n$

Now,

$A\vec{b_j} = T(\vec{b_j}) = T(S(\vec{e_j})) = \sum_{i=1}^{n} b_{ij}\vec{a_i} = \begin{bmatrix} A_1 \cdot \vec{b_j} \\ A_2 \cdot \vec{b_j} \\ \dots \\ A_m \cdot \vec{b_j} \end{bmatrix}$, where the result is in $R^m$. 

Thus, we can define matrix multiplication of A and B as the matrix AB, where:

$(AB)_{ij} = A_i \cdot b_j$
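The entry formula can be turned directly into code and compared with SymPy's built-in product. This is a sketch only; `multiply` is a hypothetical helper written for this example, and the matrices are arbitrary.

```python
import sympy as sp

def multiply(A, B):
    """Entry (i, j) of AB is (row i of A) . (column j of B)."""
    assert A.cols == B.rows, "inner dimensions must agree"
    return sp.Matrix(A.rows, B.cols, lambda i, j: A.row(i).dot(B.col(j)))

A = sp.Matrix([[1, 2], [3, 4]])          # 2 x 2
B = sp.Matrix([[1, 2, 1], [0, 1, 2]])    # 2 x 3
AB = multiply(A, B)

# Composition check: applying B and then A equals applying AB
x = sp.Matrix([1, -1, 2])
composed = A * (B * x)
print(AB, composed)
```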

## Identity Matrices

Given the identity linear map $T:R^n \to R^n: T(e_j) = e_j$, for j = 1 to n, let $I_n = [T]$. It follows that the jth column of $I_n$ is $e_j$.

### Properties of Matrix Multiplication

Since matrix multiplication represents the composition of linear maps, it follows that the same properties apply. 

Given A as an $m \times n$ matrix, B as an $n \times p$ matrix, C as a $p \times q$ matrix:

- non-commutative - in general, $AB \ne BA$. In fact BA may not even exist, even if AB exists. If AB and BA both exist and A is an m x n matrix, then B must be n x m.
- associative - $A(BC) = (AB)C$
- left / right identities - $AI_n = A = I_mA$
- distributive over addition - (A + B)C = AC + BC and A(B + C) = AB + AC, whenever the matrix sizes make these sums and products defined
- (cA)B = c(AB) = A(cB)

### Cancellation in Matrices

In general, if AB = 0 we cannot conclude that A or B is zero. Similarly, $A\vec{x} = B\vec{x}$ for one particular $\vec{x}$ does not imply A = B. But we have the following results.

**If $A\vec{x} = 0$ for all $\vec{x} \in R^n$, then A = 0.**

Proof:

$A\vec{e_j} = 0 \implies$ the jth column of A is zero, for j = 1 to n. 

**If $A\vec{x} = B\vec{x}$ for all $\vec{x} \in R^n$, then A = B.**

Proof:

$A\vec{e_j} = B\vec{e_j} \implies \text{ jth column of A = jth column of B for j = 1 to n}$
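To make the failure of cancellation concrete, here is a short SymPy sketch. The particular matrices are assumptions chosen for illustration: two projections whose product is zero, and the identity versus a shear, which agree on one vector but not on all.

```python
import sympy as sp

# Two nonzero matrices whose product is zero:
# A projects onto the x1 axis, B projects onto the x2 axis.
A = sp.Matrix([[1, 0], [0, 0]])
B = sp.Matrix([[0, 0], [0, 1]])
product = A * B

# Two different matrices that agree on one particular vector.
C = sp.eye(2)
D = sp.Matrix([[1, 1], [0, 1]])   # a shear
e1 = sp.Matrix([1, 0])
agree_on_e1 = (C * e1 == D * e1)

# They differ on e2, which is why "for all x" is needed.
e2 = sp.Matrix([0, 1])
agree_on_e2 = (C * e2 == D * e2)

print(product, agree_on_e1, agree_on_e2)
```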



# Square Matrices

We have seen n x n matrices e.g. $I_n$. Matrices with same number of rows and columns are called **square matrices**.

In a square matrix A, we refer to the elements $a_{ii}$ as **diagonal entries**

We call a square matrix 

a) a **diagonal matrix** if $a_{ij} = 0$ whenever $i \ne j$.

b) an **upper triangular** matrix if all entries below the diagonal are zero

c) a **lower triangular** matrix if all entries above the diagonal are zero

d) **symmetric** if $a_{ij} = a_{ji}$ for all i, j

e) **skew-symmetric** if $a_{ij} = -a_{ji}$ for all i, j (so in particular every diagonal entry is zero)


## Powers of a square matrix

For a square matrix, we can consider multiplying it by itself:

$A, AA = A^2, AAA = A^3$ and so on.

e.g. If A rotates a vector by $\theta$, then we can expect $A^3$ to rotate a vector by $3\theta$. 

But consider A as the projection of a vector on a line. 

$A = \begin{bmatrix} \frac{1}{1 + m^2} & \frac{m}{1 + m^2} \\ \frac{m}{1 + m^2} & \frac{m^2}{1 + m^2}\end{bmatrix}$

$A^2 = \begin{bmatrix} \frac{1}{1 + m^2} & \frac{m}{1 + m^2} \\ \frac{m}{1 + m^2} & \frac{m^2}{1 + m^2}\end{bmatrix} = A$

A little reflection will show that after the first application of A, the resulting vector $A\vec{x}$ is already on the line, and its projection is itself. Hence $A^2\vec{x} = A\vec{x}$. It can be easily verified that in this case, $A^i = A$ for every $i \ge 1$.

On the other hand, consider the reflection matrix across line $x_2 = x_1$

$A = \begin{bmatrix} 0 & 1 \\ 1 & 0\end{bmatrix}$. Here $A^2 = I$, and $A^3 = A$.
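These three behaviours of powers can be checked on concrete matrices. The sketch below uses SymPy with sample values picked for illustration: a rotation by $\pi/6$ (so its cube should be a rotation by $\pi/2$), the projection onto $x_2 = 2x_1$, and the reflection across $x_2 = x_1$.

```python
import sympy as sp

def rotation(angle):
    """Standard matrix of the rotation of R^2 by the given angle."""
    return sp.Matrix([[sp.cos(angle), -sp.sin(angle)],
                      [sp.sin(angle),  sp.cos(angle)]])

rot = rotation(sp.pi / 6)
rot_cubed = sp.simplify(rot * rot * rot)      # three rotations by pi/6

proj = sp.Matrix([[1, 2], [2, 4]]) / 5        # projection onto x2 = 2*x1
refl = sp.Matrix([[0, 1], [1, 0]])            # reflection across x2 = x1

print(rot_cubed, proj * proj, refl * refl)
```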




# Matrix Inverse

**Definition: Inverse of a Matrix** 
Let A be an $n \times n$ matrix. We say A is invertible, if there is an $n \times n$ matrix B, such that:

$AB = BA = I_n$

We call B the inverse of matrix A, and denote this by $B = A^{-1}$

It is easy to see that if $A = [T]$, $A^{-1} = [T^{-1}]$

Also, $A^{-1}$ is unique, as can be easily proven.

### Example in R^2

Given $A = \begin{bmatrix} a & b \\ c & d \end{bmatrix}$, $A^{-1} = \frac{1}{ad-bc}\begin{bmatrix} d & -b \\ -c & a \end{bmatrix}$

The inverse exists if and only if $ad \ne bc$
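A symbolic check of the 2 x 2 inverse is straightforward. The following sketch (SymPy, with symbol names chosen for this example) multiplies A by the candidate inverse, which swaps a and d and negates b and c, on both sides.

```python
import sympy as sp

a, b, c, d = sp.symbols('a b c d')
A = sp.Matrix([[a, b], [c, d]])

# Candidate inverse: swap the diagonal, negate the off-diagonal, divide by ad - bc
A_inv = sp.Matrix([[d, -b], [-c, a]]) / (a*d - b*c)

left = sp.simplify(A_inv * A)
right = sp.simplify(A * A_inv)
print(left, right)
```

Both products simplify to the identity, under the assumption that $ad - bc \ne 0$.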

## Inverse of a Matrix Multiplication

Let A and B be invertible $n \times n$ matrices. Then

$(AB)^{-1} = B^{-1}A^{-1}$

Proof: 

$B^{-1}A^{-1}AB = B^{-1}(A^{-1}A)B = B^{-1}I_nB = B^{-1}(I_nB) = B^{-1}B = I_n$

$ABB^{-1}A^{-1} = A(BB^{-1})A^{-1} = AI_nA^{-1} = (AI_n)A^{-1} = AA^{-1} = I_n$


# Matrix Transpose

Given an $m \times n$ matrix A, its transpose $A^T$ is an $n \times m$ matrix such that:

$(A^T)_{ij} = (A)_{ji}$

It follows that for a **symmetric matrix**, $A^T = A$, for a **skew-symmetric matrix**, $A^T = -A$

## Properties of Transpose

Let A and A' be $n \times n$ matrices, B be an $n \times p$ matrix. Then

a) $(A^T)^T = A$

b) $(cA)^T = cA^T$

c) $(A + A')^T = A^T + A'^T$

d) $(AB)^T = B^TA^T$

**Proof of (d):**

Note first that AB is an $n \times p$ matrix, so $(AB)^T$ is a $p \times n$ matrix, same as $B^TA^T$.

By definition:

$((AB)^T)_{ij} = (AB)_{ji} = A_j \cdot b_i$

$(B^TA^T)_{ij} = b_i \cdot A_j = ((AB)^T)_{ij}$

Here row i of $B^T$ is the column $b_i$ of B, and column j of $A^T$ is the row $A_j$ of A.

## Transpose and the Dot Product

Given $\vec{x}$ and $\vec{y}$ in $R^n$, we can define a matrix:

$\vec{x}^T$ such that $\vec{x}^T\vec{y} = [\vec{x}.\vec{y}]$. For purpose of this discussion, we will write:

$\vec{x} \cdot \vec{y} = \vec{x}^T\vec{y}$

... noting that the R.H.S. is a single-element matrix.

### Prove: Given A in $M_{m \times n}$, $\vec{x} \in R^n$, $\vec{y} \in R^m$, $A\vec{x} \cdot \vec{y} = \vec{x} \cdot A^T\vec{y}$

$$
\begin{align}
A\vec{x} \cdot \vec{y} &= (A\vec{x})^T\vec{y} \\
                       &= \vec{x}^TA^T\vec{y} \\
                       &= \vec{x}^T(A^T\vec{y}) \\
                       &= \vec{x} \cdot A^T \vec{y}
\end{align}
$$
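The identity $A\vec{x} \cdot \vec{y} = \vec{x} \cdot A^T\vec{y}$ can also be confirmed for a generic matrix. This is a sketch with SymPy for the 2 x 3 case; the symbol names are assumptions for this example.

```python
import sympy as sp

A = sp.Matrix(2, 3, list(sp.symbols('a1:7')))   # generic 2 x 3 matrix
x = sp.Matrix(list(sp.symbols('x1:4')))         # vector in R^3
y = sp.Matrix(list(sp.symbols('y1:3')))         # vector in R^2

lhs = (A * x).dot(y)        # Ax . y, a dot product in R^2
rhs = x.dot(A.T * y)        # x . A^T y, a dot product in R^3
gap = sp.expand(lhs - rhs)
print(gap)
```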


### Example of Transpose using products and ingredients

Let the vector $\vec{x} = \begin{bmatrix} x_1 \\ x_2 \\ \dots \\ x_n \end{bmatrix}$ represent the units of each product we want to manufacture. These n products need m ingredients, and the $m \times n$ matrix A represents the quantity of ingredients needed for each product, i.e. the jth column of A represents the quantity of each of the m ingredients needed per unit of product j. 

Then, $\vec{y} = A\vec{x}$ represents the total quantity of each of the m ingredients needed for all n products put together.

Finally, we have the vector $\vec{p}$ representing the unit price for each ingredient.

We get total cost C:

$C = \vec{y} \cdot \vec{p} = A\vec{x} \cdot \vec{p} = \vec{x} \cdot A^T \vec{p}$

Let $\vec{q} = A^T \vec{p} = \begin{bmatrix} q_1 \\ q_2 \\ \dots \\ q_n \end{bmatrix}$. Then $q_i$ represents the ingredient cost per unit of the ith product. 
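A numeric version of this example may help. All quantities and prices below are made-up values for illustration (3 ingredients, 2 products); the sketch computes the total cost both ways.

```python
import sympy as sp

# Column j lists the amount of each of the 3 ingredients per unit of product j
A = sp.Matrix([[2, 1],
               [0, 3],
               [1, 1]])
x = sp.Matrix([10, 5])       # units of each product to manufacture
p = sp.Matrix([4, 1, 2])     # unit price of each ingredient

y = A * x                    # total quantity of each ingredient
cost_via_ingredients = y.dot(p)

q = A.T * p                  # ingredient cost per unit of each product
cost_via_products = x.dot(q)

print(y.T, q.T, cost_via_ingredients, cost_via_products)
```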

# Ex 3 : Let A be an m x n matrix. Show V = $\{ \vec{x} \in R^n: A\vec{x} = 0 \}$ is a subspace of $R^n$.

Proof: 

a) $0 \in V$, since $A.0 = 0$

Choose any $x, y \in V$

b) $A(c\vec{x}) = c(A\vec{x}) = c.0 = 0$ 

c) $A(\vec{x} + \vec{y}) = A\vec{x} + A\vec{y} = 0 + 0 = 0$

In [29]:
# Ex 5

import sympy as s

m = s.var('m')
theta = s.var('t')

reflect_matrix = (1/(1 + m**2)) * s.Matrix([[(1 - m**2), 2*m], [2*m, m**2 - 1]])
proj_matrix = (1/(1 + m**2)) * s.Matrix([[1, m], [m, m**2]])
rotate_matrix = s.Matrix([[s.cos(theta), -s.sin(theta)], [s.sin(theta), s.cos(theta)]])

reflect_y = s.Matrix([[-1,0], [0, 1]])
proj_y = s.Matrix([[0,0], [0, 1]])

e1 = s.Matrix([1,0])
e2 = s.Matrix([0,1])

def reflect(slope):
    if slope == s.oo:
        r = reflect_y
    else:
        r = reflect_matrix.subs(m, slope)
    return r


def project(slope):
    if slope == s.oo:
        p = proj_y
    else:
        p = proj_matrix.subs(m, slope)
    return p

def rotate(angle):
    return rotate_matrix.subs(theta, angle)

# (b)
display(reflect(1))

# (c)
display(reflect(s.oo)*reflect(0))

# (d)

display(project(2))

# (e)

display(rotate(s.pi/2)*project(s.Rational(1,2)))

# (f)

display(project(s.Rational(1,2))*rotate(s.pi/2))





Matrix([
[0, 1],
[1, 0]])

Matrix([
[-1,  0],
[ 0, -1]])

Matrix([
[1/5, 2/5],
[2/5, 4/5]])

Matrix([
[-2/5, -1/5],
[ 4/5,  2/5]])

Matrix([
[2/5, -4/5],
[1/5, -2/5]])

# Ex 9 : 2 x 2 matrices

$A = \begin{bmatrix}a & b \\ c & d\end{bmatrix}$

$A^2 = \begin{bmatrix}a^2 + bc & b(a + d) \\ c(a + d) & d^2 + bc \end{bmatrix}$

**a) $A^2 = I_2$**

$a^2 + bc = d^2 + bc = 1$, $b(a + d) = c(a + d) = 0$

if b = 0: a and d are each $\pm 1$. If a = d then c = 0; if a = -d then c can be anything.

if b not 0: a = -d, and $c = (1-a^2)/b$

So, matrices are :

$\pm I_2$ OR $\begin{bmatrix}1 & 0 \\ c & -1\end{bmatrix}$ OR $\begin{bmatrix}-1 & 0 \\ c & 1 \end{bmatrix}$ OR $\begin{bmatrix}a & b \\ \frac{1-a^2}{b} & -a \end{bmatrix}$ with $b \ne 0$


**b) $A^2 = 0$**

$a^2 + bc = d^2 + bc = b(a + d) = c(a + d) = 0$

if b = 0, a = 0, d = 0, c can be anything

if b not 0, a = -d. c = -a^2/b

So, matrices are :

$\begin{bmatrix}0 & 0 \\ c & 0\end{bmatrix}$ OR  $\begin{bmatrix}a & b \\ -\frac{a^2}{b} & -a \end{bmatrix}$

**c) $A^2 = -I_2$**

$a^2 + bc = d^2 + bc = -1$, $b(a + d) = c(a + d) = 0$

if b = 0, then $a^2 = d^2 = -1$, so a and d are not real.

if b not 0, a = -d, and $c = -(1 + a^2)/b$

So, matrices are :

$\begin{bmatrix}a & b \\ -\frac{a^2+1}{b} & -a \end{bmatrix}$ with $b \ne 0$


In [42]:
# Ex 10

display(s.simplify(reflect(s.sin(theta)/s.cos(theta))))

display(rotate(2*theta)*reflect(0))

display(s.simplify(rotate(theta)*reflect(0)*rotate(-theta)))


Matrix([
[cos(2*t),  sin(2*t)],
[sin(2*t), -cos(2*t)]])

Matrix([
[cos(2*t),  sin(2*t)],
[sin(2*t), -cos(2*t)]])

Matrix([
[cos(2*t),  sin(2*t)],
[sin(2*t), -cos(2*t)]])

# Ex 11 : Formula for A^n

(a) $A = \begin{bmatrix}1 & 1 \\ 0 & 1\end{bmatrix}$ 

A shearing matrix. 

$A^2 = \begin{bmatrix}1 & 2 \\ 0 & 1\end{bmatrix}$ 

$A^n = \begin{bmatrix}1 & n \\ 0 & 1\end{bmatrix}$ 


b) $A = \begin{bmatrix}d_1 & & & \\ & d_2 & & \\ & & \ddots & \\ & & & d_n\end{bmatrix}$, a diagonal matrix

$A^k = \begin{bmatrix}d_1^k & & & \\ & d_2^k & & \\ & & \ddots & \\ & & & d_n^k\end{bmatrix}$, again a diagonal matrix (the exponent is written as k here to avoid a clash with the matrix size n)




# Ex 12: m+n matrix

$\begin{bmatrix}A & B \\ C & D\end{bmatrix}\begin{bmatrix}A' & B' \\ C' & D'\end{bmatrix} = \begin{bmatrix}AA' + BC' & AB' + BD' \\ CA' + DC' &  CB' + DD' \end{bmatrix}$

- A, A' : m x m
- B, B' : m x n
- C, C' : n x m
- D, D' : n x n

For first matrix: rows are of two types:

$[A_i \text{ } B_i]$ for i = 1 to m, with $A_i$ having m elements, $B_i$ having n.

$[C_{i-m} \text{ } D_{i-m}]$ for i = m+1 to m+n, with $C_i$ having m elements, $D_i$ having n.

For second matrix: columns are of two types (shown horizontally for ease):

$[A'_j \text{ } C'_j]$ for j = 1 to m, with $A'_j$ having m elements, $C'_j$ having n.

$[B'_{j-m} \text{ } D'_{j-m}]$ for j = m+1 to m+n, with $B'_{j-m}$ having m elements, $D'_{j-m}$ having n.

Multiplying, we see the four combinations for the ith row, jth column entry of the result:

$[A_i.A'_j + B_i.C'_j]$ for i = 1 to m, j = 1 to m.

$[A_i.B'_{j-m} + B_i.D'_{j-m}]$ for i = 1 to m, j = m+1 to m+n.

$[C_{i-m}.A'_j + D_{i-m}.C'_j]$ for i = m+1 to m+n, j = 1 to m.

$[C_{i-m}.B'_{j-m} + D_{i-m}.D'_{j-m}]$ for i = m+1 to m+n, j = m+1 to m+n.




In [47]:
# Ex 13

# a
T = rotate(s.pi/2)
S = reflect(-1)
display(T, S)

# b
display(T*S)

# c
display(S*T)


Matrix([
[0, -1],
[1,  0]])

Matrix([
[ 0, -1],
[-1,  0]])

Matrix([
[1,  0],
[0, -1]])

Matrix([
[-1, 0],
[ 0, 1]])

In [53]:
# Ex 14

# a

angle = -s.pi/4


display(rotate(angle))

Matrix([
[ sqrt(2)/2, sqrt(2)/2],
[-sqrt(2)/2, sqrt(2)/2]])

# Ex 17: $(BAB^{-1})^n$ and $(BAB^{-1})^{-1}$

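$(BAB^{-1})^2 = BAB^{-1}BAB^{-1} = BA^2B^{-1}$

$(BA^nB^{-1})(BAB^{-1}) = BA^nAB^{-1} = BA^{n+1}B^{-1}$

So, by induction, $(BAB^{-1})^n = BA^nB^{-1}$.

For the inverse, suppose E satisfies $EBAB^{-1} = I$. Then:

$EBA = B$

$EB = BA^{-1}$

$E = BA^{-1}B^{-1}$

So A needs to be invertible, and then $(BAB^{-1})^{-1} = BA^{-1}B^{-1}$.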


# Ex 18: $A^3 - 3A + 2I = 0$

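$A^3 - 3A = -2I$

$A(A^2 - 3I) = -2I$

$A \cdot \left(-\tfrac{1}{2}(A^2 - 3I)\right) = I$

The same factor also works on the left, since it is a polynomial in A and therefore commutes with A. Hence A is invertible, with

$A^{-1} = \tfrac{3}{2}I - \tfrac{1}{2}A^2$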
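The formula $A^{-1} = \tfrac{3}{2}I - \tfrac{1}{2}A^2$ can be tried on a sample matrix. The matrix below is an assumption made for this sketch: its eigenvalues are 1 and -2, which are roots of $x^3 - 3x + 2$, so it satisfies the equation.

```python
import sympy as sp

A = sp.Matrix([[1, 3], [0, -2]])     # sample matrix with eigenvalues 1 and -2
I2 = sp.eye(2)

# Confirm that A satisfies A^3 - 3A + 2I = 0
residual = A * A * A - 3 * A + 2 * I2

# Candidate inverse from the derivation
candidate = sp.Rational(3, 2) * I2 - sp.Rational(1, 2) * (A * A)

print(residual, candidate, candidate * A, A * candidate)
```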

# Ex 19: Trace of a matrix

A is an n x n matrix

$tr(A) = \sum_{i=1}^{n} a_{ii}$ 

a) $tr(A) = tr(A^T)$

b) $tr(cA) = c \, tr(A)$

c) $tr(A + B) = tr(A) + tr(B)$

d) tr(AB) = tr(BA) 

Proof of (d)

$(AB)_{ii} = A_i \cdot b_i = \sum_{k=1}^{n} a_{ik}b_{ki}$

$tr(AB) = \sum_{i=1}^{n}\sum_{k=1}^{n} a_{ik}b_{ki}$

$(BA)_{ii} = B_i \cdot a_i = \sum_{l=1}^{n} b_{il}a_{li}$

$tr(BA) = \sum_{i=1}^{n}\sum_{l=1}^{n} b_{il}a_{li} = \sum_{i=1}^{n}\sum_{k=1}^{n} a_{ki}b_{ik} = \sum_{i=1}^{n}\sum_{k=1}^{n} a_{ik}b_{ki} = tr(AB)$
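As a quick sanity check of (d), the sketch below compares the two traces for generic 2 x 2 matrices using SymPy. The symbol names are chosen for this example only.

```python
import sympy as sp

A = sp.Matrix(2, 2, list(sp.symbols('a1:5')))   # generic 2 x 2 matrix
B = sp.Matrix(2, 2, list(sp.symbols('b1:5')))   # generic 2 x 2 matrix

trace_gap = sp.expand((A * B).trace() - (B * A).trace())
products_equal = (A * B == B * A)               # the products themselves differ

print(trace_gap, products_equal)
```

The traces agree even though AB and BA are different matrices in general.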


In [70]:
A = s.Matrix([[1,2],[3,4]])
B = s.Matrix([[2,1],[4, 3]])
C = s.Matrix([[1,2,1],[0,1,2]])
D = s.Matrix([[0,1],[1,0],[2,3]])


display(A.T)
display(2*A-B.T)
display(C.T)
display(C.T + D)
display(A.T*C)
# display(A*C.T)
display(C.T*A.T)
display(B*D.T)
# display(D.T*B)
# display(C*C.T)
#display(C.T*C)
#display(C.T*D.T)




Matrix([
[1, 3],
[2, 4]])

Matrix([
[0, 0],
[5, 5]])

Matrix([
[1, 0],
[2, 1],
[1, 2]])

Matrix([
[1, 1],
[3, 1],
[3, 5]])

Matrix([
[1, 5,  7],
[2, 8, 10]])

Matrix([
[1,  3],
[4, 10],
[5, 11]])

Matrix([
[1, 2,  7],
[3, 4, 17]])

# Ex 24 : Let A, B be symmetric. Prove AB symmetric <=> AB = BA

Proof:

a) Assume AB symmetric 

$(AB)_{ij} = (AB)_{ji} = A_j.b_i = a_j.B_i = B_i.a_j = (BA)_{ij}$

b) Assume AB = BA

$(AB)_{ij} = (BA)_{ij} = B_i.a_j = b_i.A_j = (AB)_{ji}$



# Ex 25: A is an m x n matrix. Prove $A^TA$ is symmetric

$(A^TA)_{ij} = a_i \cdot a_j = a_j \cdot a_i = (A^TA)_{ji}$, where $a_i$ is the ith column of A (which is also the ith row of $A^T$).



# Ex 26: A is invertible. Prove $(A^{-1})^TA^T = I = A^T(A^{-1})^T$ i.e. $A^T$ is invertible.

$A^{-1}A = I \implies A^T(A^{-1})^T = I^T = I$

$AA^{-1} = I \implies (A^{-1})^TA^T = I^T = I$

Thus $(A^{-1})^T$ is the inverse of $A^T$, when A is invertible.

# Ex 27: Permutation Matrices

An n x n matrix is called a permutation matrix if it has a single 1 in each row and column.

c) Prove product of two permutation matrices is a permutation matrix

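If P and Q are permutation matrices: let $(p_1, p_2, ..., p_n)$ be the column positions of the 1's in rows 1 to n of P, and $(q_1,q_2,...,q_n)$ be the row positions of the 1's in columns 1 to n of Q. Below, $P_i \cdot q_j$ means row i of P dotted with column j of Q, while $p_i$ and $q_j$ inside subscripts are the positions just defined.

row i of PQ = $[P_i \cdot q_1 \; P_i \cdot q_2 \; \dots \; P_i \cdot q_n] = [q_{p_i 1} \; q_{p_i 2} \; \dots \; q_{p_i n}] = Q_{p_i}$ 

col j of PQ = $[P_1 \cdot q_j \; P_2 \cdot q_j \; \dots \; P_n \cdot q_j] = [p_{1 q_j} \; p_{2 q_j} \; \dots \; p_{n q_j}] = p_{q_j}$ 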

Thus the ith row of PQ is the $p_i$ th row of Q - since every $p_i$ is different, this means that this is a permutation of the rows of Q, so every row has only one 1.

The jth column of PQ is the $q_j$ th column of P. Since every $q_j$ is different, this means this is a permutation of the columns of P, hence every column has only one 1.

d) Prove every permutation matrix is invertible and $P^{-1} = P^T$

Let $Q = P^T$. Q is a permutation matrix, and $P_i = q_i$ (row i of P is column i of Q).

So $(PQ)_{ii} = P_i \cdot q_i = 1$, since they have 1 in the same place. 

Similarly, $(QP)_{ii} = Q_i \cdot p_i = 1$, since they have 1 in the same place. 

Also PQ and QP are products of two permutation matrices, so each row and column has only one 1, and that 1 must be the diagonal entry, i.e. PQ = I = QP

Q = $P^T = P^{-1}$

e) PA = A with rows exchanged, AP = A with columns exchanged


In [92]:

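def permute_matrix(p):
    """Return f(i, j) that is 1 when row i has its 1 in column p[i], else 0."""
    def inner_perm(i, j):
        return 1 if p[i] == j else 0
    return inner_perm

from sympy.utilities.iterables import generate_bell

# All 2 x 2 permutation matrices. The result is named perm so that it does
# not overwrite the symbol m used by reflect() and project() above.
for p in generate_bell(2):
    perm = s.Matrix(2, 2, permute_matrix(p))
    display(perm)

# All 3 x 3 permutation matrices
for p in generate_bell(3):
    perm = s.Matrix(3, 3, permute_matrix(p))
    display(perm)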



Matrix([
[1, 0],
[0, 1]])

Matrix([
[0, 1],
[1, 0]])

Matrix([
[1, 0, 0],
[0, 1, 0],
[0, 0, 1]])

Matrix([
[1, 0, 0],
[0, 0, 1],
[0, 1, 0]])

Matrix([
[0, 0, 1],
[1, 0, 0],
[0, 1, 0]])

Matrix([
[0, 0, 1],
[0, 1, 0],
[1, 0, 0]])

Matrix([
[0, 1, 0],
[0, 0, 1],
[1, 0, 0]])

Matrix([
[0, 1, 0],
[1, 0, 0],
[0, 0, 1]])

# Ex 29: Let A be an m x n matrix, and let x,y in $R^n$. Prove that if Ax = 0, and $y = A^Tb$ for some b in $R^m$, then x.y = 0

$x.A^Tb = Ax.b = 0.b = 0$

# Ex 30: Let A be a symmetric n x n matrix. Let V be a subspace of $R^n$ such that $Ax \in V$ for all $x \in V$. Prove $Ay \in V^{\perp}$ for all y in $V^{\perp}$

Proof:

Let y be in $V^{\perp}$ and let x be any vector in V. Since A is symmetric, $A = A^T$.

Then $x \cdot Ay = x \cdot A^Ty = Ax \cdot y = 0$, since $Ax \in V$ and $y \in V^{\perp}$. Hence $Ay \in V^{\perp}$.

# Ex 32: A is m x n matrix, x in Rn satisfies $(A^TA)x = 0$. Prove Ax = 0.

$\|Ax\|^2 = Ax \cdot Ax = (Ax)^TAx = x^T(A^TA)x = x^T0 = 0$, so $Ax = 0$.

# Ex 33: A is symmetric and $A^2 = 0$. Prove A = 0.

$(A^2)_{ii} = A_i \cdot a_i = 0$

But $a_i = A_i$ (A is symmetric)

So $A_i \cdot A_i = \|A_i\|^2 = 0 \implies A_i = 0$ for every i, i.e. A = 0.

# Ex 34: Orthogonal Matrix: $A^TA = I_n$

a) Proof:
$(A^TA)_{ij} = A^T_i \cdot a_j = a_i \cdot a_j$

$a_i \cdot a_j = 1$ if i = j, $a_i \cdot a_j = 0$ if $i \ne j$. 

c) for a 2x2 matrix, 

Using (a)

$a_1 = (\cos t, \sin t)$, $a_2 = (\cos w, \sin w)$ (each column is a unit vector)

Since these are orthogonal,

$\cos t \cos w + \sin t \sin w = 0$

$\cos(t - w) = 0$

$t - w = \pi/2$ or $t - w = -\pi/2$ (up to multiples of $2\pi$)

$w = t - \pi/2$ or $w = t + \pi/2$

$w = t - \pi/2$: $\cos w = \cos(t - \pi/2) = \sin t$, $\sin w = \sin(t - \pi/2) = -\cos t$

$w = t + \pi/2$: $\cos w = \cos(t + \pi/2) = -\sin t$, $\sin w = \sin(t + \pi/2) = \cos t$

Ans: 

$\begin{bmatrix}\cos{\theta} & -\sin{\theta} \\ \sin(\theta) & \cos{\theta} \end{bmatrix}$, or

$\begin{bmatrix}\cos{\theta} & \sin{\theta} \\ \sin(\theta) & -\cos{\theta} \end{bmatrix}$

d) For 2x2 matrix, the first one is a rotation. The second one is a reflection, followed by a rotation.

$\begin{bmatrix}\cos{\theta} & \sin{\theta} \\ \sin(\theta) & -\cos{\theta} \end{bmatrix} = \begin{bmatrix}\cos{\theta} & -\sin{\theta} \\ \sin(\theta) & \cos{\theta} \end{bmatrix}\begin{bmatrix}1 & 0 \\ 0 & -1 \end{bmatrix}$

e) Assume $A^T = A^{-1}$ when A is orthogonal (proved in chapter 4). Prove row vectors of A are unit vectors orthogonal to each other.

$AA^T = I = (A^T)^TA^T$, since $A^T = A^{-1}$.

Hence, $A^T$ is an orthogonal matrix, and by (a) its columns, which are the rows of A, are unit vectors orthogonal to each other.






# Ex 35: More orthogonal matrices

a) Prove if A and B are orthogonal, so is AB

$(AB)^T AB = B^T A^T AB = B^T I B = B^T B = I$

b) If A is orthogonal, so is $A^{-1}$

$A^{-1} = A^T$, and $A^T$ is orthogonal by Ex 34 (e).


# Ex 36: Symmetric / Skew-Symmetric Matrices

a) Prove only 0 is both symmetric and skew symmetric

$a_{ij} = a_{ji}$ (symmetric), $a_{ij} = -a_{ji}$ (skew), so $a_{ij} = -a_{ij}$, i.e. $a_{ij} = 0$

b) A is n x n. Prove $S = 1/2(A + A^T)$ is symmetric, $K = 1/2(A - A^T)$ is skew symmetric.

$(S)_{ij} = \frac{1}{2}(a_{ij} + a_{ji}) = (S)_{ji}$

$(K)_{ij} = \frac{1}{2}(a_{ij} - a_{ji}) = -(K)_{ji}$

c) Prove any square matrix A can be written as A = S + K, where S is symmetric, K is skew symmetric.

See (b). S + K = A.
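The decomposition can be seen on a small example. This is a sketch with SymPy; the matrix A is an arbitrary value chosen for illustration.

```python
import sympy as sp

A = sp.Matrix([[1, 2], [3, 4]])

S = (A + A.T) / 2      # symmetric part
K = (A - A.T) / 2      # skew-symmetric part

print(S, K, S + K)
```

Here S has equal off-diagonal entries, K has zero diagonal and opposite off-diagonal entries, and their sum recovers A.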

d) If A = S + K and A = S' + K', S = S', K = K' (S, K are unique)

Subtracting, we get (S - S') + (K - K') = 0, i.e. S - S' = K' - K.

Note that S - S' is symmetric (easy to prove), and K' - K is skew-symmetric (if $a_{ij} = -a_{ji}$ and $b_{ij} = -b_{ji}$, then $a_{ij} - b_{ij} = -(a_{ji} - b_{ji})$).

So the matrix S - S' = K' - K is both symmetric and skew-symmetric, and by (a) it must be 0.

Thus, K = K' and S = S'.

e) A is an nxn matrix commuting with all nxn matrices - AB = BA for all nxn matrices B. Describe A.

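$(AB)_{ij} = A_i \cdot b_j$ and $(BA)_{ij} = B_i \cdot a_j$ must agree for every B.

Pick i, j with $i \ne j$. Choose a diagonal matrix B with $b_{ii} = -1$, $b_{jj} = 1$

Then $(AB)_{ij} = a_{ij}b_{jj} = a_{ij}$ and $(BA)_{ij} = b_{ii}a_{ij} = -a_{ij}$, so:

$a_{ij} = -a_{ij}$ i.e. $a_{ij} = 0$ when $i \ne j$.

So A is a diagonal matrix. 

Now take any B. Since A is a diagonal matrix, $A_i \cdot b_j = a_{ii}b_{ij}$ and $B_i \cdot a_j = b_{ij}a_{jj}$.

So $a_{ii}b_{ij} = b_{ij}a_{jj}$, i.e. $a_{ii} = a_{jj} = c$, say, choosing B with $b_{ij}$ non-zero (which we can, since B is any matrix)

So A = cI.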




