### Transposing a matrix

The transpose of a matrix switches the row and column indices...

$[A^T]_{ij}=[A]_{ji}$

Another way of thinking about it is that the rows of the original matrix, $A$, become the columns in the transposed matrix, $A^T$. (Or, that the columns of the original matrix, $A$, become the rows in the transposed matrix, $A^T$.)
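
As a minimal sketch of the index rule above in plain Python, assume a matrix is stored as a list of rows. The helper name `transpose` and the example values are ours, chosen only for illustration.

```python
def transpose(matrix):
    """Return the transpose of a matrix stored as a list of rows."""
    # zip(*matrix) groups the i-th entry of every row, which is the i-th column.
    return [list(column) for column in zip(*matrix)]


A = [
    [1, 2, 3],
    [4, 5, 6],
]
A_T = transpose(A)

print(A_T)  # [[1, 4], [2, 5], [3, 6]]
```

Each row of `A` shows up as a column of `A_T`, so `A_T[i][j]` equals `A[j][i]`.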

The effect is that the matrix is flipped over its diagonal...

<a title="By LucasVB [Public domain], from Wikimedia Commons" href="https://commons.wikimedia.org/wiki/File:Matrix_transpose.gif"><img width="128" alt="Matrix transpose" src="https://upload.wikimedia.org/wikipedia/commons/e/e4/Matrix_transpose.gif"></a>

Notice that:

* If $A$ is an $m\times n$ matrix then $A^T$ is an $n\times m$ matrix
* The transpose of $A^T$ is $A$, that is $(A^T)^T=A$ (transposing is its own inverse operation)
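
Both properties can be checked on a small example. This is only a sketch in plain Python: the helpers `transpose` and `shape` are hypothetical names, and the matrix is assumed to be a non-empty list of equal-length rows.

```python
def transpose(matrix):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(column) for column in zip(*matrix)]


def shape(matrix):
    """Return (rows, columns) for a non-empty list-of-rows matrix."""
    return (len(matrix), len(matrix[0]))


A = [
    [1, 2, 3],
    [4, 5, 6],
]  # a 2 x 3 matrix
A_T = transpose(A)

print(shape(A), shape(A_T))  # (2, 3) (3, 2)
print(transpose(A_T) == A)   # True
```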

### An orthonormal matrix

A set of basis vectors is orthonormal if the vectors are orthogonal to each other (the projection of any one of them onto any other equals $0$) and each has a length of $1$ (they are unit vectors).

Imagine $n$ orthonormal vectors: $\hat a_1, \hat a_2, \dots, \hat a_n$

Then the projection of any one of them, $a_i$, onto any other one, $a_j$, is...

$\frac{a_i.a_j}{\lvert a_j \rvert}=0$ (but $\lvert a_j \rvert=1$ of course, because they're unit vectors)

$a_i.a_j=0$ if $i \ne j$

The projection of a vector on to itself is equal to the length of the vector ($1$ in our case)...

$a_i.a_j=1$ if $i = j$
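
The two conditions above can be tested numerically. The sketch below assumes a concrete example that is not part of the text: the standard 2D basis rotated by 30 degrees. The helper names `dot` and `is_orthonormal` are ours, and a small tolerance is used because floating-point dot products are rarely exact.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


def is_orthonormal(vectors, tol=1e-9):
    """Check a_i . a_j == 1 when i == j and 0 otherwise, within a tolerance."""
    for i, u in enumerate(vectors):
        for j, v in enumerate(vectors):
            expected = 1.0 if i == j else 0.0
            if not math.isclose(dot(u, v), expected, abs_tol=tol):
                return False
    return True


# Assumed example: the standard 2D basis rotated by 30 degrees.
theta = math.radians(30)
a1 = (math.cos(theta), math.sin(theta))
a2 = (-math.sin(theta), math.cos(theta))

print(is_orthonormal([a1, a2]))                  # True
print(is_orthonormal([(1.0, 0.0), (1.0, 1.0)]))  # False
```

The second set fails because its vectors are neither orthogonal to each other nor both of unit length.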

Now imagine making a transform matrix for this space by forming the columns from the basis vectors (like we've done before)...

$A=\begin{bmatrix}
\begin{pmatrix}\vdots \\ \hat a_1 \\ \vdots \end{pmatrix} &&
\begin{pmatrix}\vdots \\ \hat a_2 \\ \vdots \end{pmatrix} &&
\dots &&
\begin{pmatrix}\vdots \\ \hat a_n \\ \vdots \end{pmatrix}
\end{bmatrix}$

This is called an orthonormal matrix (in most textbooks the same thing is called an orthogonal matrix).

The transpose of this matrix is obtained by forming the **rows** from the basis vectors...

$A^T=\begin{bmatrix}
\begin{pmatrix}\dots && \hat a_1 && \dots \end{pmatrix} \\
\begin{pmatrix}\dots && \hat a_2 && \dots \end{pmatrix} \\
\vdots \\
\begin{pmatrix}\dots && \hat a_n && \dots \end{pmatrix}
\end{bmatrix}$

Notice how each entry of the matrix product $A^TA$ is the dot product of a row of $A^T$ with a column of $A$, which means it is a dot product between two of the basis vectors...

$A^TA=
\begin{bmatrix}
\begin{pmatrix}\dots && \hat a_1 && \dots \end{pmatrix} \\
\begin{pmatrix}\dots && \hat a_2 && \dots \end{pmatrix} \\
\vdots \\
\begin{pmatrix}\dots && \hat a_n && \dots \end{pmatrix}
\end{bmatrix}
\begin{bmatrix}
\begin{pmatrix}\vdots \\ \hat a_1 \\ \vdots \end{pmatrix} &&
\begin{pmatrix}\vdots \\ \hat a_2 \\ \vdots \end{pmatrix} &&
\dots &&
\begin{pmatrix}\vdots \\ \hat a_n \\ \vdots \end{pmatrix}
\end{bmatrix}=
\begin{bmatrix}
\hat a_1.\hat a_1 && \hat a_1.\hat a_2 && \dots && \hat a_1.\hat a_n \\
\hat a_2.\hat a_1 && \hat a_2.\hat a_2 && \dots && \hat a_2.\hat a_n \\
\vdots && \vdots && \ddots && \vdots \\
\hat a_n.\hat a_1 && \hat a_n.\hat a_2 && \dots && \hat a_n.\hat a_n
\end{bmatrix}$

And, because of the properties we've already determined...

$a_i.a_j=0$ if $i \ne j$

$a_i.a_j=1$ if $i = j$

It's easy to see that $A^TA$ produces an identity matrix...

$A^TA=
\begin{bmatrix}
1 && 0 && 0 && \dots \\
0 && 1 && 0 && \dots \\
0 && 0 && 1 && \dots \\
\vdots && \vdots && \vdots && \ddots
\end{bmatrix}$

...which tells us that the transpose of an orthonormal matrix is also its inverse, $A^T=A^{-1}$ (because $A$ is square and the inverse is the matrix that satisfies $A^{-1}A=I$)
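
To see this numerically, the sketch below builds $A$ from two orthonormal vectors and multiplies it by its transpose. The vectors (a 30 degree rotation of the standard 2D basis) and the helpers `transpose` and `matmul` are assumptions made for this example, written in plain Python rather than with any particular linear algebra library.

```python
import math


def transpose(matrix):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(column) for column in zip(*matrix)]


def matmul(X, Y):
    """Multiply two list-of-rows matrices: entry (i, j) is row i of X dot column j of Y."""
    Y_columns = transpose(Y)
    return [[sum(x * y for x, y in zip(row, col)) for col in Y_columns] for row in X]


# Assumed example basis: the standard 2D basis rotated by 30 degrees.
theta = math.radians(30)
a1 = [math.cos(theta), math.sin(theta)]
a2 = [-math.sin(theta), math.cos(theta)]

# The basis vectors are the rows of A^T and therefore the columns of A.
A_T = [a1, a2]
A = transpose(A_T)

product = matmul(A_T, A)
identity = [[1.0, 0.0], [0.0, 1.0]]

# Compare entry by entry with a tolerance, since the arithmetic is floating point.
is_identity = all(
    math.isclose(product[i][j], identity[i][j], abs_tol=1e-9)
    for i in range(2)
    for j in range(2)
)
print(is_identity)  # True
```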

This is why we prefer to work with orthonormal basis vectors wherever possible:

* The inverse is really easy to compute (it's just the transpose)
* The transformation is reversible (it doesn't collapse any dimensions)
* Projections are just dot products (instead of complicated matrix multiplications)
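
The points above can be sketched in a few lines of plain Python. The basis (a 30 degree rotation of the standard 2D basis), the vector `v`, and the helper `dot` are assumptions chosen for illustration. The coordinates of `v` in the new basis come from one dot product per basis vector (the same result as $A^Tv$), and multiplying back by $A$ recovers `v`.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


# Assumed example basis: the standard 2D basis rotated by 30 degrees.
theta = math.radians(30)
a1 = (math.cos(theta), math.sin(theta))
a2 = (-math.sin(theta), math.cos(theta))
basis = [a1, a2]

v = (2.0, 1.0)

# Projection onto each basis vector is a single dot product (this is A^T v).
coords = [dot(v, a) for a in basis]

# Reversing the transform: A times coords is the sum of coords[i] * a_i.
v_back = [sum(c * a[k] for c, a in zip(coords, basis)) for k in range(len(v))]

recovered = all(math.isclose(x, y, abs_tol=1e-9) for x, y in zip(v, v_back))
print(recovered)  # True
```

No matrix inversion routine is needed at any point, which is the practical benefit of an orthonormal basis.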