# Calculating the Matrix Inverse

Although calculating matrix inverses is usually complicated, you will soon learn a **universal algorithm** to compute them, as well as a **general expression** involving determinants.

To develop this algorithm, we will rely on the **elementary row operations** that you already know well. In fact, **elementary matrices** will serve as the fundamental building blocks of the method.

So, without further delay, let us move on to this algorithm.


## Elementary Matrices

As you know, applying an **elementary row operation** to a matrix $A$ is equivalent to applying that same operation to the **identity matrix**, obtaining a new matrix $E$, and then multiplying:
$$
EA.
$$
For this reason, the matrix $E$ is called an **elementary matrix**.
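The following is a minimal sketch of this equivalence in plain Python. The helper names `identity`, `matmul`, and `add_multiple` are illustrative choices and not part of any library. It applies one row operation (add $-2$ times the first row to the second) both directly to a sample matrix $A$ and through the elementary matrix $E$.

```python
def identity(n):
    """Return the n x n identity matrix as a list of rows."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def add_multiple(M, src, dst, c):
    """Return a copy of M with c times row src added to row dst (0-based indices)."""
    out = [row[:] for row in M]
    out[dst] = [d + c * s for d, s in zip(M[dst], M[src])]
    return out

A = [[2, 3], [1, 4]]  # sample matrix chosen for illustration

# Apply the operation to the identity to obtain the elementary matrix E.
E = add_multiple(identity(2), src=0, dst=1, c=-2)

# Apply the same operation directly to A, and compare with the product EA.
direct = add_multiple(A, src=0, dst=1, c=-2)
product = matmul(E, A)

print(E)                  # [[1, 0], [-2, 1]]
print(product)            # [[2, 3], [-3, -2]]
print(product == direct)  # True
```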

To find the inverse of a matrix, we will multiply a sequence of elementary matrices. The first important observation is that **every elementary matrix is invertible**, and moreover, its inverse is easy to compute.

### Example: Row Exchange

Consider the elementary matrix obtained by swapping the first two rows of the identity matrix:
$$
E =
\begin{pmatrix}
0 & 1 \\
1 & 0
\end{pmatrix}.
$$

If you swap the rows again, you recover the identity matrix. This is equivalent to premultiplying by the same elementary matrix $E$. Therefore, the inverse of $E$ is $E$ itself. Indeed,
$$
EE =
\begin{pmatrix}
0 & 1 \\
1 & 0
\end{pmatrix}
\begin{pmatrix}
0 & 1 \\
1 & 0
\end{pmatrix}
=
\begin{pmatrix}
1 & 0 \\
0 & 1
\end{pmatrix}
= I.
$$

### Inverses of Elementary Matrices

In general, the inverse of an elementary matrix is the elementary matrix that performs the **inverse row operation**:

- **Multiply a row by a nonzero scalar $c$**
  Inverse operation: multiply the same row by $\frac{1}{c}$.

- **Add a scalar multiple of one row to another**
  Inverse operation: subtract the same scalar multiple from that row.

- **Swap two rows**
  Inverse operation: swap the same two rows again (the matrix is its own inverse).

For example, suppose the elementary matrix $E$ is obtained from the identity by multiplying a row by $5$. Multiplying that same row by $\frac{1}{5}$ recovers the identity. Let $F$ be the elementary matrix corresponding to this inverse operation. Then
$$
EF = I \quad \text{and} \quad FE = I,
$$
which implies
$$
F = E^{-1}.
$$
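As a quick check of the three cases, the sketch below builds one elementary matrix of each type for $3 \times 3$ matrices, together with the matrix of the inverse operation, and verifies that both products give the identity. The specific rows and scalars are arbitrary choices for illustration, and `matmul` is a small helper defined here.

```python
from fractions import Fraction

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

I = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

# Multiply the second row by 5; the inverse operation multiplies it by 1/5.
E_scale = [[1, 0, 0], [0, 5, 0], [0, 0, 1]]
F_scale = [[1, 0, 0], [0, Fraction(1, 5), 0], [0, 0, 1]]

# Add 3 times the first row to the third; the inverse subtracts it again.
E_add = [[1, 0, 0], [0, 1, 0], [3, 0, 1]]
F_add = [[1, 0, 0], [0, 1, 0], [-3, 0, 1]]

# Swap the first two rows; this matrix is its own inverse.
E_swap = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]

pairs = [(E_scale, F_scale), (E_add, F_add), (E_swap, E_swap)]
checks = [matmul(E, F) == I and matmul(F, E) == I for E, F in pairs]
print(checks)  # [True, True, True]
```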

### Products and Inverses

Finally, it is important to note that **products and inverses interact nicely**.

If $A$ and $B$ are invertible matrices, then their product $AB$ is also invertible, and its inverse is given by
$$
(AB)^{-1} = B^{-1} A^{-1}.
$$


### Proof

To prove the formula for the inverse of a product, we only need to check the **definition of the inverse matrix**.

#### First direction

On one hand, we compute:
$$
(AB)(B^{-1}A^{-1})
= A(BB^{-1})A^{-1}
= AIA^{-1}
= AA^{-1}
= I.
$$

#### Second direction

On the other hand:
$$
(B^{-1}A^{-1})(AB)
= B^{-1}(A^{-1}A)B
= B^{-1}IB
= B^{-1}B
= I.
$$

Since both products yield the identity matrix, we conclude that
$$
(AB)^{-1} = B^{-1}A^{-1}.
$$

This completes the proof.
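To see why the order of the factors matters, here is a small numerical check. The matrices $A$ and $B$ and their inverses are hard-coded assumptions chosen for illustration, and `matmul` is a helper defined in the block. The reversed product $B^{-1}A^{-1}$ inverts $AB$, while $A^{-1}B^{-1}$ does not.

```python
from fractions import Fraction

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

I = [[1, 0], [0, 1]]

# Sample invertible matrices with their known inverses (illustrative values).
A = [[2, 3], [1, 4]]
A_inv = [[Fraction(4, 5), Fraction(-3, 5)], [Fraction(-1, 5), Fraction(2, 5)]]
B = [[1, 2], [0, 1]]
B_inv = [[1, -2], [0, 1]]

AB = matmul(A, B)

# Reversed order: B^{-1} A^{-1} is a two-sided inverse of AB.
right_order = matmul(B_inv, A_inv)
print(matmul(AB, right_order) == I, matmul(right_order, AB) == I)  # True True

# Same order: A^{-1} B^{-1} is not the inverse of AB for these matrices.
wrong_order = matmul(A_inv, B_inv)
print(matmul(AB, wrong_order) == I)  # False
```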


## How to Find the Inverse

Let us put all the previous results together.

Let $A$ be a square matrix of size $n \times n$. Then $A$ is **invertible if and only if** its **reduced row echelon form** is the identity matrix. That is, there exist elementary matrices
$$
E_1, E_2, \ldots, E_k
$$
such that
$$
E_k \cdots E_2 E_1 A = I.
$$

In this case, the inverse of $A$ is given by
$$
A^{-1} = E_k \cdots E_2 E_1.
$$

In other words, the inverse of $A$ is the product of all the elementary matrices that reduce $A$ to the identity, written so that the first operation applied, $E_1$, is the rightmost factor and the last one, $E_k$, is the leftmost.


### Proof

If
$$
E_k \cdots E_2 E_1 A = I,
$$
then, because each $E_i$ is invertible and a product of invertible matrices is invertible, multiplying both sides on the left by
$$
(E_k \cdots E_2 E_1)^{-1}
$$
gives
$$
A = (E_k \cdots E_2 E_1)^{-1}.
$$

Therefore, multiplying on the right by $E_k \cdots E_2 E_1$, we obtain
$$
A (E_k \cdots E_2 E_1)
= (E_k \cdots E_2 E_1)^{-1}(E_k \cdots E_2 E_1)
= I.
$$

Together with the hypothesis $E_k \cdots E_2 E_1 A = I$, this shows that $A$ is invertible, and its inverse is
$$
A^{-1} = E_k \cdots E_2 E_1.
$$


#### Lemma

For the remainder of the proof, we need the following result.

Let $B$ be an **invertible matrix**, and let
$$
E_1, E_2, \ldots, E_k
$$
be **elementary matrices**.

If the matrix $T$ defined by
$$
T = E_k \cdots E_2 E_1 B
$$
is in **row-echelon form**, then **all entries on its main diagonal are nonzero**.


#### Proof of the Lemma

Suppose that there is an entry on the **main diagonal** of $T$ that is equal to zero. That is, there exists some
$$
i \in \{1, 2, \ldots, n\}
$$
such that
$$
t_{ii} = 0.
$$

Recall that $T$ is in **row-echelon form**, so the first nonzero entry (the pivot) of each row appears strictly **to the right** of the pivot of the row above it. In a square matrix, this means that the pivot of the $r$-th row lies in the $r$-th column or further to the right. Since $t_{ii} = 0$, the $i$-th row has no pivot in the $i$-th column, so its pivot, if it has one, lies in column $i+1$ or beyond.

Every row below the $i$-th one must then place its pivot at least one column further to the right than the row above it. Because $T$ is a **square matrix**, the columns run out before the rows do, and this shift forces the last row to have **no nonzero entries at all**. Hence, the last row of $T$ is the **zero vector**.

Now, take any square matrix $C$. What does the last row of the product $TC$ look like? Since the last row of $T$ is the zero vector, the last row of $TC$ is also the zero vector. Consequently, $TC$ cannot be the identity matrix $I$.

Therefore, no matrix $C$ can serve as the inverse of $T$. Since $C$ was chosen arbitrarily, this means that **$T$ is not invertible**. But $T = E_k \cdots E_2 E_1 B$ is a product of invertible matrices, so it must be invertible.

This contradiction shows that every diagonal entry of $T$ must be **nonzero**, completing the proof.
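As a small illustration of the argument (the matrix below is chosen only for this example and is not part of the proof), consider the row-echelon matrix

$$
T =
\begin{pmatrix}
1 & 2 & 3 \\
0 & 0 & 4 \\
0 & 0 & 0
\end{pmatrix}.
$$

Here $t_{22} = 0$, so the pivot of the second row sits in the third column, and the third row has no column left for a pivot: it is the zero row. For any $3 \times 3$ matrix $C$ with rows $C_1, C_2, C_3$, the last row of $TC$ is

$$
0 \cdot C_1 + 0 \cdot C_2 + 0 \cdot C_3 = (0, 0, 0),
$$

so $TC \neq I$, and $T$ cannot be obtained by row reducing an invertible matrix.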


Great. Now let us assume that the matrix $A$ is **invertible**.

First, put $A$ into **row-echelon form** by applying some $l$ elementary row operations. By the lemma, all entries on the **main diagonal** of the matrix $E_l \cdots E_2 E_1 A$ are **nonzero**.

Next, by applying additional elementary row operations, we can convert all diagonal entries into $1$'s. After that, we can continue applying elementary row operations to transform all entries **outside the diagonal** into zeros. In this way, the matrix is transformed into the **identity matrix**. Hence, the reduced row echelon form of $A$ is $I$.

Suppose that the total number of elementary row operations required is $k$. Then there exist elementary matrices $E_1, E_2, \ldots, E_k$ such that $E_k \cdots E_2 E_1 A = I.$

Since $A$ is invertible, we have
$$
A^{-1} = IA^{-1}
       = (E_k \cdots E_2 E_1 A)A^{-1}
       = E_k \cdots E_2 E_1 (AA^{-1})
       = E_k \cdots E_2 E_1.
$$

Thus, the inverse of $A$ is obtained by multiplying together all the elementary matrices used to reduce $A$ to the identity.


Although it may not seem so at first sight, this result gives us an infallible algorithm to find the inverse of any invertible matrix:

1. Find the reduced row echelon form of $A$.
2. If the result is the identity, you can move forward; otherwise, $A$ is not invertible.
3. Save all the elementary matrices $E_1,E_2,\ldots,E_k$ used in the previous process.
4. Then $A^{-1}$ is the product of these elementary matrices, written with the last one used as the leftmost factor. That is,
   $$
   A^{-1}=E_k\cdots E_2E_1.
   $$

But wait a minute. The product $E_k\cdots E_2E_1$ is the same as $(E_k\cdots E_2E_1)I$. This means that the operations applied to $A$ in order to transform it into $I$ then have to be applied again to $I$ in the same order to convert it into $A^{-1}$. So why not apply the operations at the same time on both matrices?

First, join the two matrices into an augmented one by putting $I$ to the right of $A$:
$$
[A\mid I].
$$

After that, start applying elementary row operations to the augmented matrix in order to row reduce $A$. This means that every operation applied to $A$ is immediately applied to $I$. At the end, when on the left $A$ has become $I$, on the right $I$ will have become $A^{-1}$:
$$
[I\mid A^{-1}].
$$

For example, the inverse of the matrix
$$
A=
\begin{pmatrix}
2 & 3\\
1 & 4
\end{pmatrix}
$$
is
$$
A^{-1}=
\begin{pmatrix}
\frac{4}{5} & -\frac{3}{5}\\
-\frac{1}{5} & \frac{2}{5}
\end{pmatrix}.
$$

To summarize the method: build the augmented matrix $[A\mid I]$ and apply row operations until the left block $A$ becomes $I$. At that point, the right block, which started as $I$, has become $A^{-1}$. You can visualize it in the following way:

![Matrix Inverse](img/matrix_inverse.png)
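The augmented-matrix procedure can be sketched in a few lines of Python. This is a minimal illustration under some assumptions: the function name `inverse_by_row_reduction` is hypothetical, the input is a square matrix given as a list of rows, and exact fractions are used so that no rounding occurs. The order of row operations it chooses may differ from a hand computation, but the resulting inverse is the same.

```python
from fractions import Fraction

def inverse_by_row_reduction(A):
    """Row reduce [A | I] exactly; return A^{-1}, or None if A is not invertible."""
    n = len(A)
    # Build the augmented matrix [A | I].
    M = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(A)]
    for col in range(n):
        # Look for a row, at or below the diagonal, with a nonzero entry in this column.
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            return None  # no pivot available, so the reduced form cannot be I
        M[col], M[pivot] = M[pivot], M[col]   # swap two rows
        p = M[col][col]
        M[col] = [x / p for x in M[col]]      # scale the row so the pivot becomes 1
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                # Subtract a multiple of the pivot row to clear the entry.
                M[r] = [x - factor * y for x, y in zip(M[r], M[col])]
    # The left block is now I, so the right block is the inverse.
    return [row[n:] for row in M]

inv = inverse_by_row_reduction([[2, 3], [1, 4]])
print([[str(x) for x in row] for row in inv])       # [['4/5', '-3/5'], ['-1/5', '2/5']]
print(inverse_by_row_reduction([[1, 2], [2, 4]]))   # None
```

Every operation is applied to whole rows of the augmented matrix, so the right block receives exactly the operations that reduce the left block.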

### An invertible matrix

Suppose you want to find the inverse of the following matrix:

$$
A=
\begin{pmatrix}
2 & 3\\
1 & 4
\end{pmatrix}.
$$

The first step is to build the augmented matrix:

$$
[A\mid I]
=
\left(\begin{array}{cc|cc}
2 & 3 & 1 & 0\\
1 & 4 & 0 & 1
\end{array}\right).
$$

Now row reduce the left block until it reaches its reduced row echelon form. You can start by interchanging the two rows:

$$
\left(\begin{array}{cc|cc}
1 & 4 & 0 & 1\\
2 & 3 & 1 & 0
\end{array}\right).
$$

Next, subtract two times the first row from the second one:

$$
\left(\begin{array}{cc|cc}
1 & 4 & 0 & 1\\
0 & -5 & 1 & -2
\end{array}\right).
$$

Then divide the second row by $-5$:

$$
\left(\begin{array}{cc|cc}
1 & 4 & 0 & 1\\
0 & 1 & -\frac{1}{5} & \frac{2}{5}
\end{array}\right).
$$

Finally, subtract $4$ times the second row from the first one:

$$
\left(\begin{array}{cc|cc}
1 & 0 & \frac{4}{5} & -\frac{3}{5}\\
0 & 1 & -\frac{1}{5} & \frac{2}{5}
\end{array}\right).
$$

You've just transformed $A$ into $I$, so the right block is the desired inverse:

$$
A^{-1}=
\frac{1}{5}
\begin{pmatrix}
4 & -3\\
-1 & 2
\end{pmatrix}.
$$
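The four row operations used in this example correspond to four elementary matrices. As a sketch (the names `E1` to `E4` and the helper `matmul` are illustrative), multiplying them with the last operation on the left reproduces the same inverse:

```python
from fractions import Fraction

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

A = [[2, 3], [1, 4]]

E1 = [[0, 1], [1, 0]]                # interchange the two rows
E2 = [[1, 0], [-2, 1]]               # subtract 2 times row 1 from row 2
E3 = [[1, 0], [0, Fraction(-1, 5)]]  # divide row 2 by -5
E4 = [[1, -4], [0, 1]]               # subtract 4 times row 2 from row 1

# The first operation applied is the rightmost factor: E4 E3 E2 E1.
P = matmul(E4, matmul(E3, matmul(E2, E1)))

expected = [[Fraction(4, 5), Fraction(-3, 5)], [Fraction(-1, 5), Fraction(2, 5)]]
print(P == expected)                    # True
print(matmul(P, A) == [[1, 0], [0, 1]])  # True
```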

## The inverse in terms of determinants

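The determinant is a versatile tool, and you can even use it to build the inverse of a matrix $A$. Recall that the cofactor $c_{ij}$ equals $(-1)^{i+j}m_{ij}$, where the minor $m_{ij}$ is the determinant of the matrix obtained from $A$ by removing its $i$-th row and $j$-th column. The first step is to build a new matrix with all the cofactors of $A$: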

$$
\operatorname{cof}(A)=
\begin{pmatrix}
c_{11} & c_{12} & \cdots & c_{1n}\\
c_{21} & c_{22} & \cdots & c_{2n}\\
\vdots & \vdots & \ddots & \vdots\\
c_{n1} & c_{n2} & \cdots & c_{nn}
\end{pmatrix}
\in \mathbb{R}^{n\times n}.
$$

Then you only have to transpose it and divide every entry by the determinant of the original matrix. The whole process is the following:

If $A$ is invertible, then

$$
A^{-1}=\frac{1}{\det(A)}\,\operatorname{cof}(A)^{T}.
$$
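A direct translation of this formula into Python might look as follows. This is only a sketch under stated assumptions: the helper names `minor`, `det`, and `cofactor_inverse` are hypothetical, the entries are assumed to be integers, and the determinant is computed by recursive cofactor expansion, which is fine for small matrices but slow for large ones.

```python
from fractions import Fraction

def minor(A, i, j):
    """Return A without row i and column j (0-based indices)."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(A) if k != i]

def det(A):
    """Determinant by cofactor expansion along the first row."""
    if len(A) == 0:
        return 1  # convention: the empty matrix has determinant 1
    return sum((-1) ** j * A[0][j] * det(minor(A, 0, j)) for j in range(len(A)))

def cofactor_inverse(A):
    """Return (1 / det A) * cof(A)^T, or None when det A = 0."""
    n = len(A)
    d = det(A)
    if d == 0:
        return None
    # Cofactor matrix: c_ij = (-1)^(i+j) * m_ij.
    cof = [[(-1) ** (i + j) * det(minor(A, i, j)) for j in range(n)] for i in range(n)]
    # Entry (i, j) of the inverse is c_ji / det(A): transpose, then divide.
    return [[Fraction(cof[j][i], d) for j in range(n)] for i in range(n)]

inv = cofactor_inverse([[2, 3], [1, 4]])
print([[str(x) for x in row] for row in inv])  # [['4/5', '-3/5'], ['-1/5', '2/5']]
```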

### Proof

As always, proofs are optional. For this one, you should be familiar with the properties of the determinant.

First, let us denote $X=A^{-1}$. The strategy is to get every entry of $X$. You know that $AX=I$. Let $j\in\{1,\ldots,n\}$. On one hand, by the matrix product properties, the $j$-th column of $AX$ is the product between $A$ and the $j$-th column of $X$, that is $AX_j$. On the other hand, as $AX=I$, the $j$-th column of $AX$ is just the $j$-th column of $I$, which is simply $e_j$. Thus:

$$
AX_j=e_j.
$$

Nice! This is a simple system of linear equations with unknown vector $X_j$, so you can apply Cramer's Rule to get every entry of $X_j$. Writing $A^{(i)}$ for the matrix obtained from $A$ by replacing its $i$-th column $A_i$ with $e_j$, for every $i\in\{1,\ldots,n\}$:

$$
x_{ij}=\frac{\det(A^{(i)})}{\det(A)}
=\frac{\det\big([A_1\mid\cdots\mid A_{i-1}\mid e_j\mid A_{i+1}\mid\cdots\mid A_n]\big)}{\det(A)}.
$$

Here comes the important part. Look at the numerator. It is the determinant of a matrix whose $i$-th column is the vector $e_j$. That is a column with a bunch of zeros: all its entries are $0$, except the $j$-th one which is $1$. Therefore, you can expand the determinant along this column, which means that only one term will survive:

$$
\det\big([A_1\mid\cdots\mid A_{i-1}\mid e_j\mid A_{i+1}\mid\cdots\mid A_n]\big)
=c_{ji}=(-1)^{i+j}m_{ji}.
$$

But the minor $m_{ji}$ of the matrix
$[A_1\mid\cdots\mid A_{i-1}\mid e_j\mid A_{i+1}\mid\cdots\mid A_n]$
is exactly the minor $m_{ji}$ of the matrix $A$, because in both cases, after removing the $j$-th row and the $i$-th column, the result is the same (after all, both matrices are the same except for the $i$-th column, which is exactly the one you removed!). Thus

$$
x_{ij}
=\frac{\det\big([A_1\mid\cdots\mid A_{i-1}\mid e_j\mid A_{i+1}\mid\cdots\mid A_n]\big)}{\det(A)}
=\frac{(-1)^{i+j}m_{ji}}{\det(A)}.
$$

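Finally, by the definition of transpose, $c_{ji}=(-1)^{i+j}m_{ji}$ is the $(i,j)$ entry of $\operatorname{cof}(A)^T$. Hence $x_{ij}$ is the $(i,j)$ entry of $\frac{1}{\det(A)}\operatorname{cof}(A)^T$, as claimed.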

$\blacksquare$
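As a quick check of the argument on a concrete case, take the matrix $A=\begin{pmatrix}2 & 3\\ 1 & 4\end{pmatrix}$, for which $\det(A)=2\cdot 4-3\cdot 1=5$, and compute the entry $x_{12}$ (so $i=1$ and $j=2$). Replacing the first column of $A$ with $e_2$ gives

$$
x_{12}
=\frac{\det\begin{pmatrix}0 & 3\\ 1 & 4\end{pmatrix}}{\det(A)}
=\frac{0\cdot 4-3\cdot 1}{5}
=-\frac{3}{5}.
$$

On the other hand, removing the second row and the first column of $A$ leaves the single entry $3$, so $m_{21}=3$ and

$$
\frac{c_{21}}{\det(A)}=\frac{(-1)^{1+2}\,m_{21}}{\det(A)}=-\frac{3}{5},
$$

which agrees with the value obtained from Cramer's Rule.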

As a simple but useful example, you can use this formula to find the inverse of any invertible matrix of size $2\times 2$.
Computing the four cofactors of a general matrix
$$
A=
\begin{pmatrix}
a & b\\
c & d
\end{pmatrix},
$$
you get that
$$
\operatorname{cof}(A)=
\begin{pmatrix}
d & -c\\
-b & a
\end{pmatrix}.
$$
Thus,
$$
A^{-1}
=\frac{1}{\det(A)}\,\operatorname{cof}(A)^T
=\frac{1}{ad-bc}
\begin{pmatrix}
d & -b\\
-c & a
\end{pmatrix}.
$$

Let us apply this to the matrix whose inverse you have already calculated,
$$
A=
\begin{pmatrix}
2 & 3\\
1 & 4
\end{pmatrix}.
$$
Then
$$
\det(A)=2\cdot4-3\cdot1=8-3=5.
$$
Hence,
$$
A^{-1}
=\frac{1}{5}
\begin{pmatrix}
4 & -3\\
-1 & 2
\end{pmatrix}.
$$

This is exactly the same result as before! You did not need to apply row operations or other cumbersome computations to obtain it.
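The $2\times 2$ formula is short enough to write as a one-step function. The sketch below assumes integer entries, and the function name `inverse_2x2` is a hypothetical choice for illustration.

```python
from fractions import Fraction

def inverse_2x2(a, b, c, d):
    """Inverse of [[a, b], [c, d]] via the determinant formula, or None if singular."""
    det = a * d - b * c
    if det == 0:
        return None  # the formula requires a nonzero determinant
    # Swap the diagonal entries, negate the off-diagonal ones, divide by det.
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]

inv = inverse_2x2(2, 3, 1, 4)
print([[str(x) for x in row] for row in inv])  # [['4/5', '-3/5'], ['-1/5', '2/5']]
print(inverse_2x2(1, 2, 2, 4))                 # None
```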

## Conclusion

- Every **elementary matrix** is invertible, and its inverse is also an elementary matrix.

- The product of two invertible matrices $A$ and $B$ is invertible, and its inverse is given by
  $$
  (AB)^{-1} = B^{-1}A^{-1}.
  $$

- A matrix $A$ is invertible **if and only if** its **reduced row echelon form** is the identity matrix $I$.
  In this case, the inverse of $A$ is obtained by applying to $I$ **exactly the same elementary row operations** used to transform $A$ into $I$.

- If $A$ is invertible, then its inverse can also be expressed in terms of determinants as
  $$
  A^{-1} = \frac{1}{\det(A)} \operatorname{cof}(A)^T,
  $$
  where $\operatorname{cof}(A)$ denotes the **cofactor matrix** of $A$.

- For any invertible $2 \times 2$ matrix
  $$
  A =
  \begin{pmatrix}
  a & b \\
  c & d
  \end{pmatrix},
  $$
  its inverse is obtained by swapping the diagonal entries, changing the signs of the off-diagonal entries, and dividing by the determinant:
  $$
  A^{-1}
  =
  \frac{1}{ad - bc}
  \begin{pmatrix}
  d & -b \\
  -c & a
  \end{pmatrix}.
  $$
