# Linear Algebra

In the final part of this lesson, we are going to focus on a branch of mathematics called linear algebra. Many concepts in linear algebra are central to image processing and statistical analysis, as we will see as the course progresses. However, we recognise that this prospect may be filling some of you with anxiety and dread, especially if you are one of those people who says "I can't do maths" or "I just don't get maths". The key thing to stress is that we will be emphasising conceptual knowledge over working knowledge. In other words, we will not be asking you to solve equations or write proofs, but we do need you to know what a matrix is, what a vector is, what a matrix inverse is used for and so on. The aim is that after reading this section you will be able to see an equation such as

$$
\mathbf{A} = \mathbf{B}^{\prime}\mathbf{C}^{-1}\mathbf{D} + \mathbf{E}
$$

and have a degree of confidence in understanding what is going on. You will not have the skills to manipulate the equation, or even understand where the equation comes from, but you will have some understanding of the different elements of the equation and what they mean. In addition, we will not be expecting you to do any arithmetic. Instead, we encourage exploration of linear algebra through playing around inside MATLAB to solidify your understanding, without having to go through the pain of calculating things manually. So hopefully, this will be relatively painless, even if you have concerns about your mathematical ability.

## What is linear algebra?

We will start with the big-picture view of what linear algebra is. Linear algebra is all about linear equations and their representation using mathematical objects called matrices. A linear equation is simply an equation where some numbers are multiplied by other numbers and then added together to form a weighted sum. For example, the equation below is a linear equation

$$
y = a_{1}x_{1} + a_{2}x_{2} + b
$$

where we have two *variables* ($x_{1}$ and $x_{2}$) and three *coefficients* ($a_{1}$, $a_{2}$ and $b$). Each variable is multiplied by its respective coefficient and the products are added together to form $y$. The important point here is that the variables can take on any value (hence they can vary), whereas the coefficients are usually fixed. This is easiest to see when we have an equation with a single variable such as

$$
y = 0.5x + 1
$$

where $a_{1} = 0.5$ and $b = 1$. By using different values for $x$, we can calculate all the points on the straight line shown in {numref}`linear-eq-graph-fig`. Here we can see that the coefficient $a_{1}$ is the *slope* of the line and the coefficient $b$ represents the *intercept* of the line (the point where the line intersects the *y*-axis). If there were no $b$ (that is, if $b = 0$), the line would be forced to pass through the origin.

```{figure} images/linear-eq-graph.png
---
width: 450px
name: linear-eq-graph-fig
---
Example of the straight line described by the linear equation $y = 0.5x + 1$.
```
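To make this concrete, here is a minimal MATLAB sketch that evaluates the equation for a handful of values of $x$. The variable names and the particular values of `x` are our own choices for illustration.

```matlab
a1 = 0.5;        % slope coefficient
b  = 1;          % intercept coefficient
x  = 0:4;        % example values for the variable x (chosen arbitrarily)
y  = a1*x + b    % one value of y for each value of x
```

When $x = 0$ the result is the intercept, and each one-unit step in $x$ raises $y$ by the slope.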

So, in brief, a linear equation is a weighted sum that can be used to define points along a straight line. But what does that have to do with linear algebra? Within many branches of science, linear equations are used extensively to solve real-world problems. However, real-world applications often need more than one equation. These multiple equations are known as a system of linear equations. These equations share the same coefficients, but have different values for the variables. What matrices allow us to do is represent and solve a system of linear equations. In a sense, they provide a means of organising information to make large systems of linear equations easier to deal with. So although there are deeper mathematical interpretations of matrices, for our purposes you can just think of them as a way of organising many numbers in one place.
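As a rough illustration of "same coefficients, different values for the variables", the sketch below evaluates the two-variable equation from earlier three times at once. All of the numbers are made up for the example, and storing the values in lists like this is only one possible way of organising them.

```matlab
a1 = 2; a2 = 3; b = 1;   % coefficients shared by every equation (made-up values)
x1 = [1 4 0];            % three example values of the first variable
x2 = [2 1 5];            % the matching values of the second variable
y  = a1*x1 + a2*x2 + b   % three weighted sums, one per equation
```

Each entry of `y` comes from one equation in the system. Keeping the numbers together in this way is the organisational role that matrices and vectors play.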

## Matrices and Vectors

In terms of a formal definition, a matrix is an array of numbers that has a set number of rows and a set number of columns. For example, the following shows that the matrix $\mathbf{A}$ contains 2 rows and 3 columns.

$$
\mathbf{A} = 
\begin{bmatrix}
1 & 5 & 6 \\
4 & 8 & 2
\end{bmatrix}
$$

Matrices are written in shorthand by using **bold** uppercase letters, to distinguish them from regular variables. We can refer to an individual element of a matrix using the lowercase version of the letter, along with indices to show which row and column we are referring to. For instance, $a_{12}$ refers to the element at row 1, column 2 of matrix $\mathbf{A}$. Writing out every element of $\mathbf{A}$ in this way gives

$$
\begin{align}
a_{11} &= 1, \quad a_{12} = 5, \quad a_{13} = 6 \\
a_{21} &= 4, \quad a_{22} = 8, \quad a_{23} = 2
\end{align}
$$
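The same indexing idea carries over to MATLAB, where the row and column indices go inside round brackets. This is a small sketch using the matrix $\mathbf{A}$ defined above.

```matlab
A = [1 5 6; 4 8 2];   % the 2-by-3 matrix from the text; a semicolon starts a new row
size(A)               % number of rows, then number of columns
A(1,2)                % the element a_12 (row 1, column 2)
A(2,3)                % the element a_23 (row 2, column 3)
```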

In comparison, a vector is either a single row or a single column from a matrix. As such, we can have row vectors or column vectors, depending on their orientation. For example, we can see below a row vector $\mathbf{B}$ and a column vector $\mathbf{C}$. Sometimes vectors are written using bold lowercase letters to distinguish them from matrices, but this is not universal.

$$
\begin{align}
\mathbf{B} &= 
\begin{bmatrix}
1 & 5 & 6
\end{bmatrix} \\
\mathbf{C} &=
\begin{bmatrix}
5 \\
8
\end{bmatrix}
\end{align}
$$

We can refer to individual elements in the same way as a matrix, but we only need a single index. For instance

$$
b_{1} = 1, b_{2} = 5, b_{3} = 6
$$
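In MATLAB, the orientation of a vector is set by how its elements are separated. The sketch below recreates $\mathbf{B}$ and $\mathbf{C}$ from above and picks out elements using a single index.

```matlab
B = [1 5 6];    % row vector: elements separated by spaces
C = [5; 8];     % column vector: elements separated by semicolons
size(B)         % 1 row and 3 columns
size(C)         % 2 rows and 1 column
B(2)            % the element b_2
C(1)            % the element c_1
```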

## Matrix Addition and Subtraction

Just like regular numbers, the operations of addition and subtraction also apply to matrices. So we can add matrix $\mathbf{A}$ to matrix $\mathbf{B}$ to get a new matrix $\mathbf{C}$

$$
\mathbf{C} = \mathbf{A} + \mathbf{B}
$$

However, unlike regular numbers, there are rules for matrix addition. For instance, you can only add two matrices or two vectors with identical numbers of rows and columns. This is because when you add matrices together you add the individual elements of one matrix to the other. This only works if there are the same number of elements in each matrix. We can see this in the MATLAB code below.

```matlab
A = [1 2 3; 4 5 6]       % 2-by-3 matrix
B = [7 8 9; 10 11 12]    % another 2-by-3 matrix
C = A + B                % element-by-element sum, also 2-by-3
```
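To see what happens when the rule is broken, the sketch below tries to add a 2-by-3 matrix to a 2-by-2 matrix. The matrix `D` and the variable `sameSize` are hypothetical names introduced only for this illustration, and the exact wording of the error message depends on your MATLAB version.

```matlab
A = [1 2 3; 4 5 6];                    % 2-by-3
D = [1 2; 3 4];                        % 2-by-2 (made-up matrix for illustration)
sameSize = isequal(size(A), size(D))   % compare the dimensions before adding
try
    E = A + D;                         % fails: the dimensions do not match
catch err
    disp(err.message)                  % show MATLAB's explanation of the failure
end
```

One caveat worth knowing: from release R2016b, MATLAB is more permissive than the mathematical rule in certain special cases. For example, it will add a row vector to a column vector by automatically expanding both. The mathematical definition used in this lesson still requires identical dimensions.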

Formally, we can write that

$$
\mathbf{C} = \mathbf{A} + \mathbf{B} =
\begin{bmatrix}
a_{11} + b_{11} & a_{12} + b_{12} & \dots  & a_{1k} + b_{1k} \\
a_{21} + b_{21} & a_{22} + b_{22} & \dots  & a_{2k} + b_{2k} \\
\vdots          & \vdots          & \ddots & \vdots          \\
a_{j1} + b_{j1} & a_{j2} + b_{j2} & \dots  & a_{jk} + b_{jk} \\
\end{bmatrix}
$$

where the ellipses just mean "continue the pattern along this direction". Matrix subtraction works in the same way, so we have

$$
\mathbf{C} = \mathbf{A} - \mathbf{B}
$$

which, again, is only defined when $\mathbf{A}$ and $\mathbf{B}$ have the same dimensions. For instance

```matlab
C = A - B    % element-by-element difference of the matrices defined earlier
```
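If it helps to see the element-by-element rule spelled out, the sketch below builds the difference one element at a time and checks it against MATLAB's built-in subtraction. The loop and the names `nRows`, `nCols` and `Cmanual` are our own illustration. In practice you would simply write `A - B`.

```matlab
A = [1 2 3; 4 5 6];
B = [7 8 9; 10 11 12];
[nRows, nCols] = size(A);          % dimensions shared by A and B
Cmanual = zeros(nRows, nCols);     % empty matrix to hold the result
for r = 1:nRows
    for c = 1:nCols
        Cmanual(r,c) = A(r,c) - B(r,c);   % subtract matching elements
    end
end
isequal(Cmanual, A - B)            % true if both approaches agree
```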

## Matrix Multiplication