## Gaussian elimination using augmented matrices

Once you've gone through several iterations of Gaussian elimination, you may have realized that you primarily focused on the coefficients and treated the variables as mere placeholders. Taking this observation into account, we can introduce a concise way to describe linear systems.

### Matrix

In mathematics, a matrix is a rectangular array of numbers, symbols, or expressions arranged in rows and columns. It is a fundamental concept in linear algebra.

A matrix is typically denoted by a capital letter, such as $A$, and its individual elements are represented with subscripts. For example, the element in the $i$-th row and $j$-th column of matrix $A$ is denoted as $A_{ij}$.

Here's an example of a $2 \times 3$ matrix:

$$ A = 
\begin{bmatrix}
1 & 2 & 3 \\
4 & 5 & 6 \\
\end{bmatrix}
$$

In this example, the matrix $A$ has two rows and three columns. Reading the entries row by row, $A_{11}$ is $1$, $A_{12}$ is $2$, $A_{13}$ is $3$, $A_{21}$ is $4$, $A_{22}$ is $5$, and $A_{23}$ is $6$.

Matrices can have different sizes and dimensions. The size of a matrix refers to the number of rows and columns it has. In the example above, the size of matrix $A$ is $2 \times 3$, meaning it has $2$ rows and $3$ columns.
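As a small illustration, the matrix $A$ above can be stored as a nested list in Python. This is only a sketch: the helper names `entry` and `size` are chosen here for illustration and are not part of any library. Note that Python counts from 0, while the notation $A_{ij}$ counts from 1.

```python
# The 2 x 3 matrix A from the example, stored row by row.
A = [
    [1, 2, 3],
    [4, 5, 6],
]

def entry(matrix, i, j):
    """Return the entry in row i, column j using 1-based indices, as in A_ij."""
    return matrix[i - 1][j - 1]

def size(matrix):
    """Return (number of rows, number of columns)."""
    return len(matrix), len(matrix[0])

print(entry(A, 1, 3))  # A_13
print(entry(A, 2, 2))  # A_22
print(size(A))         # rows, columns
```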

### Augmented matrices

When we write a linear system, it is important to keep the variables in the same order in every equation. To create an **augmented matrix**, we drop the variables and record only the numerical data in a rectangular array. For example, consider the following system of equations:

$$
\begin{align}
-x - 2y + 2z &= -1 \\
2x + 4y - z &= 5  \\
x + 2y &= 3 \\
\end{align}
$$

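We first fix the order of the variables: $x$ is the first variable, $y$ is the second, and $z$ is the third. Each row then records the coefficients of one equation in that order, followed by its constant term. A variable that is missing from an equation, such as $z$ in the third equation, gets the coefficient $0$. This gives the following augmented matrix: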

$$
\begin{bmatrix}
-1 & -2 & 2 & \big| & -1\\
2 & 4 & -1 & \big| & 5\\
1 & 2 & 0 & \big| & 3
\end{bmatrix}
$$

The vertical line serves as a visual reminder of where the equals signs appear in the equations. Entries to the left of the vertical line are the coefficients of the variables, and entries to the right are the constant terms. When only the coefficients matter, we drop the last column and work with the coefficient matrix:

$$
\begin{bmatrix}
-1 & -2 & 2 \\
2 & 4 & -1 \\
1 & 2 & 0 
\end{bmatrix}
$$
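The relationship between the coefficient matrix and the augmented matrix above can be sketched in Python. The names `coefficients`, `constants`, and `augment` are assumptions made for this illustration. The vertical line has no counterpart in the data, so the last entry of each row is simply understood to be the constant term.

```python
# Coefficients of x, y, z (in that order) for each equation.
coefficients = [
    [-1, -2, 2],
    [2, 4, -1],
    [1, 2, 0],  # x + 2y has no z term, so its coefficient is 0
]
# Right-hand sides of the three equations.
constants = [-1, 5, 3]

def augment(coeffs, rhs):
    """Append each right-hand side value to the matching row of coefficients."""
    return [row + [b] for row, b in zip(coeffs, rhs)]

augmented = augment(coefficients, constants)
for row in augmented:
    print(row)
```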

#### Operations

The three operations we apply to systems of equations can be directly translated into operations on matrices. For example, the replacement operation that involves multiplying the first equation by $2$ and adding it to the second equation can be performed by multiplying the first row of the augmented matrix by $2$ and then adding it to the second row:

$$
\begin{bmatrix}
-1 & -2 & 2 & \big| & -1\\
2 & 4 & -1 & \big| & 5\\
1 & 2 & 0 & \big| & 3
\end{bmatrix}
\sim
\begin{bmatrix}
-1 & -2 & 2 & \big| & -1\\
0 & 0 & 3 & \big| & 3\\
1 & 2 & 0 & \big| & 3
\end{bmatrix}
$$

The presence of the $\sim$ symbol between the matrices signifies that the two matrices are connected through a series of scaling, interchange, and replacement operations. Since these operations are performed on the rows of the matrices, we say that the matrices are **row equivalent**. It is significant to note that linear systems associated with two row equivalent augmented matrices share the same solution space.
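The three row operations can be sketched as small Python functions. The function names and the 0-based row indices are choices made for this illustration. Each function returns a new matrix and leaves its input unchanged.

```python
def interchange(m, i, j):
    """Swap rows i and j (0-based) and return a new matrix."""
    out = [row[:] for row in m]
    out[i], out[j] = out[j], out[i]
    return out

def scale(m, i, c):
    """Multiply row i by a nonzero constant c and return a new matrix."""
    if c == 0:
        raise ValueError("scaling factor must be nonzero")
    out = [row[:] for row in m]
    out[i] = [c * x for x in out[i]]
    return out

def replace(m, target, source, c):
    """Add c times row `source` to row `target` and return a new matrix."""
    out = [row[:] for row in m]
    out[target] = [t + c * s for t, s in zip(out[target], out[source])]
    return out

augmented = [
    [-1, -2, 2, -1],
    [2, 4, -1, 5],
    [1, 2, 0, 3],
]
# Add 2 times the first row to the second row, as in the text.
result = replace(augmented, target=1, source=0, c=2)
print(result[1])  # [0, 0, 3, 3]
```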

#### Reduced row echelon form

A reduced row echelon form (RREF) matrix is a particular form of a matrix that has undergone a series of row operations to simplify its structure and reveal useful information about the associated system of linear equations. In RREF, the matrix satisfies several properties:

1. Leading Entry: In each nonzero row, the leftmost nonzero entry is called the leading entry. It is always equal to 1 (known as a leading 1).

2. Zero Rows: Any row containing only zeros is placed at the bottom of the matrix.

3. Leading 1 Position: The leading 1 in each row is to the right of the leading 1 in the row above it.

4. Pivot Columns: Columns containing leading 1's are called pivot columns. Each leading 1 is the only nonzero entry in its pivot column, so the column has zeros both above and below the leading 1.

The process of transforming a matrix into reduced row echelon form involves applying elementary row operations, which include:

1. Swapping two rows.
2. Multiplying a row by a nonzero scalar.
3. Adding a multiple of one row to another row.

By performing these operations systematically, the matrix can be transformed into RREF. The reduced row echelon form **is unique for a given matrix**, and it provides a concise representation of the system of linear equations associated with the matrix.
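One systematic way to apply these operations is sketched below. It is a minimal illustration rather than a definitive implementation: the function name `rref` is chosen here, the pivot is simply the first nonzero entry found in each column, and `fractions.Fraction` from the standard library is used so that the arithmetic stays exact.

```python
from fractions import Fraction

def rref(matrix):
    """Return the reduced row echelon form of `matrix` as rows of Fractions."""
    m = [[Fraction(x) for x in row] for row in matrix]
    rows, cols = len(m), len(m[0])
    pivot_row = 0
    for col in range(cols):
        if pivot_row == rows:
            break
        # Look for a row at or below pivot_row with a nonzero entry in this column.
        candidates = [r for r in range(pivot_row, rows) if m[r][col] != 0]
        if not candidates:
            continue  # no pivot in this column
        r = candidates[0]
        m[pivot_row], m[r] = m[r], m[pivot_row]          # interchange
        lead = m[pivot_row][col]
        m[pivot_row] = [x / lead for x in m[pivot_row]]  # scaling
        for other in range(rows):                        # replacement
            if other != pivot_row and m[other][col] != 0:
                factor = m[other][col]
                m[other] = [a - factor * b
                            for a, b in zip(m[other], m[pivot_row])]
        pivot_row += 1
    return m

augmented = [
    [-1, -2, 2, -1],
    [2, 4, -1, 5],
    [1, 2, 0, 3],
]
reduced = rref(augmented)
for row in reduced:
    print([str(x) for x in row])
```

Applied to the augmented matrix from the earlier example, the result has rows $(1, 2, 0, 3)$, $(0, 0, 1, 1)$, and $(0, 0, 0, 0)$, which correspond to the equations $x + 2y = 3$ and $z = 1$.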

Here's an example of a matrix in reduced row echelon form:

$$
\begin{bmatrix}
1 & 0 & 2 & 0 \\
0 & 1 & -3 & 0 \\
0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0
\end{bmatrix}
$$

In this matrix, the leading 1's of the first and second rows are in the first and second columns, respectively, and the leading 1 of the third row is in the fourth column. All other entries in the pivot columns are zero. The fourth row consists entirely of zeros and is placed at the bottom of the matrix. This matrix therefore satisfies all the properties of reduced row echelon form listed above.
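The properties listed above can also be checked mechanically. The following sketch uses two helper names chosen for this illustration, `leading_index` and `is_rref`, and tests each property row by row.

```python
def leading_index(row):
    """Return the column index of the leftmost nonzero entry, or None for a zero row."""
    for j, x in enumerate(row):
        if x != 0:
            return j
    return None

def is_rref(matrix):
    """Check the reduced row echelon form properties for a list of rows."""
    seen_zero_row = False
    previous_lead = -1
    for i, row in enumerate(matrix):
        lead = leading_index(row)
        if lead is None:
            seen_zero_row = True
            continue
        if seen_zero_row:
            return False  # a nonzero row sits below a zero row
        if row[lead] != 1:
            return False  # the leading entry is not 1
        if lead <= previous_lead:
            return False  # the leading 1 is not to the right of the one above
        for k, other in enumerate(matrix):
            if k != i and other[lead] != 0:
                return False  # the pivot column has another nonzero entry
        previous_lead = lead
    return True

example = [
    [1, 0, 2, 0],
    [0, 1, -3, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
print(is_rref(example))           # True
print(is_rref([[1, 2], [0, 2]]))  # False: the second row's leading entry is 2
```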

#### Describing the solution space from a matrix in reduced row echelon form

To obtain the solution space of a linear system using the reduced row echelon form (RREF) matrix, you can follow these steps:

1. Convert the augmented matrix of the linear system into its reduced row echelon form using row operations such as interchange, scaling, and replacement.

2. Identify the pivot columns in the RREF matrix. These are the columns that contain leading 1's (the leftmost nonzero entry) in their respective rows.

3. Express the variables corresponding to the pivot columns in terms of the remaining variables. To do this, set each pivot variable equal to the constant term of its row minus the sum of the products of the non-pivot variables and their coefficients in the same row.

4. Assign arbitrary values (parameters) to the remaining variables, which correspond to the non-pivot columns. Each non-pivot variable can be considered as a free variable.

5. Write the solution space using parameterized expressions for the variables. Combine the pivot variable expressions from step 3 with the free variable expressions from step 4 to represent all possible solutions of the linear system.

By following these steps, the solution space of the linear system can be determined using the reduced row echelon matrix.
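Steps 2 to 5 can be sketched in Python as follows. The function name `parametric_description` and its return format are assumptions made for this illustration, and the sketch assumes that the system is consistent. The input matrix is the reduced row echelon form of the augmented matrix from the earlier three-equation example, worked out separately.

```python
def parametric_description(rref_augmented, names):
    """Return (pivot_equations, free_variables) for an augmented matrix in RREF.

    pivot_equations maps each pivot variable to (constant, {free_var: coefficient}),
    meaning: pivot = constant + sum(coefficient * free_var).
    """
    num_vars = len(names)
    pivot_rows = {}
    for row in rref_augmented:
        for j in range(num_vars):
            if row[j] != 0:
                pivot_rows[j] = row  # leftmost nonzero entry marks a pivot column
                break
    free = [names[j] for j in range(num_vars) if j not in pivot_rows]
    equations = {}
    for j, row in pivot_rows.items():
        # Move the non-pivot terms to the right-hand side, flipping their signs.
        coeffs = {names[k]: -row[k]
                  for k in range(num_vars) if k != j and row[k] != 0}
        equations[names[j]] = (row[-1], coeffs)
    return equations, free

reduced = [
    [1, 2, 0, 3],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
]
equations, free = parametric_description(reduced, ["x", "y", "z"])
print(equations)  # x = 3 - 2y and z = 1
print(free)       # y is the free variable
```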

Let's explore some examples.

##### Example 1:

Consider the following matrix in RREF:

$$
\begin{bmatrix}
    1 & 0 & 2 & \big| & -1\\
    0 & 1 & 1 & \big| & 2\\
\end{bmatrix}
$$

This matrix corresponds to the following system of equations:

$$
\begin{align}
x + 0y + 2z &= -1 \\
0x + y + z &= 2
\end{align}
$$

The first and second columns, which correspond to the variables $x$ and $y$, are the pivot columns. We therefore express the pivot variables $x$ and $y$ in terms of the remaining variable $z$:

$$
\begin{align}
x &= -1 - 2z \\
y &= 2 - z
\end{align}
$$

The variable $z$ is not subject to any restriction, so it can take any value. More precisely, $z$ can be any real number, written $z \in \mathbb{R}$. The solution space of the system is therefore described by:

$$
\begin{align}
x &= -1 - 2z \\
y &= 2 - z \\ 
z &\in \mathbb{R}
\end{align}
$$

Since there are no constraints placed on the value of $z$, we call it a **free variable**. It is important to observe that the linear system possesses an infinite number of solutions due to this characteristic.

We will refer to this representation of the solution space, where the pivot variables are expressed in relation to the independent (free) variables, as a **parametric description** of the solution space.
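The parametric description of Example 1 can be checked by substitution. The sketch below uses two function names chosen for this illustration: it picks a few values of $z$, computes $x$ and $y$ from the parametric description, and confirms that both equations hold.

```python
def solution(z):
    """Return (x, y, z) from the parametric description for a chosen value of z."""
    return (-1 - 2 * z, 2 - z, z)

def satisfies_system(x, y, z):
    """Check both equations read off from the RREF matrix of Example 1."""
    return x + 2 * z == -1 and y + z == 2

for z in [-2, 0, 1, 10]:
    x, y, _ = solution(z)
    print((x, y, z), satisfies_system(x, y, z))
```

Every choice of $z$ produces a valid solution, which is why the system has infinitely many solutions.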

##### Example 2:

Consider the following matrix in RREF:

$$
\begin{bmatrix}
    1 & 0 & 2 & \big| & 0\\
    0 & 1 & -1 & \big| & 0\\
    0 & 0 & 0 & \big| & 1
\end{bmatrix}
$$

The last row corresponds to the equation:

$$0x + 0y + 0z = 1$$

This equation reduces to $0 = 1$, which is false for every choice of $x$, $y$, and $z$. Since this equation has no solution, the entire system of equations has no solution.
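This check is easy to automate: a system has no solution exactly when the reduced augmented matrix contains a row whose coefficients are all zero but whose constant term is nonzero. A minimal sketch, with the function name `is_inconsistent` chosen for this illustration:

```python
def is_inconsistent(augmented):
    """Return True if some row reads 0 = c with c nonzero."""
    for row in augmented:
        coefficients, constant = row[:-1], row[-1]
        if all(a == 0 for a in coefficients) and constant != 0:
            return True
    return False

example_1 = [
    [1, 0, 2, -1],
    [0, 1, 1, 2],
]
example_2 = [
    [1, 0, 2, 0],
    [0, 1, -1, 0],
    [0, 0, 0, 1],
]
print(is_inconsistent(example_1))  # False
print(is_inconsistent(example_2))  # True
```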