## Elementary matrix operations and elementary matrices.

Solving a system of linear algebraic equations $Ax=b$ is one of the central problems of linear algebra. From high school and university, we are familiar with performing elementary row operations on a matrix $A$ to obtain a simplified system of equations that is easier to solve.

**Definition** (*Elementary matrix*). Any matrix $E$ obtained by performing a single elementary row (or column) operation on the identity matrix $I_n$ is called an elementary matrix.

**Theorem.** Every elementary matrix $E$ is invertible. That is, given any elementary matrix $E$, there exists a matrix $D$ such that $DE = I = ED$.

**Story Proof.** 

Each elementary row (column) operation on a matrix $A$ is equivalent to pre-multiplying (post-multiplying) $A$ by an elementary matrix $E$. For example, let

$
A = 
\begin{bmatrix}
0 & 1 & 2\\
3 & 0 & 0\\
2 & 1 & 0
\end{bmatrix}
$

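Recall that the $i$th row $C_i$ of a product of two matrices is the $i$th row of the left factor, with entries $a_{i1},\ldots,a_{in}$, multiplied by the right factor, whose rows are $B_1,\ldots,B_n$: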

$
\begin{align*}
C_i = \begin{bmatrix}a_{i1} & a_{i2} & \ldots & a_{in}\end{bmatrix} \begin{bmatrix}B_1 \\ B_2 \\ \vdots \\ B_n\end{bmatrix} = a_{i1}\cdot B_1 + a_{i2}\cdot B_2 + \ldots + a_{in}\cdot B_n
\end{align*}
$

Left-multiplying the example matrix $A$ by $E_{12}$ interchanges the first and second rows of $A$. That is,

$
\begin{align*}
E_{12}A = \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix}0 & 1 & 2 \\ 3 & 0 & 0 \\ 2 & 1 & 0\end{bmatrix} = 
\begin{bmatrix}
3 & 0 & 0 \\ 
0 & 1 & 2 \\ 
2 & 1 & 0\end{bmatrix}
\end{align*}
$

Next, consider multiplying or dividing a row by a non-zero scalar. Multiplying the $i$th row by a scalar $k \ne 0$ is equivalent to the left-multiplication $E_i(k)A$. Multiplying the $i$th column by $k$ is equivalent to the right-multiplication $AE_i(k)$. As an illustration,

$
\begin{align*}
E_3(2)A = \begin{bmatrix}1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 2\end{bmatrix}\begin{bmatrix}0 & 1 & 2 \\ 3 & 0 & 0 \\ 2 & 1 & 0\end{bmatrix} = 
\begin{bmatrix}
0 & 1 & 2 \\ 
3 & 0 & 0 \\ 
4 & 2 & 0
\end{bmatrix}
\end{align*}
$

Lastly, replacing an equation (row) $e_i$ by the sum $e_i + ke_j$, where $j \ne i$ and $k$ is any scalar, is equivalent to left-multiplication by an elementary matrix $E_{ij}(k)$. For example,

$
\begin{align*}
E_{13}(5)A = \begin{bmatrix}1 & 0 & 5 \\ 0 & 1 & 0 \\ 0 & 0 & 1\end{bmatrix}\begin{bmatrix}0 & 1 & 2 \\ 3 & 0 & 0 \\ 2 & 1 & 0\end{bmatrix} = 
\begin{bmatrix}
10 & 6 & 2 \\ 
3 & 0 & 0 \\ 
2 & 1 & 0
\end{bmatrix}
\end{align*}
$
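The three products above can be reproduced with a short, dependency-free Python sketch. The helper names `identity`, `matmul`, `swap`, `scale` and `add_multiple` are hypothetical names chosen for this illustration, and matrices are stored as plain lists of rows with 1-based row indices to match the notation $E_{ij}$, $E_i(k)$ and $E_{ij}(k)$.

```python
def identity(n):
    """Return the n x n identity matrix as a list of rows."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    """Return the matrix product XY."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def swap(n, i, j):
    """E_ij: interchange rows i and j of I_n (1-based indices)."""
    E = identity(n)
    E[i - 1], E[j - 1] = E[j - 1], E[i - 1]
    return E

def scale(n, i, k):
    """E_i(k): multiply row i of I_n by the non-zero scalar k."""
    E = identity(n)
    E[i - 1][i - 1] = k
    return E

def add_multiple(n, i, j, k):
    """E_ij(k): add k times row j to row i of I_n (i != j)."""
    E = identity(n)
    E[i - 1][j - 1] = k
    return E

A = [[0, 1, 2],
     [3, 0, 0],
     [2, 1, 0]]

print(matmul(swap(3, 1, 2), A))             # rows 1 and 2 interchanged
print(matmul(scale(3, 3, 2), A))            # row 3 doubled
print(matmul(add_multiple(3, 1, 3, 5), A))  # row 1 replaced by row 1 + 5 * row 3
print(matmul(A, scale(3, 3, 2)))            # right-multiplication: column 3 doubled
```

Multiplying on the right, as in the last line, applies the corresponding operation to the columns of $A$ instead of its rows.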

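Each elementary row operation can be undone by an elementary row operation of the same type. Interchanging rows $i$ and $j$ twice restores the original matrix, so $E_{ij}E_{ij} = I$. Multiplying row $i$ by a non-zero scalar $k$ is undone by multiplying it by $1/k$, so $E_i(1/k)E_i(k) = I = E_i(k)E_i(1/k)$. Adding $k$ times row $j$ to row $i$ is undone by subtracting it again, so $E_{ij}(-k)E_{ij}(k) = I = E_{ij}(k)E_{ij}(-k)$. An elementary column operation on $I_n$ produces a matrix of one of these same three types, so the argument covers it as well. In every case there is a matrix $D$ with $DE = I = ED$. Consequently, any elementary matrix $E$ is *invertible*.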
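The invertibility of the three example matrices used in this section can also be checked directly. The sketch below is only a numerical illustration, not a proof: it pairs each example matrix $E$ with a candidate inverse $D$ (the same swap, the reciprocal scaling, the opposite multiple) and tests $DE = I = ED$. The helper `matmul` and the variable names are assumptions made for this example, and `fractions.Fraction` keeps the arithmetic exact.

```python
from fractions import Fraction

def matmul(X, Y):
    """Return the matrix product XY for matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

I3 = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

# E_12 (row interchange) is its own inverse.
E_swap = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]

# E_3(2) (scale row 3 by 2) is undone by E_3(1/2).
E_scale = [[1, 0, 0], [0, 1, 0], [0, 0, 2]]
D_scale = [[1, 0, 0], [0, 1, 0], [0, 0, Fraction(1, 2)]]

# E_13(5) (add 5 * row 3 to row 1) is undone by E_13(-5).
E_add = [[1, 0, 5], [0, 1, 0], [0, 0, 1]]
D_add = [[1, 0, -5], [0, 1, 0], [0, 0, 1]]

pairs = [(E_swap, E_swap), (E_scale, D_scale), (E_add, D_add)]
checks = [matmul(D, E) == I3 and matmul(E, D) == I3 for E, D in pairs]
print(checks)  # one boolean per pair
```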

## Equivalent systems of linear equations.

Any rectangular matrix in the *row-echelon* form has the following three defining properties.

(1) The first $r$ rows for some $r \ge 0$ are non-zero, and the remaining rows if any are zero.

(2) In the $i$th row $(i=1,2,3,\ldots,r)$, the first non-zero element is equal to unity; the column in which it occurs is denoted $c_i$.

(3) $c_1 < c_2 < c_3 < \ldots < c_r$

A matrix in row-echelon form has a stair-case pattern. For example,

$
\begin{align*}
U = \begin{bmatrix}
1 & 0 & 3 & 3\\
0 & 1 & 3 &4 \\
0 & 0 & 0 & 1
\end{bmatrix}
\end{align*}
$

is in the row-echelon form.

A matrix is said to be in reduced row-echelon form (rref) if it satisfies the following two conditions.

(1) The matrix is in row-echelon form.

(2) Each leading $1$ is the only non-zero entry in its column.

The reduced row-echelon form of the matrix $U$ above is

$
\begin{align*}
R = \begin{bmatrix}
1 & 0 & 3 & 0\\
0 & 1 & 3 & 0 \\
0 & 0 & 0 & 1
\end{bmatrix}
\end{align*}
$
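The passage from $U$ to $R$ uses only the three elementary row operations. As a minimal sketch, assuming exact rational arithmetic via `fractions.Fraction`, the function below performs Gauss-Jordan elimination column by column; the name `rref` and its list-of-rows interface are choices made for this illustration rather than a reference implementation.

```python
from fractions import Fraction

def rref(M):
    """Return the reduced row-echelon form of M, given as a list of rows."""
    R = [[Fraction(x) for x in row] for row in M]
    rows, cols = len(R), len(R[0])
    pivot_row = 0
    for c in range(cols):
        # Look for a row at or below pivot_row with a non-zero entry in column c.
        p = next((r for r in range(pivot_row, rows) if R[r][c] != 0), None)
        if p is None:
            continue  # no pivot in this column
        # Row interchange: bring the pivot row into position.
        R[pivot_row], R[p] = R[p], R[pivot_row]
        # Row scaling: make the leading entry equal to 1.
        lead = R[pivot_row][c]
        R[pivot_row] = [x / lead for x in R[pivot_row]]
        # Row replacement: clear every other entry in column c.
        for r in range(rows):
            if r != pivot_row and R[r][c] != 0:
                k = R[r][c]
                R[r] = [x - k * y for x, y in zip(R[r], R[pivot_row])]
        pivot_row += 1
        if pivot_row == rows:
            break
    return R

U = [[1, 0, 3, 3],
     [0, 1, 3, 4],
     [0, 0, 0, 1]]

R = rref(U)
print([[int(x) for x in row] for row in R])
```

For $U$, the only work left is in the last column: the leading $1$ of the third row is used to clear the entries $3$ and $4$ above it.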

A central objective of linear algebra is to solve the system of equations:

$
\begin{align*}
A\mathbf{x}=\mathbf{b}
\end{align*}
$

where $A$ is a matrix of order $m \times n$ over the field of reals, $\mathbf{x} \in \mathbb{R}^n$, and the right-hand side vector $\mathbf{b} \in \mathbb{R}^m$. We wish to find $\mathbf{x}=(x_1,x_2,x_3,\ldots,x_n)$ that satisfies the above system of equations.

A solution of a linear system is therefore an assignment of values to the variables $x_1,x_2,\ldots, x_n$ such that each of the equations is satisfied. The set of all possible solutions is called the *solution set*.

- A system of equations may have no solution at all (for example, parallel lines).
- A system of equations may have a unique solution (straight lines intersecting at a single point).
- A system of equations may have an infinite number of solutions (coincident lines).
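The three cases can be made concrete for two lines in the plane. The sketch below is a small illustration under stated assumptions: each line is given as a tuple of integer coefficients `(a, b, c)` representing $ax + by = c$ with $(a, b) \ne (0, 0)$, and the function name `classify_two_lines` is hypothetical. It uses the $2 \times 2$ determinant to separate the intersecting case from the parallel and coincident cases.

```python
from fractions import Fraction

def classify_two_lines(line1, line2):
    """Classify the system formed by two lines a*x + b*y = c.

    Each line is a tuple (a, b, c) of integers with (a, b) != (0, 0).
    Returns ("unique", (x, y)), ("infinite", None) or ("none", None).
    """
    a1, b1, c1 = line1
    a2, b2, c2 = line2
    det = a1 * b2 - a2 * b1
    if det != 0:
        # The lines intersect in exactly one point (Cramer's rule).
        x = Fraction(c1 * b2 - c2 * b1, det)
        y = Fraction(a1 * c2 - a2 * c1, det)
        return "unique", (x, y)
    # det == 0: the lines have the same direction.
    if a1 * c2 - a2 * c1 == 0 and b1 * c2 - b2 * c1 == 0:
        return "infinite", None  # coincident lines
    return "none", None          # distinct parallel lines

print(classify_two_lines((1, 1, 1), (1, 1, 2)))   # x + y = 1 and x + y = 2
print(classify_two_lines((1, 1, 2), (1, -1, 0)))  # x + y = 2 and x - y = 0
print(classify_two_lines((1, 1, 1), (2, 2, 2)))   # x + y = 1 and 2x + 2y = 2
```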

In general, a system of $m$ equations in $n$ unknowns is said to be *under-determined* if the number of equations is smaller than the number of unknowns, $m < n$. An under-determined system has either no solution or an infinite number of solutions. If $m=n$, the system generally has a unique solution; this holds exactly when $A$ is invertible. If the number of equations exceeds the number of unknowns, $m > n$, the system is *over-determined* and generally has no solution.

This is just to give a geometric viewpoint to the solutions of a system of linear equations.

Consider the system of equations