# DSCI 6001 - 1.2: Matrices as Systems of Equations

## By the End of this Lecture You Will Be Able To:

1. Describe what a linear system is and write a linear system in matrix form
2. Describe the determination of a system: determined, underdetermined, overdetermined
3. Describe the difference between a consistent and inconsistent system
4. Describe Gauss elimination and use it to solve a linear system


## Part 1: Linear Systems

Any algebraic equation whose terms are each either a constant or a constant multiple of a single variable raised to the first power is termed a __linear equation__. For example, $y=5x+3$, $ax+by+cz+d=0$ and $2x_1+3x_2-x_3=5$ are all linear equations (assuming $a,b,c,d$ are all constants). A __system of linear equations__ (or __linear system__) is a collection of linear equations involving the same set of variables. A simple example of a linear system in 2 variables, $x$ and $y$, would be:

$$
2x + 3y = 6 \\
4x + 5y = 15
$$

The general form of a linear system of $m$ equations in $n$ variables is:

$$\begin{alignat}{1}
a_{11} x_1 &&\; + \;&& a_{12} x_2   &&\; + \cdots + \;&& a_{1n} x_n &&\; = \;&&& b_1 \\
a_{21} x_1 &&\; + \;&& a_{22} x_2   &&\; + \cdots + \;&& a_{2n} x_n &&\; = \;&&& b_2 \\
\vdots\;\;\; &&     && \vdots\;\;\; &&                && \vdots\;\;\; &&     &&& \;\vdots \\
a_{m1} x_1 &&\; + \;&& a_{m2} x_2   &&\; + \cdots + \;&& a_{mn} x_n &&\; = \;&&& b_m \\
\end{alignat}$$

These are called 'linear' systems because every variable $x_i$ appears only to the first power: no products of variables, higher powers, or other nonlinear functions of the $x_i$ occur.


### Linear Systems: Nonhomogeneity, Homogeneity, Solutions

The right-hand-side values $b_i$ (the **dependent variables**, which the independent variables $x_i$ must reproduce) define the dominant property of the system.

**If the system is:**

*Homogeneous:* The $b_i$ are **all zero:** $b_1 = b_2 = \cdots = b_m = 0$

*Nonhomogeneous:* At least one of the $b_i$ is **not zero:** $b_i \neq 0$ for some $i$

**A solution to the system is a set of numbers $\{x_1 \ldots x_n\}$ that satisfies all the equations of the system simultaneously.**

A vector whose components are a solution to the system is called a solution vector ${\bf{x}} = \begin{bmatrix}
    x_1 \\
    \vdots \\
    x_n \\
\end{bmatrix}$
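To make the definition of a solution concrete, here is a minimal sketch in plain Python that checks whether a candidate satisfies every equation of the small system $2x + 3y = 6$, $4x + 5y = 15$ from above. The helper name `is_solution` and the tuple layout of `equations` are illustrative choices, not notation from the lecture.

```python
# Each equation is stored as (coefficients, right-hand side):
#   2x + 3y = 6
#   4x + 5y = 15
equations = [((2, 3), 6), ((4, 5), 15)]

def is_solution(candidate, equations, tol=1e-12):
    """Return True if `candidate` satisfies every equation of the system."""
    for coeffs, rhs in equations:
        lhs = sum(a * x for a, x in zip(coeffs, candidate))
        if abs(lhs - rhs) > tol:
            return False
    return True

print(is_solution((7.5, -3.0), equations))  # True: both equations hold
print(is_solution((3.0, 0.0), equations))   # False: only the first equation holds
```

Note that a candidate satisfying just one of the equations is not a solution; all equations must hold simultaneously.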



### Behavior of a linear system

The typical behavior of a linear system can be anticipated by comparing the number of equations $m$ with the number of unknowns $n$:

- __Underdetermined__ 
    - If number of equations is _less than_ the number of variables, i.e. $m < n$, a linear system is said to be __underdetermined__.
    
    - An underdetermined system will have **infinitely many solutions** (a _solution set_) or no solution at all. 
    
- __Overdetermined__ 
    - A linear system is said to be __overdetermined__ if the number of equations is _greater than_ the number of variables, i.e. $m > n$.
    - These systems usually have no exact solution, but an approximate solution (for example, a least-squares fit) can be found.
            
- __Uniquely Determined__
    - A linear system is said to be __uniquely determined__ if the number of equations _equals_ the number of variables, i.e. $m = n$
    - Such systems usually have a unique solution.

A linear system that has no solution is said to be __inconsistent__. Otherwise it is said to be __consistent__.
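The three labels above depend only on the shape of the coefficient matrix, which a short sketch can make explicit. The function name `determination` is hypothetical, and the zero matrices are placeholders used only for their shapes. This does not test consistency: a system of any shape may still have no solution.

```python
import numpy as np

def determination(A):
    """Label a system using only the shape (m equations, n unknowns) of A."""
    m, n = np.shape(A)
    if m < n:
        return "underdetermined"
    if m > n:
        return "overdetermined"
    return "uniquely determined"

# Placeholder matrices: only their shapes matter here
print(determination(np.zeros((2, 3))))  # underdetermined
print(determination(np.zeros((5, 3))))  # overdetermined
print(determination(np.zeros((3, 3))))  # uniquely determined
```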

### QUIZ:

How determined is the system $3x + 2y = 1, -6x + 10y = 5$?

## Linear Systems as Matrices

The general form of the linear system presented above can be compactly represented in matrix form as:

$$ \textbf{Ax}=\textbf{b} $$

where $\bf A$ is an $m\times{n}$ matrix, $\textbf{x}$ is a column vector with $n$ entries, and $\textbf{b}$ is a column vector with $m$ entries.


$$
{\bf A}=
\begin{bmatrix}
a_{11} & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} & \cdots & a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{m1} & a_{m2} & \cdots & a_{mn}
\end{bmatrix},\quad
\textbf{x}=
\begin{bmatrix}
x_1 \\
x_2 \\
\vdots \\
x_n
\end{bmatrix},\quad
\textbf{b}=
\begin{bmatrix}
b_1 \\
b_2 \\
\vdots \\
b_m
\end{bmatrix}
$$

Finding the solution to a system of linear equations involves assigning values to the elements of $\textbf{x}$, $x_1,x_2,...,x_n$ such that the equations are satisfied (or the matrix relation holds).

Frequently we write the *augmented form* of the system, in which the $b_i$ are appended to the coefficient matrix as an extra column. This makes it more fun to solve. The system

$$
L_1:\hspace{10pt} 2x + 3y = 6 \\
L_2:\hspace{10pt} 4x + 5y = 15
$$

can be expressed in augmented matrix notation as:

$$ \left[ \begin{array}{cc|c}
2 & 3 & 6 \\
4 & 5 & 15 
\end{array} \right]$$

Similarly, we can express the more complex system:

$$
L_1:\hspace{10pt} 4x - 2y + 5z + t = 6 \\
L_2:\hspace{10pt} -\frac{9}{2}x + 5y -31z = 15 \\
L_3:\hspace{10pt} 22x - 50z - 17t = 34 \\
L_4:\hspace{10pt} 20y -11z = 78 
$$

as:

$$ \left[ \begin{array}{cccc|c}
4 & -2 & 5 & 1 & 6 \\
-\frac{9}{2} & 5& -31 & 0 & 15 \\
22 & 0 & -50 & -17 & 34 \\
0 & 20 & -11 & 0& 78 \\
\end{array} \right]$$

From here on out we will talk in terms of the rows and columns of the augmented matrix when we talk about linear systems. 
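As a sketch of how the augmented matrix might be built in code, assuming NumPy is available, the four-variable system above can be stored as a coefficient array `A` and a right-hand side `b` and then joined column-wise. The variable names are illustrative.

```python
import numpy as np

# Coefficients of x, y, z, t for L1..L4; a variable missing from an
# equation gets the coefficient 0.
A = np.array([
    [ 4.0, -2.0,   5.0,   1.0],
    [-4.5,  5.0, -31.0,   0.0],
    [22.0,  0.0, -50.0, -17.0],
    [ 0.0, 20.0, -11.0,   0.0],
])
b = np.array([6.0, 15.0, 34.0, 78.0])

# Append b as the last column to form the augmented matrix [A | b]
augmented = np.column_stack([A, b])
print(augmented.shape)  # (4, 5)
```

The last column of `augmented` holds the $b_i$, and every other column holds the coefficients of one variable.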

### QUIZ: 
Write the system $3x + 2y = 1, -6x + 10y = 5$ as a matrix. 

## Part 2: Solving Linear Systems

Whether or not a linear system possesses a unique solution is a central question in linear algebra. The presence of a solution and the nature of that solution provide key information about the system under study. 

There are two primary pathways of getting a solution to a linear system:

- Direct methods 
    - Direct methods: Use a fixed number of computations that would yield the exact solution in exact arithmetic.
    - E.g.: Gauss Elimination, Cramer's rule 
    
- Iterative methods 
    - Iterative methods: Computationally generate a sequence of approximations to the solution. 
    - E.g.: Gauss-Seidel method, conjugate gradients
    
Much of what you're learning in this coursework pertains to these two items. 
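To illustrate the contrast between the two pathways, here is a rough sketch of a Gauss-Seidel iteration compared against NumPy's direct solver. The $2 \times 2$ system is a hypothetical, diagonally dominant example chosen so that the iteration converges; it is not taken from the lecture, and the fixed iteration count stands in for a proper convergence test.

```python
import numpy as np

def gauss_seidel(A, b, iterations=25):
    """Run a fixed number of Gauss-Seidel sweeps starting from the zero vector."""
    n = len(b)
    x = np.zeros(n)
    for _ in range(iterations):
        for i in range(n):
            # Contribution of all other unknowns, using their newest values
            others = A[i] @ x - A[i, i] * x[i]
            x[i] = (b[i] - others) / A[i, i]
    return x

# Hypothetical diagonally dominant system (an assumption for this sketch)
A = np.array([[4.0, 1.0],
              [1.0, 3.0]])
b = np.array([9.0, 7.0])

approx = gauss_seidel(A, b)       # iterative: sequence of approximations
direct = np.linalg.solve(A, b)    # direct: fixed number of computations
print(np.allclose(approx, direct))  # True
```

Each sweep refines the previous approximation, whereas the direct solver finishes after a fixed amount of work.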


### Gauss Elimination:

Today we will be addressing direct methods of solving linear systems. There is one main way of doing so involving two parts: 

- __Backsubstitution__:
     - We can obtain partial solutions to the system by solving one row completely for one of the unknowns in terms of the remaining quantities. This allows us to substitute the solved unknown into the unsolved rows of the linear system.
     
     
- __Gauss Elimination__:
    - Using a small set of simple operations, the rows of the matrix can be manipulated to eliminate as many variables as possible from each row. This is done by adding scalar multiples of rows to each other, reducing the coefficients of as many variables as possible to zero in each row.
    
We can break the above methods down into what is known as elementary row operations on the augmented matrix of the system:

1. Swapping two rows
2. Scalar (non-zero) multiplication of a single row
3. Adding a scalar multiple of one row to another row

These operations are only for **rows**, NOT for **columns**. Swapping two columns reassigns the coefficients to different variables, which changes the system being solved!
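The three elementary row operations can be sketched as small NumPy helpers. The function names are hypothetical, and the matrix `M` is an arbitrary placeholder rather than a system from the lecture. Each helper returns a new floating-point array and leaves its input untouched.

```python
import numpy as np

def swap_rows(M, i, j):
    """Operation 1: swap rows i and j."""
    out = np.array(M, dtype=float)
    out[[i, j]] = out[[j, i]]
    return out

def scale_row(M, i, c):
    """Operation 2: multiply row i by a non-zero scalar c."""
    if c == 0:
        raise ValueError("the scalar must be non-zero")
    out = np.array(M, dtype=float)
    out[i] = c * out[i]
    return out

def add_multiple(M, target, source, c):
    """Operation 3: add c times row `source` to row `target`."""
    out = np.array(M, dtype=float)
    out[target] = out[target] + c * out[source]
    return out

# Arbitrary placeholder augmented matrix
M = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

print(add_multiple(M, 1, 0, -4.0))  # second row becomes [0, -3, -6]
```

Rows are indexed from 0 in the code, so $L_1$ corresponds to row `0`.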


**Example 1:** Consider the following system of linear equations in 2 variables.

$$2 x_1 + 5 x_2 = 2$$
$$-4 x_1 + 3 x_2 = -30$$

$$ \begin{array}{c} L_1 \\ L_2 \end{array}\hspace{20pt} \left[\begin{array}{cc|c}
2 & 5 & 2\\
-4 & 3 & -30\\
\end{array} \right]$$

$$ \begin{array}{c} L_1 \\ 2 L_1 + L_2 \end{array}\hspace{20pt} \left[\begin{array}{cc|c}
2 & 5 & 2\\
0 & 13 & -26\\
\end{array} \right]$$

$$ \begin{array}{c} L_1 \\ \frac{1}{13}(2 L_1 + L_2) \end{array}\hspace{20pt} \left[\begin{array}{cc|c}
2 & 5 & 2\\
0 & 1 & -2\\
\end{array} \right]$$

$$ \begin{array}{c} L_1 - \frac{5}{13}(2 L_1 + L_2) \\ \frac{1}{13}(2 L_1 + L_2) \end{array}\hspace{20pt} \left[\begin{array}{cc|c}
2 & 0 & 12\\
0 & 1 & -2\\
\end{array} \right]$$

$$ \begin{array}{c} \frac{1}{2}(L_1 - \frac{5}{13}(2 L_1 + L_2)) \\ \frac{1}{13}(2 L_1 + L_2) \end{array}\hspace{20pt} \left[\begin{array}{cc|c}
1 & 0 & 6\\
0 & 1 & -2\\
\end{array} \right]$$

Thus this is a nonhomogeneous, consistent system with a unique solution:

$$x_1 = 6$$
$$x_2 = -2$$
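The row operations of Example 1 can be replayed step by step in NumPy and cross-checked against `np.linalg.solve`. This is only a sketch that mirrors the hand calculation above, not a general-purpose elimination routine.

```python
import numpy as np

# Augmented matrix of Example 1
M = np.array([[ 2.0, 5.0,   2.0],
              [-4.0, 3.0, -30.0]])

M[1] = M[1] + 2.0 * M[0]   # 2 L1 + L2        -> [0, 13, -26]
M[1] = M[1] / 13.0         # scale by 1/13    -> [0, 1, -2]
M[0] = M[0] - 5.0 * M[1]   # remove x2 from L1 -> [2, 0, 12]
M[0] = M[0] / 2.0          # scale by 1/2     -> [1, 0, 6]

solution = M[:, -1].copy()  # last column now holds x1 = 6, x2 = -2
print(solution)

# Cross-check with NumPy's built-in solver
A = np.array([[2.0, 5.0], [-4.0, 3.0]])
b = np.array([2.0, -30.0])
print(np.allclose(np.linalg.solve(A, b), solution))  # True
```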

**Example 2 (advanced):** Consider the following system of linear equations in 4 variables.

$$x_1 + x_2 - 2 x_3 + 4 x_4 = 5$$
$$2 x_1 + 2 x_2 - 3 x_3 + x_4 = 3$$
$$3 x_1 + 3 x_2 - 4 x_3 - 2 x_4 = 1$$

The augmented matrix for this system is

$$\left[ \begin{array}{cccc|c} 1 & 1 & -2 & 4 & 5 \\ 2 & 2 & -3 & 1 & 3 \\ 3 & 3 & -4 & -2 & 1 \end{array}\right]$$

$L_3 - 3 L_1$ and $L_2 - 2 L_1$ give

$$\left[ \begin{array}{cccc|c} 1 & 1 & -2 & 4 & 5 \\ 0 & 0 & 1 & -7 & -7 \\ 0 & 0 & 2 & -14 & -14 \end{array}\right]$$


Then $L_1 + L_3$ followed by $L_3 - 2 L_2$ gives

$$\left[ \begin{array}{cccc|c} 1 & 1 & 0 & -10 & -9 \\ 0 & 0 & 1 & -7 & -7 \\ 0 & 0 & 0 & 0 & 0 \end{array}\right]$$

Rewriting this as a system of linear equations:

$$x_1 + x_2 -10 x_4 = -9$$
$$x_3 -7 x_4 = -7$$

(The zero row carries no information and is omitted.) Solving for $x_1$ and $x_3$:

$$x_1 = -9 -x_2 + 10 x_4$$
$$x_3 = -7 + 7 x_4$$

$x_2$ and $x_4$ are _free_ variables. This means that we can _choose_ any values for $x_2$ and $x_4$, and the equations above then fix $x_1$ and $x_3$ so that we have a solution of the given system. In this sense, the system has infinitely many solutions.
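A quick numerical check of Example 2: the sketch below builds a solution vector from arbitrary choices of $x_2$ and $x_4$ using the two equations above, then confirms that it satisfies the original three equations. The function name `solution_from_free` and the sample values are arbitrary.

```python
import numpy as np

# Coefficient matrix and right-hand side of Example 2
A = np.array([[1.0, 1.0, -2.0,  4.0],
              [2.0, 2.0, -3.0,  1.0],
              [3.0, 3.0, -4.0, -2.0]])
b = np.array([5.0, 3.0, 1.0])

def solution_from_free(x2, x4):
    """Build (x1, x2, x3, x4) from chosen values of x2 and x4."""
    x1 = -9.0 - x2 + 10.0 * x4
    x3 = -7.0 + 7.0 * x4
    return np.array([x1, x2, x3, x4])

# Arbitrary choices of x2 and x4; each one yields a valid solution
for x2, x4 in [(0.0, 0.0), (1.0, 2.0), (-3.5, 0.25)]:
    x = solution_from_free(x2, x4)
    print(x, np.allclose(A @ x, b))
```

Every choice passes the check, which is what "infinitely many solutions" means in practice.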

### QUIZ: 
Solve the system $3x + 2y = 1, -6x + 10y = 5$, if you can.

## Matrix-Form notation:

We have spent our time discussing the matrix forms of systems of equations. There is an important cognitive tool that we can unleash (and will soon) on these types of problems by converting our expressions to "matrix-form notation" (I often call it 'closed-form', although that term is most properly used for functional notation). Recalling the last lecture, remember that we can multiply a matrix by a vector and get a vector back. Let us describe the following system:

$$2 x_1 + 5 x_2 = 2$$
$$-4 x_1 + 3 x_2 = -30$$

in terms of this formalism. We first write the matrix ${\bf A}$:

$$ \left[ \begin{array}{cc}
2 & 5\\
-4 & 3 
\end{array} \right]$$

and multiply it by a vector $x = \begin{bmatrix}
x_1 \\
x_2 \\
\end{bmatrix}$ to get the left-hand side of our linear system back (remember the matrix multiplication rule: row by column).

$$ \left[ \begin{array}{cc}
2 & 5\\
-4 & 3 
\end{array} \right]\begin{bmatrix}
x_1 \\
x_2 \\
\end{bmatrix} = \begin{bmatrix}
2 x_1 + 5 x_2 \\
-4 x_1 + 3 x_2 \\
\end{bmatrix}$$

Now we can set this product equal to the vector $b = \begin{bmatrix}
2 \\
-30 \\
\end{bmatrix}$:

$$ \left[ \begin{array}{cc}
2 & 5\\
-4 & 3 
\end{array} \right]\begin{bmatrix}
x_1 \\
x_2 \\
\end{bmatrix} = \begin{bmatrix}
2 \\
-30 \\
\end{bmatrix}$$
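The row-by-column rule can be spelled out in a short sketch: each entry of the product is one row of the matrix multiplied term by term with the vector and summed. The vector `x` below is the solution $x_1 = 6$, $x_2 = -2$ found in Example 1; the name `by_rows` is illustrative.

```python
import numpy as np

A = np.array([[ 2.0, 5.0],
              [-4.0, 3.0]])
x = np.array([6.0, -2.0])      # solution found in Example 1
b = np.array([2.0, -30.0])

# Row-by-column rule written out: entry i is sum_j A[i, j] * x[j]
m, n = A.shape
by_rows = np.array([sum(A[i, j] * x[j] for j in range(n)) for i in range(m)])

print(np.array_equal(by_rows, A @ x))  # True: matches NumPy's matrix product
print(np.array_equal(A @ x, b))        # True: x satisfies the system
```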

Therefore, it should be natural to you to understand that all linear systems can be expressed this way:

$$ \textbf{Ax}=\textbf{b} $$

This is known as the **matrix form** of the linear system.

### ASSIGNED PROBLEMS:
7.3: 1, 3, 5, 7, 11, 13