$$\newcommand{\F}{\mathbb{F}}
\newcommand{\R}{\mathbb{R}}
\newcommand{\v}{\mathbf{v}}
\newcommand{\a}{\mathbf{a}}
\newcommand{\b}{\mathbf{b}}
\newcommand{\c}{\mathbf{c}}
\newcommand{\d}{\mathbf{d}}
\newcommand{\p}{\mathbf{p}}
\newcommand{\w}{\mathbf{w}}
\newcommand{\u}{\mathbf{u}}
\newcommand{\x}{\mathbf{x}}
\newcommand{\y}{\mathbf{y}}
\newcommand{\z}{\mathbf{z}}
\newcommand{\0}{\mathbf{0}}
\newcommand{\1}{\mathbf{1}}
\newcommand{\A}{\mathbf{A}}
\newcommand{\B}{\mathbf{B}}
\newcommand{\C}{\mathbf{C}}
\newcommand{\E}{\mathbf{E}}
\newcommand{\P}{\mathbf{P}}$$

# https://en.wikipedia.org/wiki/System_of_linear_equations

# Chapter on the book and Macro Analyst

## General Form of Linear Equations

### Algebraic Form of Linear Equations

A general system of $m$ linear equations with $n$ unknowns can be written as:

$$
\begin{align}
a_{11} x_1 + a_{12} x_2  + \cdots + a_{1n} x_n  &= b_1 \\
a_{21} x_1 + a_{22} x_2  + \cdots + a_{2n} x_n  &= b_2 \\
& \ \ \vdots\\
a_{m1} x_1 + a_{m2} x_2  + \cdots + a_{mn} x_n  &= b_m,
\end{align}
$$

where $x_1, x_2,\ldots,x_n$ are the unknowns, $a_{11},a_{12},\ldots,a_{mn}$ are the coefficients of the system, and $b_1,b_2,\ldots,b_m$ are the constant terms.

### Matrix Form of Linear Equations

The system above is equivalent to a matrix equation of the form $\A\x = \b$, where $\A \in \F^{m \times n}$ is the coefficient matrix, $\x$ is a column vector in $\F^n$, and $\b$ is a column vector in $\F^m$:

$$
\A =
\begin{bmatrix}
a_{11} & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} & \cdots & a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{m1} & a_{m2} & \cdots & a_{mn}
\end{bmatrix},\quad
\mathbf{x}=
\begin{bmatrix}
x_1 \\
x_2 \\
\vdots \\
x_n
\end{bmatrix},\quad
\mathbf{b}=
\begin{bmatrix}
b_1 \\
b_2 \\
\vdots \\
b_m
\end{bmatrix}
$$
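As a minimal sketch of what $\A\x = \b$ computes, the snippet below evaluates the left-hand side one equation (row) at a time using plain Python lists. The helper name `mat_vec` and the numbers are assumptions made for illustration only.

```python
def mat_vec(A, x):
    """Return A x, computing one dot product per row of A."""
    if any(len(row) != len(x) for row in A):
        raise ValueError("each row of A must have len(x) entries")
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]


# Illustrative coefficients and unknowns (m = n = 3).
A = [[1, -2, 1],
     [0, 2, -8],
     [-4, 5, 9]]
x = [29, 16, 3]

# Entry i of b is a_i1*x_1 + a_i2*x_2 + a_i3*x_3, i.e. equation i of the system.
b = mat_vec(A, x)
print(b)  # [0, 8, -9]
```

Each entry of the result is the left-hand side of one equation in the algebraic form, which is why the two forms describe the same system.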

### Vector Form of Linear Equations

Recall from the chapter on Matrix Multiplication that $\A\x = \b$ is a right multiplication of the matrix $\A$ by the vector $\x$, and thus $\b$ can be represented as a **linear combination of the columns of $\A$ with the $x_i$ as coefficients**.

$$
\b = x_1 \a_1 + x_2 \a_2 + \cdots + x_n \a_n \implies 
\begin{bmatrix}
b_1 \\
b_2 \\
\vdots \\
b_m
\end{bmatrix} = x_1 \begin{bmatrix} a_{11} \\ a_{21} \\ \vdots \\ a_{m1} \end{bmatrix} + x_2 \begin{bmatrix} a_{12} \\ a_{22} \\ \vdots \\ a_{m2} \end{bmatrix} + \cdots + x_n \begin{bmatrix} a_{1n} \\ a_{2n} \\ \vdots \\ a_{mn} \end{bmatrix}
$$
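The column view can be checked numerically. The following sketch, with hypothetical helper names `column_combination` and `row_by_row` and made-up numbers, builds $\b$ once as $x_1\a_1 + \cdots + x_n\a_n$ and once with the usual row-by-row dot products, and the two results agree.

```python
def column_combination(A, x):
    """Return x_1 * a_1 + ... + x_n * a_n, where a_j is column j of A."""
    m = len(A)
    result = [0] * m
    for j, x_j in enumerate(x):
        column_j = [A[i][j] for i in range(m)]
        # Add x_j times column j to the running sum.
        result = [r + x_j * a for r, a in zip(result, column_j)]
    return result


def row_by_row(A, x):
    """Return A x using one dot product per row."""
    return [sum(a * x_j for a, x_j in zip(row, x)) for row in A]


A = [[1, 2],
     [3, 4],
     [5, 6]]
x = [10, -1]

print(column_combination(A, x))  # [8, 26, 44]
print(row_by_row(A, x))          # [8, 26, 44]
```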

## Elementary Row Operations

### The three types of Elementary Row Operations

In order to enable us to convert a system of linear equations to an **equivalent** system, we define the following **elementary row operations**:

- **Row Permutation:** Interchange any two rows of a matrix: $\mathbf{r}_i \leftrightarrow \mathbf{r}_j$.
- **Row Multiply:** Replace any row of a matrix with a non-zero scalar multiple of itself: $\mathbf{r}_i \to \lambda\mathbf{r}_i$ with $\lambda \neq 0$.
- **Row Addition:** Replace any row of a matrix with the sum of itself and a non-zero scalar multiple of any other row: $\mathbf{r}_i \to \mathbf{r}_i + \lambda \mathbf{r}_j$ with $i \neq j$.

**Here $\mathbf{r}_i$ refers to row $i$ of the matrix.**
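The three operations can be sketched as small Python functions on a matrix stored as a list of rows. The function names are assumptions for this example, indices are 0-based (so row 1 in the text is index 0 here), and each function returns a new matrix rather than modifying its input.

```python
def swap_rows(M, i, j):
    """Row permutation: r_i <-> r_j."""
    result = [row[:] for row in M]
    result[i], result[j] = result[j], result[i]
    return result


def scale_row(M, i, lam):
    """Row multiply: r_i -> lam * r_i, with lam non-zero."""
    if lam == 0:
        raise ValueError("the scalar must be non-zero")
    result = [row[:] for row in M]
    result[i] = [lam * entry for entry in result[i]]
    return result


def add_multiple(M, i, j, lam):
    """Row addition: r_i -> r_i + lam * r_j, with i != j."""
    if i == j:
        raise ValueError("i and j must refer to different rows")
    result = [row[:] for row in M]
    result[i] = [a + lam * b for a, b in zip(M[i], M[j])]
    return result


M = [[1, 2],
     [3, 4]]

print(swap_rows(M, 0, 1))         # [[3, 4], [1, 2]]
print(scale_row(M, 0, 2))         # [[2, 4], [3, 4]]
print(add_multiple(M, 1, 0, -3))  # [[1, 2], [0, -2]]

# Each operation can be undone by an operation of the same type.
undone = add_multiple(add_multiple(M, 1, 0, -3), 1, 0, 3)
print(undone == M)  # True
```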

### The three types of Elementary Column Operations

Replacing the word *row* with *column* gives the definition of **elementary column operations**.

### Definition (Augmented Matrix)

We usually combine $\A$ and $\b$ into a single matrix to make elementary row operations easier to carry out; after all, a row operation is always applied to the **whole system**, that is, to both sides of each equation.

Given the general form of the linear equations, the **augmented matrix** of the system of equations is:

$$
[\A ~|~ \b] = \begin{bmatrix}
a_{11} & a_{12} & \cdots & a_{1n} & b_1 \\
a_{21} & a_{22} & \cdots & a_{2n} & b_2 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
a_{m1} & a_{m2} & \cdots & a_{mn} & b_m
\end{bmatrix}
$$
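A minimal sketch of building $[\A ~|~ \b]$ from a coefficient matrix and a right-hand side, using plain lists. The helper name `augment` and the numbers are illustrative assumptions.

```python
def augment(A, b):
    """Return the augmented matrix [A | b] as a new list of rows."""
    if len(A) != len(b):
        raise ValueError("A and b must have the same number of rows")
    return [row + [b_i] for row, b_i in zip(A, b)]


A = [[1, -2, 1],
     [0, 2, -8],
     [-4, 5, 9]]
b = [0, 8, -9]

augmented = augment(A, b)
for row in augmented:
    print(row)
# [1, -2, 1, 0]
# [0, 2, -8, 8]
# [-4, 5, 9, -9]
```

Because the constant term travels in the same row as its coefficients, any row operation on the augmented matrix acts on both sides of the corresponding equation at once.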

### Definition (Row Echelon Form)

Given a matrix $\A \in \F^{m \times n}$, we say $\A$ is in **row echelon form** if:

- Any rows that are all **zeros** must be at the bottom of the matrix, that is to say, all **zero row vectors** are grouped at the bottom.
- The **leading coefficient (also called the pivot)** of a non-zero row is always strictly to the right of the leading coefficient of the row above it.
- All entries in a column below a pivot are zeros.
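The conditions above can be turned into a small checker. This is a sketch with hypothetical names (`leading_index`, `is_row_echelon`). It tests the first two conditions directly; the third (zeros below each pivot) then holds automatically, because every later non-zero row has its first non-zero entry strictly to the right of the pivot.

```python
def leading_index(row):
    """Column index of the first non-zero entry, or None for a zero row."""
    for j, entry in enumerate(row):
        if entry != 0:
            return j
    return None


def is_row_echelon(M):
    """Return True if M satisfies the row echelon form conditions."""
    previous_pivot = -1
    seen_zero_row = False
    for row in M:
        pivot = leading_index(row)
        if pivot is None:
            seen_zero_row = True
            continue
        # A non-zero row below a zero row violates the first condition;
        # a pivot that is not strictly to the right violates the second.
        if seen_zero_row or pivot <= previous_pivot:
            return False
        previous_pivot = pivot
    return True


print(is_row_echelon([[1, 2, 3], [0, 0, 4], [0, 0, 0]]))  # True
print(is_row_echelon([[0, 0, 0], [1, 2, 3]]))             # False
print(is_row_echelon([[1, 2], [1, 3]]))                   # False
```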

### Definition (Row Equivalence)

Two matrices $\A$ and $\B$ are **row equivalent** if one matrix can be obtained from the other matrix by a **finite sequence of elementary row operations**.

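> Note that every elementary row operation can be undone by an elementary row operation of the same type. So if $\A$ can be obtained by applying a sequence of elementary row operations to $\B$, then applying the inverse operations in reverse order to $\A$ recovers $\B$; row equivalence is therefore symmetric.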

### Theorem (Every matrix is row equivalent to a matrix in echelon form)

### Theorem (Row Equivalent Augmented Matrices have the same solution set)

Given two augmented matrices $[\A ~|~ \b]$ and $[\C ~|~ \d]$, if they are **row equivalent**, then the linear systems $\A\x = \b$ and $\C\x = \d$ have the same solution set.
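Both theorems can be illustrated on a small example. The sketch below is one possible forward-elimination routine (not the only way to reach an echelon form) that uses only row swaps and row additions, with `fractions.Fraction` for exact arithmetic. The function names and the example system are assumptions for illustration. It then checks that a known solution of the original system also satisfies the reduced one.

```python
from fractions import Fraction


def to_row_echelon(M):
    """Reduce M to a row echelon form using row swaps and row additions."""
    R = [[Fraction(entry) for entry in row] for row in M]
    n_rows, n_cols = len(R), len(R[0])
    pivot_row = 0
    for col in range(n_cols):
        if pivot_row == n_rows:
            break
        # Rows at or below pivot_row with a non-zero entry in this column.
        candidates = [i for i in range(pivot_row, n_rows) if R[i][col] != 0]
        if not candidates:
            continue
        i = candidates[0]
        R[pivot_row], R[i] = R[i], R[pivot_row]  # row permutation
        for k in range(pivot_row + 1, n_rows):
            factor = R[k][col] / R[pivot_row][col]
            # Row addition: r_k -> r_k - factor * r_pivot clears the entry below the pivot.
            R[k] = [a - factor * p for a, p in zip(R[k], R[pivot_row])]
        pivot_row += 1
    return R


def satisfies(augmented, x):
    """Check that x solves every equation encoded in the augmented matrix."""
    return all(
        sum(a * x_j for a, x_j in zip(row[:-1], x)) == row[-1]
        for row in augmented
    )


system = [[1, -2, 1, 0],
          [0, 2, -8, 8],
          [-4, 5, 9, -9]]
echelon = to_row_echelon(system)
x = [29, 16, 3]

print([[int(v) for v in row] for row in echelon])
# [[1, -2, 1, 0], [0, 2, -8, 8], [0, 0, 1, 3]]
print(satisfies(system, x), satisfies(echelon, x))  # True True
```

The last row of the reduced matrix reads $x_3 = 3$, and back substitution gives $x_2 = 16$ and $x_1 = 29$, the same solution as the original system.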

#### Left Multiplication (Linear Combination of Rows)

This part is also necessary to better understand matrix multiplication later.

##### Example (Left Multiplication)

We motivate this with an example.

Given a $3 \times 3$ matrix $\A = \begin{bmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \\ \end{bmatrix}$ and a row vector $\x = \begin{bmatrix} a & b & c \end{bmatrix}$, then 

$$\x\A = \begin{bmatrix} a & b & c \end{bmatrix} \begin{bmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \\ \end{bmatrix} = \begin{bmatrix} ax_1+bx_2+cx_3 & ay_1+by_2+cy_3 & az_1+bz_2+cz_3 \end{bmatrix}$$

But notice that the above can also be written as:

$$\x\A = \begin{bmatrix} a & b & c \end{bmatrix} \begin{bmatrix} x_1 & y_1 & z_1 \\ x_2 & y_2 & z_2 \\ x_3 & y_3 & z_3 \\ \end{bmatrix} = \begin{bmatrix} ax_1+bx_2+cx_3 & ay_1+by_2+cy_3 & az_1+bz_2+cz_3 \end{bmatrix}
= \color{red}{a}\begin{bmatrix} x_1 & y_1 & z_1 \end{bmatrix} + \color{green}{b}\begin{bmatrix} x_2 & y_2 & z_2 \end{bmatrix} + \color{blue}{c}\begin{bmatrix} x_3 & y_3 & z_3 \end{bmatrix}$$

**Notice that now $\x\A$ is just a linear combination of the rows of $\A$.**
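A short numerical check of this observation, as a sketch with a hypothetical helper `row_times_matrix` and made-up numbers: the product of a row vector with a matrix is accumulated by adding weighted rows of the matrix.

```python
def row_times_matrix(x, A):
    """Return x A as a linear combination of the rows of A."""
    if len(x) != len(A):
        raise ValueError("x needs one weight per row of A")
    result = [0] * len(A[0])
    for weight, row in zip(x, A):
        # Add weight * (row of A) to the running sum.
        result = [r + weight * entry for r, entry in zip(result, row)]
    return result


A = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 9]]
x = [2, 0, -1]

# 2 * [1, 2, 3] + 0 * [4, 5, 6] - 1 * [7, 8, 9]
print(row_times_matrix(x, A))  # [-5, -4, -3]
```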

#### Matrix Multiplication using Left Multiplication (Rows)

Reusing the notation from **Element Wise Matrix Multiplication**, we can define 

$$\A\B = \begin{bmatrix}\a_1 \\ \a_2 \\  \vdots \\ \a_m \end{bmatrix}\B = \begin{bmatrix}\a_1\B \\ \a_2\B \\  \vdots \\ \a_m\B \end{bmatrix}$$

where $\a_i$ is row $i$ of the matrix $\A$. This means that row $i$ of the matrix $\C = \A\B$ is given by $\a_i\B$. Recall from the section "Matrix-Vector Left Multiplication" that $\a_i\B$ is a linear combination of the rows of $\B$ with the entries of $\a_i$ as weights. This becomes apparent when we come to the chapter on **Row Echelon Form**. 
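The row definition can be compared against the entry-wise definition in a few lines. This is a sketch under assumed names (`row_times_matrix`, `matmul_by_rows`, `matmul_entrywise`) and illustrative matrices; both functions should return the same product.

```python
def row_times_matrix(a, B):
    """Return a B as a linear combination of the rows of B."""
    result = [0] * len(B[0])
    for weight, row in zip(a, B):
        result = [r + weight * entry for r, entry in zip(result, row)]
    return result


def matmul_by_rows(A, B):
    """Row i of A B is (row i of A) times B."""
    return [row_times_matrix(a_i, B) for a_i in A]


def matmul_entrywise(A, B):
    """Entry (i, j) of A B is the sum over k of A[i][k] * B[k][j]."""
    return [
        [sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
        for i in range(len(A))
    ]


A = [[1, 2],
     [0, 1],
     [3, -1]]
B = [[1, 0, 2],
     [4, 1, 0]]

print(matmul_by_rows(A, B))    # [[9, 2, 2], [4, 1, 0], [-1, -1, 6]]
print(matmul_entrywise(A, B))  # [[9, 2, 2], [4, 1, 0], [-1, -1, 6]]
```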

---

They say a picture is worth a thousand words. The image below is taken from Eli Bendersky's website [here](https://eli.thegreenplace.net/2015/visualizing-matrix-multiplication-as-a-linear-combination/).


<img src="https://storage.googleapis.com/reighns/reighns_ml_projects/docs/linear_algebra/visualizing-matrix-multiplication-as-a-linear-combination-row-perspective.PNG" style="margin-left:auto; margin-right:auto"/>
<p style="text-align: center">
    <b>Matrix Multiplication, Row Perspective; Courtesy of Eli Bendersky</b>
</p>

##### Significance

The row perspective is useful, for example in principal components analysis, where the rows of the right matrix contain data (observations in rows and features in columns) and the rows of the left matrix contain weights for combining the features. Then the weighted sum of data creates the principal component scores. - **Mike X Cohen: Linear Algebra: Theory, Intuition, Code, 2021 (p. 147)**

Row Exchange:
$$
\begin{align}
x_1- 2x_2+x_3&=0\\
2x_2-8x_3&=8\\
-4x_1+5x_2+9x_3&=-9
\end{align}
$$

vs

$$
\begin{align}
2x_2-8x_3&=8\\
x_1- 2x_2+x_3&=0\\
-4x_1+5x_2+9x_3&=-9
\end{align}
$$

There is no difference between the two systems: we just swapped rows 1 and 2, so the solution set is unchanged. We can do the same on the augmented matrix for convenience.

Also, given

$$
\P = \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \\ \end{bmatrix}
,\quad
\A = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \\ \end{bmatrix}
$$

then 

$$\P\A = \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \\ \end{bmatrix} \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \\ \end{bmatrix} = \begin{bmatrix} 4 & 5 & 6 \\ 1 & 2 & 3 \\ 7 & 8 & 9 \\ \end{bmatrix}$$

and notice that rows 1 and 2 are swapped by left multiplication by the permutation matrix $\P$. Why did it work?

Recall now

$$\P\A = \begin{bmatrix}\p_1 \\ \p_2 \\ \p_3 \end{bmatrix}\A = \begin{bmatrix}\p_1\A \\ \p_2\A \\ \p_3\A \end{bmatrix}$$

Look at the first row of $\P\A$, which is given by $\p_1\A$, a linear combination of the rows of $\A$ with the entries of $\p_1$ as weights:

$$\p_1\A = 0 \begin{bmatrix} 1 & 2 & 3 \end{bmatrix} + 1 \begin{bmatrix} 4 & 5 & 6 \end{bmatrix} + 0 \begin{bmatrix} 7 & 8 & 9 \end{bmatrix} = \begin{bmatrix} 4 & 5 & 6 \end{bmatrix}$$

The remaining rows follow the same logic:

$$\p_2\A = 1 \begin{bmatrix} 1 & 2 & 3 \end{bmatrix} + 0 \begin{bmatrix} 4 & 5 & 6  \end{bmatrix} + 0 \begin{bmatrix} 7 & 8 & 9 \end{bmatrix} = \begin{bmatrix} 1 & 2 & 3 \end{bmatrix}$$

$$\p_3\A = 0 \begin{bmatrix} 1 & 2 & 3 \end{bmatrix} + 0 \begin{bmatrix} 4 & 5 & 6  \end{bmatrix} + 1 \begin{bmatrix} 7 & 8 & 9  \end{bmatrix} = \begin{bmatrix} 7 & 8 & 9 \end{bmatrix}$$

We now see, through **Matrix Multiplication (Left, row-wise)**, why the **Permutation Matrix** works the way it does!
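The same computation can be reproduced in code. In this sketch (helper names `identity`, `swap_matrix`, and `matmul` are assumptions, and indices are 0-based), the permutation matrix is built by swapping two rows of the identity, and left multiplication by it swaps the same two rows of $\A$.

```python
def identity(n):
    """Return the n x n identity matrix."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def swap_matrix(n, i, j):
    """Permutation matrix obtained by swapping rows i and j of the identity."""
    P = identity(n)
    P[i], P[j] = P[j], P[i]
    return P


def matmul(A, B):
    """Entry (i, j) of A B is the sum over k of A[i][k] * B[k][j]."""
    return [
        [sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
        for i in range(len(A))
    ]


P = swap_matrix(3, 0, 1)
A = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 9]]

print(P)             # [[0, 1, 0], [1, 0, 0], [0, 0, 1]]
print(matmul(P, A))  # [[4, 5, 6], [1, 2, 3], [7, 8, 9]]

# Swapping the same two rows twice restores the original matrix.
print(matmul(P, matmul(P, A)) == A)  # True
```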



https://math.stackexchange.com/questions/1634411/why-adding-or-subtracting-linear-equations-finds-their-intersection-point