# MCSE 6309 and EMoS 6309: 
# Machine Learning
## Linear Algebra
### June, 2019

__Linear algebra__ is the study of vectors and certain rules to manipulate vectors.
Linear algebra plays an important role in machine learning and general mathematics.

Examples of vector objects:
* Geometric vectors are directed segments that can be drawn (at least in two dimensions). Their direction and magnitude give us an intuitive way to reason about mathematical operations.
* Polynomials are also vectors.
* Audio signals are vectors.
* Elements of $\mathbb{R}^n$ (tuples of $n$ real numbers) are vectors. For instance,
\begin{align}
\mathbf{a} = \begin{bmatrix}1 \\ 2 \\ 3\end{bmatrix} \in \mathbb{R}^3
\label{eq1}\tag{2.1} 
\end{align}
is an example of a triplet of numbers. Adding two vectors $\mathbf{a}, \mathbf{b} \in \mathbb{R}^n$ component-wise results in another vector: $\mathbf{a} + \mathbf{b} = \mathbf{c} \in \mathbb{R}^n$. Moreover, multiplying $\mathbf{a} \in \mathbb{R}^n$ by $\lambda \in \mathbb{R}$ results in a scaled vector $\lambda\mathbf{a} \in \mathbb{R}^n$.
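
The two operations can be tried out directly. The following is a minimal sketch using NumPy; the second vector `b` and the scalar `lam` are made-up values chosen only for illustration.

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])   # the vector a from (2.1)
b = np.array([4.0, 5.0, 6.0])   # hypothetical second vector in R^3
lam = 2.0                       # hypothetical scalar

c = a + b          # component-wise addition, again a vector in R^3
scaled = lam * a   # every component of a multiplied by lam

print(c, scaled)
```

Both results have the same shape as `a`, which mirrors the statement that addition and scaling stay inside $\mathbb{R}^n$.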

The concept of a vector space and its properties underlie much of machine learning.

## Types of Vectors
![vector](figsML/vect.png)

* Linear algebra focuses on the similarities between these vector concepts.
* We can add them together and multiply them by scalars. 
* We will largely focus on vectors in $\mathbb{R}^n $ since most algorithms in linear algebra are formulated in $\mathbb{R}^n $.
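
To illustrate that a non-geometric object behaves the same way, the sketch below treats polynomials as vectors of coefficients, using NumPy's `numpy.polynomial.Polynomial` class. The two polynomials are arbitrary examples, not taken from the notes.

```python
import numpy as np
from numpy.polynomial import Polynomial

# Coefficients are listed from the constant term upwards.
p = Polynomial([1.0, 0.0, 2.0])    # p(x) = 1 + 2x^2
q = Polynomial([0.0, 3.0, -1.0])   # q(x) = 3x - x^2

s = p + q        # adding polynomials adds their coefficient vectors
t = 2.0 * p      # scaling a polynomial scales its coefficient vector

print(s.coef, t.coef)
```

The coefficient arrays of `s` and `t` are exactly what component-wise addition and scalar multiplication of the coefficient vectors would give.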

## 2.1 Systems of Linear Equations
Systems of linear equations play a central part in linear algebra. Many problems can be formulated as systems of linear equations, and linear algebra gives us the tools for solving them.

>__Example 2.1__

>A company produces products $N_{1}, \ldots, N_{n}$ for which resources
$R_{1}, \ldots, R_{m}$ are required. To produce a unit of product $N_{j}$, $a_{ij}$ units of
resource $R_{i}$ are needed, where $i = 1, \ldots, m$ and $j = 1, \ldots, n$. The objective is to find an optimal production plan, i.e., a plan of how many units $x_{j}$ of product $N_{j}$ should be produced if a total of $b_{i}$ units of resource $R_{i}$ are available and (ideally) no resources are left over. If we produce $x_{1}, \ldots, x_{n}$ units of the corresponding products, we need a total of

\begin{align}
a_{i1}x_{1} + \cdots + a_{in}x_{n}
\label{eq2}\tag{2.2}
\end{align}

many units of resource $R_{i}$. 




An optimal production plan $(x_{1}, \ldots, x_{n}) \in \mathbb{R}^n$, therefore, has to satisfy the following system of equations:
\begin{align}
a_{11}x_{1} +  &\ldots + a_{1n}x_{n} = b_{1}\\
&\vdots\\
a_{m1}x_{1} +  &\ldots + a_{mn}x_{n} = b_{m}
\label{eq3}\tag{2.3}
\end{align}
where $a_{ij} \in \mathbb{R}$ and $b_{i} \in \mathbb{R}$.

Equation \ref{eq3} is the general form of a system of linear equations, and
$x_{1}, \ldots, x_{n}$ are the _unknowns_ of this system. Every $n$-tuple $(x_{1}, \ldots, x_{n}) \in \mathbb{R}^n$ that satisfies \ref{eq3} is a solution of the linear equation system.
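
As a small illustration of the production example, the sketch below checks whether a candidate plan satisfies a system of this form. All numbers (two products, two resources) are hypothetical, and the helper names `resources_used` and `is_solution` are introduced here only for illustration.

```python
import numpy as np

# Hypothetical data: a[i, j] = units of resource R_i needed per unit of product N_j.
a = np.array([[2.0, 1.0],
              [1.0, 3.0]])
b = np.array([10.0, 15.0])   # hypothetical available units of R_1 and R_2


def resources_used(a, x):
    """Left-hand sides of (2.3): sum over j of a[i, j] * x[j], for every resource i."""
    return a @ x


def is_solution(a, x, b):
    """True if plan x uses up every resource exactly, i.e. satisfies (2.3)."""
    return bool(np.allclose(resources_used(a, x), b))


plan = np.array([3.0, 4.0])    # candidate plan (x_1, x_2)
other = np.array([5.0, 0.0])   # a plan that leaves part of R_2 unused

print(is_solution(a, plan, b), is_solution(a, other, b))
```

Only `plan` is a solution in the sense defined above; `other` consumes all of $R_1$ but not all of $R_2$.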

## Example 2.2
The system of linear equations
\begin{align}
x_{1} + x_{2} &+ x_{3} &= 3\\
x_{1} - x_{2} &+ 2x_{3} &= 2\\
2x_{1} \qquad       &+3x_{3} &=1
\label{eq4}\tag{2.4}
\end{align}

has _no solution_: Adding the first two equations yields $2x_{1}+3x_{3} = 5$, which contradicts the third equation. 
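
The contradiction can also be checked numerically. This is a minimal sketch with NumPy, storing the coefficients of (2.4) row by row; the array names are illustrative.

```python
import numpy as np

# Coefficients and right-hand sides of system (2.4), one row per equation.
A = np.array([[1.0,  1.0, 1.0],
              [1.0, -1.0, 2.0],
              [2.0,  0.0, 3.0]])
b = np.array([3.0, 2.0, 1.0])

# Adding the first two equations reproduces the left-hand side of the third ...
lhs_matches = bool(np.allclose(A[0] + A[1], A[2]))
# ... but the summed right-hand side is 5, while the third equation demands 1.
rhs_sum = b[0] + b[1]

print(lhs_matches, rhs_sum, b[2])
```

Equal left-hand sides with different right-hand sides cannot hold at the same time, so no choice of $(x_1, x_2, x_3)$ satisfies all three equations.
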
Consider the system of linear equations
\begin{align}
x_{1} + x_{2} &+ x_{3} &= 3\\
x_{1} - x_{2} &+ 2x_{3} &= 2\\
\qquad x_{2}  &+ x_{3} &=2
\label{eq5}\tag{2.5}
\end{align}
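Subtracting the third equation from the first gives $x_{1}=1$. Adding the first two equations gives $2x_{1}+3x_{3}=5$, so $x_{3}=1$, and the third equation then gives $x_{2}=1$. Therefore, $(1,1,1)$ is the only possible and hence _unique solution_.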
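
The same result can be obtained numerically. The sketch below uses `np.linalg.solve`, a NumPy routine that is not part of these notes and that assumes a square system with exactly one solution.

```python
import numpy as np

# Coefficients and right-hand sides of system (2.5), one row per equation.
A = np.array([[1.0,  1.0, 1.0],
              [1.0, -1.0, 2.0],
              [0.0,  1.0, 1.0]])
b = np.array([3.0, 2.0, 2.0])

x = np.linalg.solve(A, b)                   # solves for (x_1, x_2, x_3)
residual_ok = bool(np.allclose(A @ x, b))   # substitute back into (2.5)

print(x, residual_ok)
```

Up to floating-point rounding, `x` agrees with the hand-computed solution $(1,1,1)$.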

Consider a third example,
\begin{align}
x_{1} + x_{2} &+ x_{3} &= 3\\
x_{1} - x_{2} &+ 2x_{3} &= 2\\
2x_{1} \qquad &+ 3x_{3} &=5
\label{eq6}\tag{2.6}
\end{align}

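Here the third equation is the sum of the first two, so it adds no new information. Adding the first two equations gives $2x_{1}=5-3x_{3}$, and subtracting the second from the first gives $2x_{2} = 1+x_{3}$. We define $x_{3}=a \in \mathbb{R}$ as a free variable, such that any triplet 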
\begin{align}
\left(\frac{5}{2}-\frac{3}{2}a,\frac{1}{2}+\frac{1}{2}a,a\right), a \in \mathbb{R}
\label{eq7}\tag{2.7}
\end{align}
is a solution of the system of linear equations, i.e., we obtain a solution set that contains infinitely many solutions.
In general, for a real-valued system of linear equations we obtain either no, exactly one or infinitely many solutions.
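
The infinite solution set of (2.6) can be spot-checked by substituting several values of the free variable into the system. This is a minimal sketch; the function name `family` and the sample values of $a$ are chosen here for illustration.

```python
import numpy as np

# Coefficients and right-hand sides of system (2.6), one row per equation.
A = np.array([[1.0,  1.0, 1.0],
              [1.0, -1.0, 2.0],
              [2.0,  0.0, 3.0]])
b = np.array([3.0, 2.0, 5.0])


def family(a):
    """The triplet from (2.7) for a given value of the free variable a."""
    return np.array([2.5 - 1.5 * a, 0.5 + 0.5 * a, a])


# Every sampled member of the family should satisfy all three equations.
samples = (-2.0, 0.0, 1.0, 3.5)
all_ok = all(np.allclose(A @ family(a), b) for a in samples)

print(all_ok)
```

A finite number of samples does not prove the claim for all $a \in \mathbb{R}$, but it is a quick sanity check on the algebra above.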

## 2.2 Matrices
Matrices play a central role in linear algebra.
By definition, with $m, n \in \mathbb{N}$, a real-valued $(m,n)$ matrix $\mathbf{A}$ is an $m \cdot n$-tuple of elements $a_{ij}, i=1,\ldots, m, j=1,\ldots,n$, which is ordered according to a rectangular scheme consisting of $m$ rows and $n$ columns:

\begin{align}
\mathbf{A}=\left[
\begin{array}{cccc}
a_{11}&a_{12} &\ldots &a_{1n}\\
a_{21}& a_{22} & \ldots &a_{2n}\\
\vdots & \vdots & &\vdots \\
a_{m1} & a_{m2}& \ldots & a_{mn}
\end{array}
\right],  a_{ij} \in \mathbb{R}
\label{eq8}\tag{2.8}
\end{align}

$\mathbb{R}^{m \times n}$ is the set of all real-valued $(m,n)$-matrices. $\mathbf{A} \in \mathbb{R}^{m \times n}$ can be equivalently represented as $\mathbf{a} \in \mathbb{R}^{mn}$ by stacking all $n$ columns of the matrix into a long vector.
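
The column stacking can be sketched in NumPy as follows. The matrix is a made-up $2 \times 3$ example. NumPy reads arrays row by row by default, so `order="F"` (column-major order) is needed to stack columns rather than rows.

```python
import numpy as np

A = np.array([[1, 2, 3],
              [4, 5, 6]])          # hypothetical matrix with m = 2 rows, n = 3 columns

a = A.reshape(-1, order="F")       # stack the 3 columns into one vector of length m * n
A_back = a.reshape(A.shape, order="F")   # undo the stacking

print(A.shape, a, a.shape)
```

Reshaping back with the same order recovers the original matrix, which is why the two representations are equivalent.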

## 2.2.1 Properties of Matrices
1. Associativity:
\begin{align}
\forall\mathbf{A} \in \mathbb{R}^{m \times n},\mathbf{B} \in \mathbb{R}^{n \times p},\mathbf{C} \in \mathbb{R}^{p \times q}: (\mathbf{AB})\mathbf{C} = \mathbf{A}(\mathbf{BC})
\label{eq9}\tag{2.9}
\end{align}
2. Distributivity
\begin{align}
\forall\mathbf{A}, \mathbf{B} \in \mathbb{R}^{m \times n},\mathbf{C}, \mathbf{D} \in \mathbb{R}^{n \times p}: (\mathbf{A}+\mathbf{B})\mathbf{C} = \mathbf{AC}+\mathbf{BC}\\
\mathbf{A}(\mathbf{C+D}) = \mathbf{AC}+\mathbf{AD}
\label{eq10}\tag{2.10}
\end{align}
3. Multiplication with the identity matrix
\begin{align}
\forall\mathbf{A} \in \mathbb{R}^{m \times n}: \mathbf{I}_{m} \mathbf{A} = \mathbf{AI}_{n} =\mathbf{A}\label{eq11}\tag{2.11}
\end{align}
Note that $\mathbf{I}_{m}\neq \mathbf{I}_{n}$ for $m \neq n$.
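
These three properties can be spot-checked numerically. The sketch below uses random matrices with arbitrarily chosen sizes and compares both sides with `np.allclose`, since floating-point products agree only up to rounding. It illustrates the identities and does not prove them.

```python
import numpy as np

rng = np.random.default_rng(0)   # fixed seed so the check is reproducible
m, n, p, q = 2, 3, 4, 5          # arbitrary sizes chosen for illustration

A, B = rng.standard_normal((m, n)), rng.standard_normal((m, n))
C, D = rng.standard_normal((n, p)), rng.standard_normal((n, p))
E = rng.standard_normal((p, q))

# Associativity (2.9): (AC)E = A(CE)
assoc = bool(np.allclose((A @ C) @ E, A @ (C @ E)))

# Distributivity (2.10): (A + B)C = AC + BC and A(C + D) = AC + AD
dist_right = bool(np.allclose((A + B) @ C, A @ C + B @ C))
dist_left = bool(np.allclose(A @ (C + D), A @ C + A @ D))

# Identity (2.11): I_m A = A I_n = A, with identity matrices of different sizes
ident = bool(np.allclose(np.eye(m) @ A, A) and np.allclose(A @ np.eye(n), A))

print(assoc, dist_right, dist_left, ident)
```

Note that `np.eye(m)` and `np.eye(n)` have different shapes here, matching the remark that $\mathbf{I}_{m}\neq \mathbf{I}_{n}$ for $m \neq n$.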

## Assignment:  Read Chapter 2 of the MML book by Deisenroth et al. (2019)
