# Linear Algebra for Machine Learning

## Module 2: Vectors are objects that move around space

### 1. Modulus and inner product

Vectors have two essential mathematical properties:

- The **length** of the vector, also called its modulus, size, or norm
- The **inner product** of two vectors, which gives access to angles, projections, and scaling

#### 1.1 Modulus (Norm)

For a vector:

$$
\mathbf{v} = (v_1, v_2, \dots, v_n)
$$


The norm is:

$$
\|\mathbf{v}\| = \sqrt{v_1^2 + v_2^2 + \dots + v_n^2}
$$
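
As a minimal sketch of the formula above, the norm can be computed in plain Python. The function name `norm` and the sample vector are illustrative choices, not part of the course material.

```python
import math

def norm(v):
    """Euclidean length of a vector given as a sequence of numbers."""
    # Square every component, add them up, then take the square root.
    return math.sqrt(sum(x * x for x in v))

print(norm([3, 4]))  # 5.0
```

The vector (3, 4) is the classic 3-4-5 right triangle, so its length is 5.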

#### 1.2 Inner Product

For two vectors:

$$
\mathbf{u} = (u_1, u_2, \dots, u_n), \quad \mathbf{v} = (v_1, v_2, \dots, v_n)
$$


The inner product is:

$$
\langle \mathbf{u}, \mathbf{v} \rangle = u_1 v_1 + u_2 v_2 + \dots + u_n v_n
$$


For real coordinate vectors, this inner product is also known as:
- Dot product, written $\mathbf{u} \cdot \mathbf{v}$
- Scalar product, because the result is a single number
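
The component-wise sum can be sketched in a few lines of Python. The helper name `dot` and the example vectors are assumptions made for illustration.

```python
def dot(u, v):
    """Inner (dot) product of two vectors of equal length."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same number of components")
    # Multiply matching components and add the results.
    return sum(a * b for a, b in zip(u, v))

print(dot([1, 2, 3], [4, 5, 6]))  # 32
print(dot([3, 4], [3, 4]))        # 25
```

The second call shows that the inner product of a vector with itself equals the square of its norm: (3, 4) has length 5 and 5² = 25.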


### 2. Cosine & dot product

The dot product relates to the angle between vectors:

$$
\langle \mathbf{u}, \mathbf{v} \rangle = \|\mathbf{u}\| \, \|\mathbf{v}\| \cos(\theta)
$$


Thus:

$$
\cos(\theta) = \frac{\langle \mathbf{u}, \mathbf{v} \rangle}{\|\mathbf{u}\| \, \|\mathbf{v}\|}
$$


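Interpretation (for nonzero vectors):
- $\cos(\theta) = 1$ → parallel, pointing in the same direction
- $\cos(\theta) = -1$ → parallel, pointing in opposite directions
- $\cos(\theta) = 0$ → orthogonal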
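
The cosine formula can be tried out numerically. The sketch below assumes plain Python lists as vectors; the names `dot`, `norm`, and `cosine` are illustrative, and the clamp to [-1, 1] is an added safeguard against floating-point rounding.

```python
import math

def dot(u, v):
    """Sum of element-wise products of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def norm(v):
    """Euclidean length, using the fact that <v, v> = ||v||^2."""
    return math.sqrt(dot(v, v))

def cosine(u, v):
    """Cosine of the angle between two nonzero vectors."""
    nu, nv = norm(u), norm(v)
    if nu == 0 or nv == 0:
        raise ValueError("the angle is undefined for a zero vector")
    c = dot(u, v) / (nu * nv)
    # Rounding can push the ratio slightly outside [-1, 1]; clamp it.
    return max(-1.0, min(1.0, c))

print(cosine([1, 0], [0, 2]))    # 0.0  (orthogonal)
print(cosine([1, 0], [-3, 0]))   # -1.0 (opposite)
print(cosine([1, 2], [2, 4]))    # very close to 1 (parallel)
```

If the angle itself is needed, `math.acos(cosine(u, v))` returns it in radians.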

### 3. Projection

The projection of one vector onto another can be understood in two complementary ways: 
1. **Scalar projection** → the *length* (with sign) of the shadow of **u** onto **v** 
2. **Vector projection** → the *actual vector* representing that shadow along the direction of **v**


#### 3.1 Scalar Projection

The scalar projection of **u** onto **v** is: 
$$
\text{scalar proj}_{\mathbf{v}}(\mathbf{u}) = \frac{\langle \mathbf{u}, \mathbf{v} \rangle}{\|\mathbf{v}\|} 
$$

This gives a **number**, not a vector. 
- Positive → the angle between **u** and **v** is acute 
- Zero → the vectors are orthogonal 
- Negative → the angle is obtuse
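
A short Python sketch of the scalar projection, with the sign cases listed above. The function names and the sample vectors are assumptions chosen so the arithmetic is easy to check by hand.

```python
import math

def dot(u, v):
    """Sum of element-wise products of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def scalar_projection(u, v):
    """Signed length of the shadow of u along the direction of v."""
    length_v = math.sqrt(dot(v, v))
    if length_v == 0:
        raise ValueError("cannot project onto the zero vector")
    return dot(u, v) / length_v

v = [4, 0]
print(scalar_projection([2, 3], v))    # 2.0  (acute angle)
print(scalar_projection([0, 3], v))    # 0.0  (orthogonal)
print(scalar_projection([-2, 3], v))   # -2.0 (obtuse angle)
```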

#### 3.2 Vector Projection

The vector projection of **u** onto **v** is: 
$$
\text{proj}_{\mathbf{v}}(\mathbf{u}) = \left( \frac{\langle \mathbf{u}, \mathbf{v} \rangle}{\|\mathbf{v}\|^2} \right) \mathbf{v} 
$$

This gives the **vector** that represents the shadow of **u** on **v**.
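
The vector projection can be sketched the same way. The helper names below are illustrative; the example also checks that what is left of **u** after removing its shadow is orthogonal to **v**.

```python
def dot(u, v):
    """Sum of element-wise products of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def vector_projection(u, v):
    """Shadow of u on v, returned as a vector along v."""
    vv = dot(v, v)  # equals ||v||^2
    if vv == 0:
        raise ValueError("cannot project onto the zero vector")
    scale = dot(u, v) / vv
    return [scale * x for x in v]

u, v = [2, 3], [4, 0]
p = vector_projection(u, v)
print(p)  # [2.0, 0.0]

# The remainder u - p has no component along v.
remainder = [a - b for a, b in zip(u, p)]
print(dot(remainder, v))  # 0.0
```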

### 4. Properties of the dot product


#### 4.1 Commutative

$$
\langle \mathbf{u}, \mathbf{v} \rangle = \langle \mathbf{v}, \mathbf{u} \rangle
$$

#### 4.2 Distributive

$$
\langle \mathbf{u}, \mathbf{v} + \mathbf{w} \rangle =
\langle \mathbf{u}, \mathbf{v} \rangle + \langle \mathbf{u}, \mathbf{w} \rangle
$$

#### 4.3 Associative over scalar multiplication

$$
\langle c\mathbf{u}, \mathbf{v} \rangle = c \langle \mathbf{u}, \mathbf{v} \rangle
$$
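
The three properties can be spot-checked on concrete numbers. This is only a numerical illustration on arbitrarily chosen integer vectors, not a proof; the names `dot`, `u`, `v`, `w`, and `c` are assumptions.

```python
def dot(u, v):
    """Sum of element-wise products of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

u, v, w = [1, 2, 3], [4, -5, 6], [-7, 8, 9]
c = 3

# Commutative: <u, v> = <v, u>
print(dot(u, v) == dot(v, u))  # True

# Distributive: <u, v + w> = <u, v> + <u, w>
v_plus_w = [a + b for a, b in zip(v, w)]
print(dot(u, v_plus_w) == dot(u, v) + dot(u, w))  # True

# Associative over scalar multiplication: <c u, v> = c <u, v>
cu = [c * a for a in u]
print(dot(cu, v) == c * dot(u, v))  # True
```

Integer components keep every comparison exact; with floating-point inputs the two sides may differ by rounding error.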

### 5. Changing basis

Changing basis means expressing the **same vector** using a **different coordinate system**. The vector does not change — only its coordinates change depending on the basis. We do this because some bases make computations easier, align with the geometry of the problem, or simplify transformations (e.g., PCA, eigenvectors, ML feature spaces).

#### 5.1 Coordinates in a New Basis

To express a vector **v** in a new basis **B = {b₁, b₂, …, bₙ}**, we want to find the coefficients:

$$
v = c_1 b_1 + c_2 b_2 + \dots + c_n b_n
$$


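If the basis vectors are mutually **orthogonal** (that is, $b_i \cdot b_j = 0$ whenever $i \neq j$), each coefficient can be computed using dot products. Taking the dot product of both sides of the equation above with $b_i$ removes every term except $c_i \, (b_i \cdot b_i)$, which gives: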

$$
c_1 = \frac{v \cdot b_1}{b_1 \cdot b_1}, \quad 
c_2 = \frac{v \cdot b_2}{b_2 \cdot b_2}, \quad 
\dots, \quad 
c_n = \frac{v \cdot b_n}{b_n \cdot b_n}
$$


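These coefficients are the coordinates of **v** in the basis **B**, that is, the same vector described in the new coordinate system:

$$
[v]_B = (c_1, c_2, \dots, c_n)
$$

Linear independence alone is not enough for the dot-product formula: if the basis vectors are not orthogonal, the coefficients have to be found by solving the linear system $v = c_1 b_1 + \dots + c_n b_n$ instead.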
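
A small Python sketch of a change of basis by projection, assuming the basis vectors are mutually orthogonal. The function name `coords_in_orthogonal_basis`, the tolerance, and the example vectors are illustrative choices.

```python
def dot(u, v):
    """Sum of element-wise products of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def coords_in_orthogonal_basis(v, basis, tol=1e-12):
    """Coordinates of v in a basis whose vectors are mutually orthogonal."""
    # The projection formula is only valid if every pair of basis
    # vectors is orthogonal, so check that first.
    for i in range(len(basis)):
        for j in range(i + 1, len(basis)):
            if abs(dot(basis[i], basis[j])) > tol:
                raise ValueError("basis vectors are not orthogonal")
    return [dot(v, b) / dot(b, b) for b in basis]

b1, b2 = [1, 2], [-2, 1]   # illustrative orthogonal basis: b1 . b2 = 0
v = [5, 5]
c = coords_in_orthogonal_basis(v, [b1, b2])
print(c)  # [3.0, -1.0]

# Rebuild v from its new coordinates as a sanity check.
rebuilt = [c[0] * x + c[1] * y for x, y in zip(b1, b2)]
print(rebuilt)  # [5.0, 5.0]
```

The same vector is written (5, 5) in the standard basis and (3, -1) in the basis {b1, b2}.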


#### 5.2 Linear Independence

A set of vectors is **linearly independent** if the only solution to

$$
c_1 v_1 + c_2 v_2 + \cdots + c_n v_n = \mathbf{0}
$$

is
$$
c_1 = c_2 = \cdots = c_n = 0.
$$


This means no vector in the set can be written as a linear combination of the others.  
Each vector contributes a new, unique direction in the space.

##### 5.2.1 Example

Let 
$$
v_1 = \begin{bmatrix} 1 \\ 2 \end{bmatrix}, \qquad 
v_2 = \begin{bmatrix} -2 \\ 1 \end{bmatrix}. 
$$

Check independence:
$$
c_1 v_1 + c_2 v_2 = \mathbf{0}
$$

$$
c_1 \begin{bmatrix}1 \\ 2\end{bmatrix} 
+ 
c_2 \begin{bmatrix}-2 \\ 1\end{bmatrix} 
= 
\begin{bmatrix}0 \\ 0\end{bmatrix} 
$$

This produces the system:
$$
\begin{cases} 
c_1 - 2c_2 = 0 \\ 
2c_1 + c_2 = 0 
\end{cases} 
$$

The first equation gives $c_1 = 2c_2$. Substituting this into the second equation gives $5c_2 = 0$, so:
$$
c_1 = 0, \quad c_2 = 0 
$$

Therefore, the vectors are **linearly independent**.
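
For two vectors in the plane, the check can be automated. The sketch below uses a shortcut that is not derived in these notes: the 2×2 system has only the zero solution exactly when the determinant of the matrix with columns v1 and v2 is nonzero. The function name `independent_2d` and the tolerance are assumptions.

```python
def independent_2d(v1, v2, tol=1e-12):
    """True if two 2D vectors are linearly independent."""
    # Determinant of the matrix whose columns are v1 and v2.
    det = v1[0] * v2[1] - v2[0] * v1[1]
    return abs(det) > tol

print(independent_2d([1, 2], [-2, 1]))  # True  (determinant is 5)
print(independent_2d([1, 2], [2, 4]))   # False (second is 2 x first)
```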

### 6. Summary

- This module introduced the core ideas of vectors and their geometric meaning.
- It covered how to compute dot products, projections, magnitudes, and angles.
- Orthogonal and linearly independent vectors were used to build and understand bases.
- Coordinate changes were performed using projections onto orthogonal basis vectors.
- Vector equations were applied to real scenarios such as motion, velocity, and direction.
- In the next notebook, we will begin working with matrices and matrix operations.