
---

### **Basic Questions**  

#### **1. What is a scalar, vector, matrix, and tensor?**  
✅ **Answer:**  
- **Scalar**: A single number (e.g., 5, -3.2).  
- **Vector**: A 1D array of numbers (e.g., $\mathbf{v} = [2, 5, -1]$).  
- **Matrix**: A 2D array of numbers, e.g.  
  $
  A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}
  $
- **Tensor**: An array with any number of dimensions; scalars, vectors, and matrices are the 0D, 1D, and 2D cases (e.g., a 3D array used in deep learning).  
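
As a rough illustration, the four objects differ only in their number of dimensions. The sketch below assumes NumPy is available; the variable names are illustrative and not from the original notes.

```python
import numpy as np

scalar = np.array(5.0)                # 0 dimensions
vector = np.array([2, 5, -1])         # 1 dimension
matrix = np.array([[1, 2], [3, 4]])   # 2 dimensions
tensor = np.zeros((2, 3, 4))          # 3 dimensions (shape chosen arbitrarily)

for name, x in [("scalar", scalar), ("vector", vector),
                ("matrix", matrix), ("tensor", tensor)]:
    print(name, "ndim =", x.ndim, "shape =", x.shape)
```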

---

#### **2. What is the rank of a matrix?**  
✅ **Answer:**  
The **rank** of a matrix is the maximum number of **linearly independent** rows (equivalently, columns) in the matrix. It equals the dimension of the vector space spanned by its columns (the column space).  

For example,  
$
A = \begin{bmatrix} 1 & 2 \\ 3 & 6 \end{bmatrix}
$  
The second column is just **2 times** the first column, so there is only **one independent column**, meaning **rank(A) = 1**.  
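
The same result can be checked numerically. This is a minimal sketch assuming NumPy; `np.linalg.matrix_rank` estimates the rank from the singular values, and the matrix `B` is an extra full-rank example added for contrast.

```python
import numpy as np

A = np.array([[1, 2], [3, 6]])   # second column = 2 * first column
B = np.array([[1, 2], [3, 4]])   # columns are linearly independent

rank_A = np.linalg.matrix_rank(A)
rank_B = np.linalg.matrix_rank(B)
print(rank_A, rank_B)  # 1 2
```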

---

#### **3. What is the determinant of a matrix, and what does it tell us?**  
✅ **Answer:**  
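The **determinant** of a square matrix $A$ is a scalar that measures how the linear transformation $A$ scales area or volume (with a sign for orientation). It is non-zero exactly when the matrix is **invertible**, and zero when the matrix is **singular**.  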

For a $2 \times 2$ matrix:  
$
\text{det}(A) = \begin{vmatrix} a & b \\ c & d \end{vmatrix} = ad - bc
$  
If **det(A) = 0**, the matrix is **singular (not invertible)**.  

Example:  

$
A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}, \quad \text{det}(A) = (1)(4) - (2)(3) = 4 - 6 = -2
$  

Since **det(A) ≠ 0**, the matrix is **invertible**.
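
A short sketch of the same computation, assuming NumPy. The second matrix (`singular`) is an illustrative addition showing the zero-determinant case; because `np.linalg.det` works in floating point, comparisons use a tolerance.

```python
import numpy as np

A = np.array([[1.0, 2.0], [3.0, 4.0]])
a, b, c, d = A.ravel()

det_manual = a * d - b * c        # the 2x2 formula ad - bc
det_numpy = np.linalg.det(A)      # LU-based, so expect tiny rounding error

singular = np.array([[1.0, 2.0], [3.0, 6.0]])   # columns are dependent
is_singular = np.isclose(np.linalg.det(singular), 0.0)

print(det_manual, det_numpy, is_singular)
```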

---

### **Intermediate Questions**  

#### **4. What is an identity matrix, and why is it important?**  
✅ **Answer:**  
The **identity matrix** $ I_n $ is a **square matrix** with **1s on the diagonal** and **0s everywhere else**.  

Example (for $3 \times 3$ matrix):  
$
I_3 = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}
$  
It acts as the **multiplicative identity** in matrix multiplication:  

$
A \times I = I \times A = A
$  

It is important because it plays the same role as **1 in scalar multiplication**.

---

#### **5. What is matrix inversion, and when does a matrix have an inverse?**  
✅ **Answer:**  
The **inverse** of a square matrix $A$ is another matrix $A^{-1}$ such that:  

$
A \times A^{-1} = A^{-1} \times A = I
$  

A matrix **has an inverse** if and only if its **determinant is non-zero** (**det(A) ≠ 0**).  

For a **$2 \times 2$ matrix**, the inverse is:  

$
A^{-1} = \frac{1}{\text{det}(A)} \begin{bmatrix} d & -b \\ -c & a \end{bmatrix}
$  

Example:  

$
A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}, \quad \text{det}(A) = -2
$  

$
A^{-1} = \frac{1}{-2} \begin{bmatrix} 4 & -2 \\ -3 & 1 \end{bmatrix} = \begin{bmatrix} -2 & 1 \\ 1.5 & -0.5 \end{bmatrix}
$
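
The worked example can be verified with a minimal NumPy sketch. The right-hand side `b` is a made-up vector used only to show that, when the goal is to solve a linear system, `np.linalg.solve` gives the same answer without forming the inverse explicitly.

```python
import numpy as np

A = np.array([[1.0, 2.0], [3.0, 4.0]])
a, b_, c, d = A.ravel()

# Closed-form 2x2 inverse: (1 / det) * [[d, -b], [-c, a]]
det_A = a * d - b_ * c
A_inv_manual = (1.0 / det_A) * np.array([[d, -b_], [-c, a]])
A_inv = np.linalg.inv(A)

# Solving A x = b (b is a hypothetical right-hand side)
b = np.array([5.0, 6.0])
x_via_inverse = A_inv @ b
x_via_solve = np.linalg.solve(A, b)

print(A_inv)
```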

---

#### **6. What is eigenvalue decomposition?**  
✅ **Answer:**  
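Eigenvalue decomposition factors a square, diagonalizable matrix as $ A = V \Lambda V^{-1} $, where the columns of $ V $ are **eigenvectors** and $ \Lambda $ is a diagonal matrix of the corresponding **eigenvalues**. Each eigenvalue and eigenvector pair satisfies:  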

$
A \mathbf{v} = \lambda \mathbf{v}
$  

where:  

- $ A $ is a square matrix.  
- $ \lambda $ is an **eigenvalue**.  
- $ \mathbf{v} $ is an **eigenvector**.  

Example:  
For  

$
A = \begin{bmatrix} 4 & -2 \\ 1 & 1 \end{bmatrix}
$  

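The characteristic equation is $ \lambda^2 - 5\lambda + 6 = 0 $, so the eigenvalues are **2 and 3**. The corresponding eigenvectors are found by solving **$ (A - \lambda I) \mathbf{v} = 0 $**, which gives $ \mathbf{v} = [1, 1] $ for $ \lambda = 2 $ and $ \mathbf{v} = [2, 1] $ for $ \lambda = 3 $ (up to scaling).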

Eigenvalue decomposition is widely used in **PCA (Principal Component Analysis)** for dimensionality reduction.
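
The eigenvalues of the example matrix can be checked numerically. This is a sketch assuming NumPy; `np.linalg.eig` returns the eigenvalues and a matrix whose columns are unit-length eigenvectors, in no guaranteed order.

```python
import numpy as np

A = np.array([[4.0, -2.0], [1.0, 1.0]])

eigenvalues, eigenvectors = np.linalg.eig(A)
print(np.sort(eigenvalues))   # approximately [2. 3.]

# Each column v of `eigenvectors` satisfies A v = lambda v
for lam, v in zip(eigenvalues, eigenvectors.T):
    assert np.allclose(A @ v, lam * v)

# Rebuild A from its decomposition: A = V diag(lambda) V^{-1}
reconstructed = eigenvectors @ np.diag(eigenvalues) @ np.linalg.inv(eigenvectors)
```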

---

Eigenvalues and eigenvectors are fundamental concepts in linear algebra.

An eigenvector of a square matrix A is a non-zero vector v that, when multiplied by A, results in a scaled version of itself. In other words, it changes only in scale (not in the line it lies along) when the linear transformation is applied to it. The scaling factor is called the eigenvalue.

The eigenvalue is a scalar that indicates how much the eigenvector is stretched or compressed during the transformation.

When a matrix A transforms an eigenvector v, the result lies along the same line as the original vector, scaled by the eigenvalue λ (a negative λ flips it):

A·v = λv

- If λ = 2, the eigenvector is stretched to twice its original length
- If λ = 0.5, the eigenvector is compressed to half its original length
- If λ = -1, the eigenvector is reversed and maintains its length
- If λ = 1, the eigenvector remains unchanged

So the eigenvalue tells you the scaling factor applied to its corresponding eigenvector when the transformation occurs.

In simpler terms, when you apply the transformation represented by matrix A to its eigenvector, the vector stays on the same line: it only stretches, shrinks, or flips, by the factor given by the eigenvalue.

Key properties:
- For an n×n matrix, there are at most n distinct eigenvalues (exactly n when counted with multiplicity over the complex numbers)
- Eigenvectors corresponding to different eigenvalues are linearly independent
- The determinant of a matrix equals the product of its eigenvalues
- The trace (sum of diagonal elements) equals the sum of eigenvalues

Practical applications include:
- Principal Component Analysis in data science
- Solving differential equations
- Quantum mechanics (where eigenvalues represent observable quantities)
- Stability analysis in dynamic systems
- Google's PageRank algorithm

To find eigenvalues, you solve the characteristic equation: det(A - λI) = 0
Then, for each eigenvalue, you solve (A - λI)v = 0 to find the corresponding eigenvectors.
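
The two-step procedure can be sketched for a 2×2 matrix, where the characteristic equation reduces to λ² − trace(A)·λ + det(A) = 0. This assumes NumPy and real eigenvalues; the matrix is just an example, and taking the last right singular vector of (A − λI) is one of several ways to get a null-space vector.

```python
import numpy as np

A = np.array([[4.0, -2.0], [1.0, 1.0]])

# Step 1: roots of the characteristic polynomial (2x2 case only)
trace_A = np.trace(A)
det_A = np.linalg.det(A)
eigenvalues = np.real(np.roots([1.0, -trace_A, det_A]))  # real for this matrix

# Step 2: for each eigenvalue, find a non-zero v with (A - lambda I) v = 0.
# The right singular vector for the (near-)zero singular value spans that null space.
eigenvectors = []
for lam in eigenvalues:
    _, _, Vt = np.linalg.svd(A - lam * np.eye(2))
    eigenvectors.append(Vt[-1])

# Properties from the list above
product_matches_det = np.isclose(np.prod(eigenvalues), det_A)
sum_matches_trace = np.isclose(np.sum(eigenvalues), trace_A)
```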

---

### **Advanced Questions**  

#### **7. What is the Moore-Penrose pseudoinverse, and why is it useful?**  
✅ **Answer:**  
The **Moore-Penrose pseudoinverse** ($A^+$) is a **generalized inverse** used for **non-square** or **singular matrices** where normal inversion does not work. It is useful in **least squares regression** when solving  

$
Ax = b
$  

If $ A $ is **not invertible** (or not square), the pseudoinverse gives the **least-squares** solution (the minimum-norm one when several exist):  

$
x = A^+ b
$  

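When $ A $ has **full column rank**, it can be computed as:  

$
A^+ = (A^T A)^{-1} A^T
$  

In general (including rank-deficient $ A $), it is computed from the SVD. The full-column-rank formula is the one used in **linear regression** (the normal equations) to compute the least-squares weights.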
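
A small least-squares sketch, assuming NumPy. The data (three points fitted with an intercept and a slope) is invented for illustration. Because this `A` has full column rank, `np.linalg.pinv` and the formula $(A^T A)^{-1} A^T$ agree; for rank-deficient matrices only the SVD-based `pinv` is safe.

```python
import numpy as np

# Overdetermined system: 3 equations, 2 unknowns (intercept and slope)
A = np.array([[1.0, 0.0],
              [1.0, 1.0],
              [1.0, 2.0]])
b = np.array([1.0, 2.0, 2.0])

A_pinv = np.linalg.pinv(A)                    # SVD-based pseudoinverse
A_pinv_normal = np.linalg.inv(A.T @ A) @ A.T  # valid only for full column rank

x = A_pinv @ b                                # least-squares solution
x_lstsq, *_ = np.linalg.lstsq(A, b, rcond=None)

print(x)   # approximately [1.1667 0.5]
```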

---

#### **8. What is Singular Value Decomposition (SVD) and how is it used?**  
✅ **Answer:**  
**Singular Value Decomposition (SVD)** decomposes a matrix into three matrices:  

$
A = U \Sigma V^T
$  

where:  

- $ U $ and $ V^T $ are **orthogonal matrices**.  
- $ \Sigma $ is a (possibly rectangular) **diagonal matrix** of non-negative **singular values**, conventionally sorted in descending order.  

Uses:  
- **Dimensionality reduction** (PCA).  
- **Feature extraction** in NLP (Latent Semantic Analysis).  
- **Solving linear systems** and **matrix compression**.

---

Singular Value Decomposition (SVD) is a way to break down any matrix into three simpler components. Think of it as factoring a number, but for matrices.

For a matrix A, SVD gives you: A = USVᵀ (S is the same matrix written as Σ above) where:

- U is a matrix containing "output directions" (left singular vectors)
- S is a diagonal matrix with "stretching factors" (singular values)
- Vᵀ is a matrix containing "input directions" (right singular vectors, transposed)

In simple terms, SVD reveals:
1. Which directions in your input space are most important (V)
2. How much stretching happens in each direction (S)
3. Where those stretched directions end up in the output space (U)

A real-world analogy: Imagine you're transforming a sphere of clay into an ellipsoid. SVD tells you:
- Which directions to push the clay (V)
- How hard to push in each direction (S)
- Where those pushes send the clay (U)

SVD is powerful because it works for ANY matrix (not just square ones) and reveals the most important components of your transformation. This makes it useful for data compression, noise reduction, recommender systems, and image processing.
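
A minimal sketch of SVD on a non-square matrix, assuming NumPy. The 2×3 matrix is an arbitrary example; `np.linalg.svd` returns the singular values as a 1D array `s` rather than a full matrix. Keeping only the largest singular value gives the best rank-1 approximation, which is the idea behind SVD-based compression.

```python
import numpy as np

A = np.array([[3.0, 2.0, 2.0],
              [2.0, 3.0, -2.0]])

U, s, Vt = np.linalg.svd(A, full_matrices=False)
print(s)   # approximately [5. 3.]

# Reconstruct A from the three factors
reconstructed = U @ np.diag(s) @ Vt

# Rank-1 approximation: keep only the largest singular value
A_rank1 = s[0] * np.outer(U[:, 0], Vt[0])

# The Frobenius error of the rank-1 approximation equals the dropped singular value
error = np.linalg.norm(A - A_rank1)
```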

---

#### **9. What is the Frobenius norm of a matrix, and how is it computed?**  
✅ **Answer:**  
The **Frobenius norm** of a matrix $ A $ is a measure of its **magnitude**, computed as:  

$
\|A\|_F = \sqrt{\sum_{i,j} A_{ij}^2}
$  

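It treats the matrix as one long vector and takes its Euclidean length. In machine learning it appears, for example, in L2 weight regularization and in measuring the reconstruction error between two matrices.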

Example:  
For  

$
A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}
$

$
\|A\|_F = \sqrt{1^2 + 2^2 + 3^2 + 4^2} = \sqrt{30}
$  
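
The same value can be computed directly from the definition or with `np.linalg.norm`. This is a small sketch assuming NumPy.

```python
import numpy as np

A = np.array([[1.0, 2.0], [3.0, 4.0]])

fro_manual = np.sqrt(np.sum(A ** 2))        # square root of the sum of squared entries
fro_builtin = np.linalg.norm(A, "fro")      # built-in Frobenius norm

# It also equals the square root of the sum of squared singular values
singular_values = np.linalg.svd(A, compute_uv=False)
fro_from_svd = np.sqrt(np.sum(singular_values ** 2))

print(fro_manual)   # sqrt(30), about 5.477
```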

---



# Additional Questions
---

### **1. What is a transpose of a matrix?**
✅ **Answer:**  
The **transpose** of a matrix $ A $, denoted as $ A^T $, is obtained by **swapping rows and columns**.

Example:  
$
A = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{bmatrix}
$
The **transpose** $ A^T $ is:
$
A^T = \begin{bmatrix} 1 & 4 \\ 2 & 5 \\ 3 & 6 \end{bmatrix}
$
📌 **Why is it useful?**  
- In machine learning, **feature vectors** are often represented as **column vectors**, and transposing helps in matrix operations like dot products.  
- Used in computing **covariance matrices** in statistics.
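
In NumPy (assumed here), the transpose is available as the `.T` attribute. The product `A @ A.T` is included as a small illustration of why transposes show up in covariance-style computations.

```python
import numpy as np

A = np.array([[1, 2, 3],
              [4, 5, 6]])

A_T = A.T
print(A.shape, A_T.shape)   # (2, 3) (3, 2)

# Multiplying a matrix by its transpose always gives a square, symmetric matrix
gram = A @ A_T
```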

---

### **2. What is a diagonal matrix?**
✅ **Answer:**  
A **diagonal matrix** is a square matrix where all the **non-diagonal elements** are **zero**.

Example:
$
D = \begin{bmatrix} 5 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 2 \end{bmatrix}
$
📌 **Why is it useful?**  
- **Fast computation**: Matrix operations (like multiplication, inversion) are much faster for diagonal matrices.  
- **Eigenvalues** of a diagonal matrix are just the diagonal elements.
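
Both points can be sketched with NumPy (assumed). The vector `x` is a made-up input: multiplying by a diagonal matrix is just element-wise scaling, and the inverse is the diagonal of reciprocals (provided no diagonal entry is zero).

```python
import numpy as np

d = np.array([5.0, 3.0, 2.0])
D = np.diag(d)

x = np.array([1.0, 2.0, 3.0])
scaled = D @ x                      # same as d * x, without a full matrix product

D_inv = np.diag(1.0 / d)            # inverse: reciprocal of each diagonal entry

eigenvalues = np.linalg.eigvals(D)  # the diagonal entries, in some order
```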

---

### **3. What is the dot product of two vectors?**
✅ **Answer:**  
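The **dot product** (also called the **inner product**) of two vectors is the sum of the products of their corresponding components. It is related to the angle between them by $ \mathbf{a} \cdot \mathbf{b} = \|\mathbf{a}\| \|\mathbf{b}\| \cos\theta $, so it is often used to **measure their similarity**.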

For two vectors $ \mathbf{a} $ and $ \mathbf{b} $:
$
\mathbf{a} \cdot \mathbf{b} = a_1b_1 + a_2b_2 + \dots + a_n b_n
$
Example:
$
\mathbf{a} = [1, 2, 3], \quad \mathbf{b} = [4, 5, 6]
$
$
\mathbf{a} \cdot \mathbf{b} = (1 \times 4) + (2 \times 5) + (3 \times 6) = 4 + 10 + 18 = 32
$

📌 **Why is it useful?**  
- In **ML and deep learning**, dot products are used in **neural networks** and **cosine similarity** (for NLP tasks).  
- **Projection of vectors** and angle measurement.
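
A short sketch of the dot product and the cosine similarity built on it, assuming NumPy. Cosine similarity divides the dot product by both vector lengths, so it depends only on the angle between the vectors.

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 5.0, 6.0])

dot = np.dot(a, b)                                       # 1*4 + 2*5 + 3*6 = 32
cos_sim = dot / (np.linalg.norm(a) * np.linalg.norm(b))  # cosine of the angle

print(dot, cos_sim)
```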

---

### **4. What is matrix multiplication, and when is it possible?**
✅ **Answer:**  
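**Matrix multiplication** combines two matrices by taking the dot product of each row of the first with each column of the second. It is **only possible** if the **number of columns of the first matrix** matches the **number of rows of the second matrix**.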

If $ A $ is $ m \times n $ and $ B $ is $ n \times p $, then the result $ C = A \times B $ will be an **$ m \times p $ matrix**.

Example:  
$
A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}, \quad B = \begin{bmatrix} 5 & 6 \\ 7 & 8 \end{bmatrix}
$
$
C = A \times B = \begin{bmatrix} (1 \times 5 + 2 \times 7) & (1 \times 6 + 2 \times 8) \\ (3 \times 5 + 4 \times 7) & (3 \times 6 + 4 \times 8) \end{bmatrix}
$
$
C = \begin{bmatrix} 19 & 22 \\ 43 & 50 \end{bmatrix}
$

📌 **Why is it useful?**  
- Used in **deep learning** for **forward propagation** in neural networks.  
- Essential in **linear transformations** in ML.
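
The example above can be reproduced with NumPy's `@` operator (NumPy assumed). The sketch also shows two practical points: the order of the factors matters, and incompatible shapes raise an error. The matrices `X` and `Y` are made up for the shape check.

```python
import numpy as np

A = np.array([[1, 2], [3, 4]])
B = np.array([[5, 6], [7, 8]])

C = A @ B   # [[19, 22], [43, 50]]
D = B @ A   # [[23, 34], [31, 46]]: matrix multiplication is not commutative

# (2 x 3) times (2 x 3) is not defined: inner dimensions 3 and 2 differ
X = np.ones((2, 3))
Y = np.ones((2, 3))
try:
    X @ Y
    shape_error = False
except ValueError:
    shape_error = True
```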

---

### **5. What is the difference between row vector and column vector?**
✅ **Answer:**  
- A **row vector** is a **1 × n** matrix. Example:  
  $
  \mathbf{v} = \begin{bmatrix} 2 & 3 & 5 \end{bmatrix}
  $
- A **column vector** is an **n × 1** matrix. Example:  
  $
  \mathbf{v} = \begin{bmatrix} 2 \\ 3 \\ 5 \end{bmatrix}
  $

📌 **Why is it useful?**  
- In ML, **feature vectors** are usually **column vectors**.  
- Transposing row vectors helps in **dot products** and **matrix multiplication**.
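
In NumPy (assumed here), the distinction shows up in the array shape. A plain 1D array is neither a row nor a column, which is a common source of shape bugs, so this sketch builds explicit 2D versions.

```python
import numpy as np

row = np.array([[2, 3, 5]])   # shape (1, 3): row vector
col = row.T                   # shape (3, 1): column vector
flat = np.array([2, 3, 5])    # shape (3,): 1D array, transposing it does nothing

inner = row @ col             # (1, 3) @ (3, 1) -> 1 x 1 matrix holding the dot product
outer = col @ row             # (3, 1) @ (1, 3) -> 3 x 3 outer product

print(row.shape, col.shape, flat.shape, flat.T.shape)
```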

---

### **6. What is a symmetric matrix?**
✅ **Answer:**  
A **symmetric matrix** is a square matrix where $ A^T = A $, meaning the entries are **mirrored** across the main diagonal.

Example:
$
A = \begin{bmatrix} 1 & 2 & 3 \\ 2 & 4 & 5 \\ 3 & 5 & 6 \end{bmatrix}
$
Here, $ A_{ij} = A_{ji} $ (e.g., $ A_{12} = A_{21} $).

📌 **Why is it useful?**  
- Used in **covariance matrices** in statistics and ML.  
- Simplifies **eigenvalue decomposition**: a real symmetric matrix has real eigenvalues and orthogonal eigenvectors.
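
A minimal check of symmetry and of the eigendecomposition of a symmetric matrix, assuming NumPy. `np.linalg.eigh` is the routine intended for symmetric (Hermitian) input; it returns real eigenvalues in ascending order and orthonormal eigenvectors.

```python
import numpy as np

A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 5.0],
              [3.0, 5.0, 6.0]])

is_symmetric = np.allclose(A, A.T)

eigenvalues, Q = np.linalg.eigh(A)

# Eigenvectors are orthonormal, so A = Q diag(lambda) Q^T
orthonormal = np.allclose(Q.T @ Q, np.eye(3))
reconstructed = Q @ np.diag(eigenvalues) @ Q.T
```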

---

### **7. What is an orthogonal matrix?**
✅ **Answer:**  
A **square matrix is orthogonal** if its **transpose equals its inverse** ($ A^T = A^{-1} $), that is:  
$
A^T A = A A^T = I
$
Example:
$
A = \begin{bmatrix} \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \\ -\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \end{bmatrix}
$
📌 **Why is it useful?**  
- Used in **dimensionality reduction** (PCA).  
- Numerically convenient: the inverse is just the transpose, and multiplying by an orthogonal matrix preserves vector lengths and angles.
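
The example matrix can be checked numerically. This sketch assumes NumPy; the vector `x` is made up to show that an orthogonal matrix does not change a vector's length.

```python
import numpy as np

s = 1.0 / np.sqrt(2.0)
A = np.array([[s, s],
              [-s, s]])

is_orthogonal = np.allclose(A.T @ A, np.eye(2))
inverse_is_transpose = np.allclose(np.linalg.inv(A), A.T)

x = np.array([3.0, 4.0])
length_before = np.linalg.norm(x)
length_after = np.linalg.norm(A @ x)   # unchanged by the transformation
```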

---

### **8. What is the trace of a matrix?**
✅ **Answer:**  
The **trace** of a square matrix $ A $ is the **sum of its diagonal elements**.

Example:
$
A = \begin{bmatrix} 2 & 3 \\ 4 & 5 \end{bmatrix}
$
$
\text{trace}(A) = 2 + 5 = 7
$

📌 **Why is it useful?**  
- The trace of a **covariance matrix** is the total variance of the data.  
- It equals the **sum of the eigenvalues**, which gives a quick check in eigenvalue computations.
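
A small sketch with NumPy (assumed). Besides the trace itself, it checks two standard identities: the trace equals the sum of the eigenvalues, and trace(AB) = trace(BA). The matrix `B` is an arbitrary example.

```python
import numpy as np

A = np.array([[2.0, 3.0], [4.0, 5.0]])
B = np.array([[1.0, 2.0], [0.0, 1.0]])

trace_A = np.trace(A)                         # 2 + 5 = 7
eigenvalue_sum = np.sum(np.linalg.eigvals(A))

trace_AB = np.trace(A @ B)
trace_BA = np.trace(B @ A)
```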

---

### **9. What is a unit vector?**
✅ **Answer:**  
A **unit vector** is a vector with a **magnitude (length) of 1**.

If $ \mathbf{v} = [x_1, x_2, \dots, x_n] $ is a non-zero vector, its unit vector is:
$
\mathbf{u} = \frac{\mathbf{v}}{\|\mathbf{v}\|}
$
where $ \|\mathbf{v}\| $ is the **Euclidean norm**:
$
\|\mathbf{v}\| = \sqrt{x_1^2 + x_2^2 + \dots + x_n^2}
$

Example:  
For $ \mathbf{v} = [3, 4] $:
$
\|\mathbf{v}\| = \sqrt{3^2 + 4^2} = \sqrt{9 + 16} = 5
$
$
\mathbf{u} = \left[\frac{3}{5}, \frac{4}{5}\right] = [0.6, 0.8]
$

📌 **Why is it useful?**  
- Used in **cosine similarity** in NLP.  
- Encodes a pure **direction**: for example, the normalized gradient gives the direction of steepest ascent in ML optimization.
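
A minimal normalization helper, assuming NumPy. The function name `normalize` and the `eps` guard against division by zero are illustrative choices, not part of the original notes.

```python
import numpy as np

def normalize(v, eps=1e-12):
    """Return v scaled to unit length; reject (near-)zero vectors."""
    norm = np.linalg.norm(v)
    if norm < eps:
        raise ValueError("cannot normalize a zero vector")
    return v / norm

v = np.array([3.0, 4.0])
u = normalize(v)
print(u)   # [0.6 0.8]
```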

---
