# Eigenvalues and Eigenvectors

## Definition

Given a square matrix \( A \) of size \( n \times n \), a nonzero vector \( \mathbf{v} \) is an **eigenvector** of \( A \) with corresponding **eigenvalue** \( \lambda \) if it satisfies:

$$
A \mathbf{v} = \lambda \mathbf{v}
$$

Here:
- \( A \) is the square matrix.
- \( \mathbf{v} \) is the eigenvector.
- \( \lambda \) is the eigenvalue.
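
As a quick numerical check of the definition, the sketch below tests whether a candidate pair satisfies \( A \mathbf{v} = \lambda \mathbf{v} \). The matrix and the candidate pair are illustrative values chosen here, not taken from the text above.

```python
import numpy as np

# Illustrative 2x2 matrix (an assumption made for this example).
A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
v = np.array([1.0, 1.0])   # candidate eigenvector
lam = 3.0                  # candidate eigenvalue

# Both sides of A v = lambda v should agree entry by entry.
lhs = A @ v
rhs = lam * v
is_eigenpair = np.allclose(lhs, rhs)
print(lhs, rhs, is_eigenpair)
```

Multiplying by \( A \) only rescales \( \mathbf{v} \); it does not change its direction, which is exactly what the definition requires.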

### Eigenvalue Equation

The above equation can also be written as:

$$
(A - \lambda I) \mathbf{v} = 0
$$

Where:
- \( I \) is the identity matrix of the same size as \( A \).
- \( \lambda \) is the eigenvalue, and \( \mathbf{v} \) is the corresponding eigenvector.

For a non-trivial solution (\( \mathbf{v} \neq 0 \)) to exist, the matrix \( (A - \lambda I) \) must be singular, so its determinant must be zero. This leads to the **characteristic equation**:

$$
\det(A - \lambda I) = 0
$$

### Characteristic Polynomial

The determinant \( \det(A - \lambda I) \) is a **polynomial** in \( \lambda \), called the **characteristic polynomial** of matrix \( A \). The degree of this polynomial is equal to the size of the matrix, \( n \).

- For a matrix \( A \) of size \( n \times n \), the characteristic polynomial is:
  
  $$
  p(\lambda) = \det(A - \lambda I)
  $$

The eigenvalues of \( A \) are the roots of this polynomial, that is, the solutions of \( p(\lambda) = 0 \). Counted with multiplicity there are exactly \( n \) of them, and some may be complex even when \( A \) is real.
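
For a \( 2 \times 2 \) matrix the characteristic polynomial can be written out by hand as \( \lambda^2 - \operatorname{tr}(A)\,\lambda + \det(A) \). The sketch below builds those coefficients and solves the quadratic. The helper names `char_poly_2x2` and `real_roots` are hypothetical, and the example assumes the roots are real.

```python
import math

def char_poly_2x2(a, b, c, d):
    """Coefficients of lambda^2 - trace*lambda + det for the matrix [[a, b], [c, d]]."""
    trace = a + d
    det = a * d - b * c
    return 1.0, -trace, det

def real_roots(c2, c1, c0):
    """Real roots of c2*x^2 + c1*x + c0, larger root first (assumes they are real)."""
    disc = c1 * c1 - 4.0 * c2 * c0
    if disc < 0:
        raise ValueError("complex roots: this sketch only handles real eigenvalues")
    r = math.sqrt(disc)
    return (-c1 + r) / (2.0 * c2), (-c1 - r) / (2.0 * c2)

# Illustrative matrix [[2, 1], [1, 2]].
coeffs = char_poly_2x2(2.0, 1.0, 1.0, 2.0)
eigenvalues = real_roots(*coeffs)
print(coeffs, eigenvalues)
```

For larger matrices, expanding the determinant symbolically becomes impractical, and numerical libraries compute eigenvalues by other means.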

### Eigenvalue and Eigenvector Problem

Once the eigenvalues \( \lambda \) are found by solving the characteristic equation, the corresponding **eigenvectors** can be found by solving the system of linear equations:

$$
(A - \lambda I) \mathbf{v} = 0
$$

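This is a homogeneous system of linear equations. Every nonzero solution \( \mathbf{v} \) is an eigenvector for the eigenvalue \( \lambda \), and the full solution set (the null space of \( A - \lambda I \)) is the eigenspace of \( \lambda \). Eigenvectors are therefore determined only up to a nonzero scalar multiple.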
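
One way to solve this system numerically is to take the null space of \( A - \lambda I \) from its singular value decomposition. The sketch below is a minimal version that assumes the eigenspace is one-dimensional; `eigenvector_for` is a hypothetical helper name and the matrix is illustrative.

```python
import numpy as np

def eigenvector_for(A, lam):
    """Return a unit vector spanning the null space of (A - lam*I).

    Assumes lam is an eigenvalue of A with a one-dimensional eigenspace.
    """
    n = A.shape[0]
    M = A - lam * np.eye(n)
    # Rows of vt are right singular vectors, ordered by decreasing singular
    # value; the last one corresponds to the (near-)zero singular value.
    _, _, vt = np.linalg.svd(M)
    return vt[-1]

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
v3 = eigenvector_for(A, 3.0)
v1 = eigenvector_for(A, 1.0)
print(v3, v1)
```

The sign of each returned vector is arbitrary, which reflects the fact that any nonzero multiple of an eigenvector is also an eigenvector.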

### Eigenvalue Decomposition

Eigenvalue decomposition expresses a matrix \( A \) in terms of its eigenvalues and eigenvectors. If \( A \) has \( n \) linearly independent eigenvectors, it can be decomposed as:

$$
A = V \Lambda V^{-1}
$$

Where:
- \( V \) is the matrix whose columns are the \( n \) linearly independent eigenvectors of \( A \).
- \( \Lambda \) is a diagonal matrix with the eigenvalues of \( A \) on the diagonal, in the same order as the corresponding columns of \( V \).
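
The decomposition can be checked numerically with NumPy's `np.linalg.eig`, which returns the eigenvalues and a matrix whose columns are the matching eigenvectors. The matrix below is an illustrative choice with two distinct real eigenvalues.

```python
import numpy as np

A = np.array([[4.0, 1.0],
              [2.0, 3.0]])

# eigenvalues[i] corresponds to the column V[:, i].
eigenvalues, V = np.linalg.eig(A)
Lambda = np.diag(eigenvalues)

# Rebuild A from its eigenvalues and eigenvectors.
A_rebuilt = V @ Lambda @ np.linalg.inv(V)
print(np.allclose(A_rebuilt, A))
```

The order in which `np.linalg.eig` returns eigenvalues is not guaranteed to be sorted, so code that needs a particular order should sort the pairs explicitly.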

### Spectral Theorem

For a real symmetric matrix \( A \) (i.e., \( A = A^T \)), the **spectral theorem** states that all eigenvalues are real and \( A \) can be diagonalized as:

$$
A = Q \Lambda Q^T
$$

Where:
- \( Q \) is an orthogonal matrix (i.e., \( Q^{-1} = Q^T \)) whose columns are orthonormal eigenvectors of \( A \).
- \( \Lambda \) is a diagonal matrix of the (real) eigenvalues.
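
For symmetric input, NumPy provides `np.linalg.eigh`, which returns real eigenvalues in ascending order together with orthonormal eigenvectors. The sketch below checks both properties of the factorization on an illustrative symmetric matrix.

```python
import numpy as np

# Illustrative real symmetric matrix.
A = np.array([[2.0, 1.0, 0.0],
              [1.0, 2.0, 1.0],
              [0.0, 1.0, 2.0]])

eigenvalues, Q = np.linalg.eigh(A)

# Q should be orthogonal, so Q^T Q is the identity.
is_orthogonal = np.allclose(Q.T @ Q, np.eye(3))

# A should equal Q Lambda Q^T, with no matrix inverse needed.
is_rebuilt = np.allclose(Q @ np.diag(eigenvalues) @ Q.T, A)
print(eigenvalues, is_orthogonal, is_rebuilt)
```

Because \( Q^{-1} = Q^T \), the symmetric case avoids the explicit inversion that the general decomposition requires.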

### Diagonalization

A matrix \( A \) is said to be **diagonalizable** if there exist an invertible matrix \( P \) and a diagonal matrix \( D \) such that:

$$
A = P D P^{-1}
$$

Where:
- \( P \) is the matrix whose columns are eigenvectors of \( A \).
- \( D \) is a diagonal matrix containing the corresponding eigenvalues.

This is the same factorization as the eigenvalue decomposition, with \( P = V \) and \( D = \Lambda \):

$$
A = V \Lambda V^{-1}
$$
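
One practical consequence of this factorization is that powers of \( A \) reduce to powers of the diagonal entries, since \( A^k = P D^k P^{-1} \) once the inner \( P^{-1} P \) factors cancel. The sketch below illustrates this under the assumption that \( A \) is diagonalizable; `power_via_diagonalization` is a hypothetical helper name.

```python
import numpy as np

def power_via_diagonalization(A, k):
    """Compute A**k as P D**k P^{-1}, assuming A is diagonalizable."""
    eigenvalues, P = np.linalg.eig(A)
    D_k = np.diag(eigenvalues ** k)   # raise only the diagonal entries to the k-th power
    return P @ D_k @ np.linalg.inv(P)

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
A5 = power_via_diagonalization(A, 5)
A5_direct = np.linalg.matrix_power(A, 5)   # reference result by repeated multiplication
print(np.allclose(A5, A5_direct))
```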

### Power Method (for Largest Eigenvalue)

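One iterative method to find the dominant eigenvalue (the one of largest absolute value) is the **power method**. Given a nonzero initial vector \( \mathbf{v_0} \), the power method repeatedly multiplies by \( A \) and normalizes:

$$
\mathbf{v_{k+1}} = \frac{A \mathbf{v_k}}{\| A \mathbf{v_k} \|}
$$

Normalization keeps the iterates from overflowing or underflowing without changing their direction. If \( A \) has a single eigenvalue of strictly largest absolute value and \( \mathbf{v_0} \) has a nonzero component along its eigenvector, the direction of \( \mathbf{v_k} \) converges to that eigenvector. The corresponding eigenvalue can be approximated by the Rayleigh quotient:

$$
\lambda \approx \frac{\mathbf{v_k}^T A \mathbf{v_k}}{\mathbf{v_k}^T \mathbf{v_k}}
$$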
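
The iteration can be sketched in a few lines. This version normalizes the vector at every step so its entries stay bounded, and stops when the eigenvalue estimate stops changing. The function name, the tolerance, the iteration cap, and the starting vector are all assumptions made for this example.

```python
import numpy as np

def power_method(A, v0, num_iters=1000, tol=1e-12):
    """Estimate the dominant eigenvalue and eigenvector of A by power iteration.

    Assumes A has a single eigenvalue of largest absolute value and that v0
    is not orthogonal to its eigenvector.
    """
    v = v0 / np.linalg.norm(v0)
    lam = v @ A @ v                      # Rayleigh quotient (v has unit norm)
    for _ in range(num_iters):
        w = A @ v
        norm = np.linalg.norm(w)
        if norm == 0.0:                  # v lies in the null space of A
            break
        v = w / norm
        lam_new = v @ A @ v
        converged = abs(lam_new - lam) < tol
        lam = lam_new
        if converged:
            break
    return lam, v

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
lam, v = power_method(A, v0=np.array([1.0, 0.0]))
print(lam, v)
```

The rate of convergence depends on the ratio between the two largest eigenvalue magnitudes; when that ratio is close to 1, many more iterations are needed.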

### Singular Value Decomposition (SVD)

Although it is not itself an eigendecomposition, **Singular Value Decomposition (SVD)** is a closely related concept often used in data science. Unlike eigendecomposition, it applies to any \( m \times n \) matrix \( A \), square or not, and decomposes it into three matrices:

$$
A = U \Sigma V^T
$$

Where:
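- \( U \) (of size \( m \times m \)) and \( V \) (of size \( n \times n \)) are orthogonal matrices whose columns are eigenvectors of \( A A^T \) and \( A^T A \), respectively.
- \( \Sigma \) is an \( m \times n \) rectangular diagonal matrix containing the singular values of \( A \). The nonzero singular values are the square roots of the nonzero eigenvalues of \( A^T A \), which coincide with the nonzero eigenvalues of \( A A^T \).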
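
The link between singular values and eigenvalues can be verified numerically. The sketch below uses `np.linalg.svd` and `np.linalg.eigvalsh` on an illustrative \( 3 \times 2 \) matrix chosen for this example.

```python
import numpy as np

A = np.array([[3.0, 0.0],
              [4.0, 5.0],
              [0.0, 1.0]])

# Reduced SVD: U is 3x2, s holds the singular values (descending), Vt is 2x2.
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Eigenvalues of the symmetric matrices A^T A and A A^T, sorted descending.
eig_AtA = np.linalg.eigvalsh(A.T @ A)[::-1]
eig_AAt = np.linalg.eigvalsh(A @ A.T)[::-1]

# Clip guards against tiny negative values caused by rounding.
sv_from_AtA = np.sqrt(np.clip(eig_AtA, 0.0, None))
sv_from_AAt = np.sqrt(np.clip(eig_AAt[:2], 0.0, None))  # drop the extra zero eigenvalue
print(s, sv_from_AtA, sv_from_AAt)
```

In practice the SVD is computed directly rather than through \( A^T A \), because forming that product squares the condition number and loses accuracy.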

### Applications in Data Science

Eigenvalues and eigenvectors play a critical role in many machine learning and data science techniques:
1. **Principal Component Analysis (PCA)**: PCA computes the eigenvalues and eigenvectors of the data's covariance matrix and reduces dimensionality by projecting onto the top \( k \) eigenvectors, those corresponding to the largest eigenvalues (the directions of greatest variance).
2. **Spectral Clustering**: Uses the eigenvectors of a similarity matrix, or of a graph Laplacian derived from it, to embed the data before clustering.
3. **Latent Semantic Analysis (LSA)**: Applies a truncated SVD to a term-document matrix for dimensionality reduction in text mining and information retrieval.
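
As an illustration of the first item, the sketch below implements PCA through the eigendecomposition of the sample covariance matrix. It is a minimal version under stated assumptions: the `pca` helper is hypothetical, the data is synthetic, and production code would typically rely on a library implementation instead.

```python
import numpy as np

def pca(X, k):
    """Project the rows of X onto the top-k eigenvectors of the covariance matrix."""
    X_centered = X - X.mean(axis=0)
    cov = np.cov(X_centered, rowvar=False)          # columns are features
    eigenvalues, eigenvectors = np.linalg.eigh(cov) # ascending order
    order = np.argsort(eigenvalues)[::-1][:k]       # indices of the k largest
    components = eigenvectors[:, order]
    return X_centered @ components, eigenvalues[order], components

# Synthetic data: 200 samples, 3 features with very different variances.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 3)) @ np.diag([3.0, 1.0, 0.1])

Z, top_eigenvalues, components = pca(X, k=2)
print(Z.shape, top_eigenvalues)
```

Each retained eigenvalue equals the sample variance of the data along its eigenvector, which is why keeping the largest eigenvalues preserves the most variance.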

---

## Conclusion

Understanding eigenvalues and eigenvectors is fundamental in many areas of machine learning, data science, and linear algebra. The ability to compute and interpret them enables powerful tools such as PCA, SVD, and spectral methods for clustering, dimensionality reduction, and matrix factorization.
