---
title: 9.1 Eigenvalues of Symmetric Matrices
subject:  Symmetric Matrices
subtitle: factorizing symmetric matrices
short_title: 9.1 Eigenvalues of Symmetric Matrices
authors:
  - name: Nikolai Matni
    affiliations:
      - Dept. of Electrical and Systems Engineering
      - University of Pennsylvania
    email: nmatni@seas.upenn.edu
license: CC-BY-4.0
keywords: 
math:
  '\vv': '\mathbf{#1}'
  '\bm': '\begin{bmatrix}'
  '\em': '\end{bmatrix}'
  '\R': '\mathbb{R}'
---

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/nikolaimatni/ese-2030/HEAD?labpath=/08_Ch_9_Symmetric_Matrices/101-eigen_symm.ipynb)

{doc}`Lecture notes <../lecture_notes/Lecture 16 - Eigenvalues of Symmetric Matrices, Spectral Theorem, Quadratic Forms and Positive Definite Matrices, Optimization Principles for Eigenvalues of Symmetric Matrices.pdf>`

## Reading

Material related to this page, as well as additional exercises, can be found in ALA 8.5.

## Learning Objectives

By the end of this page, you should know:
- the properties of eigenvalues and eigenvectors of symmetric matrices
- the Spectral theorem
- the geometric interpretation of a linear transformation when the matrix is symmetric

## Symmetric Matrix

A square matrix $A$ is said to be symmetric if $A = A^T$. For example, all $2\times 2$ and $3\times 3$ symmetric matrices are of the form:

$$
\begin{bmatrix}
a & b \\
b & c
\end{bmatrix}
\quad \text{and} \quad
\begin{bmatrix}
a & b & c \\
b & d & e \\
c & e & f
\end{bmatrix}
$$

Symmetric matrices arise in many practical contexts. An important class, which we will spend time on next lecture, is that of _covariance matrices_. For now, we simply treat symmetric matrices as an interesting family of matrices.

Symmetric matrices enjoy many interesting properties, including the following one, which will be the focus of this lecture:

:::{prf:theorem}
Let $A = A^T \in \mathbb{R}^{n\times n}$ be a symmetric $n\times n$ matrix. Then:
1. All eigenvalues of $A$ are real.
2. Eigenvectors corresponding to distinct eigenvalues of $A$ are orthogonal.
3. There is an orthonormal basis of $\mathbb{R}^n$ consisting of $n$ eigenvectors of $A$.

In particular, all real symmetric matrices are complete and real diagonalizable.
:::
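As a quick numerical sanity check of the theorem, the following sketch builds a random symmetric matrix and inspects its eigenvalues and eigenvectors with NumPy. The matrix size, the random seed, and the variable names are arbitrary choices made for illustration; this checks a single instance and is not a proof.

```python
import numpy as np

rng = np.random.default_rng(0)   # arbitrary seed, for reproducibility only
n = 5                            # arbitrary size
B = rng.standard_normal((n, n))
A = (B + B.T) / 2                # symmetrize so that A equals its transpose

# Claim 1: a general-purpose eigenvalue solver finds no imaginary parts.
eigvals = np.linalg.eigvals(A)
print("max |imaginary part|:", np.max(np.abs(np.imag(eigvals))))

# Claims 2 and 3: np.linalg.eigh assumes a symmetric input and returns the
# eigenvalues w (in ascending order) and the eigenvectors as the columns of Q.
w, Q = np.linalg.eigh(A)
print("Q^T Q = I:", np.allclose(Q.T @ Q, np.eye(n)))   # orthonormal columns
print("A Q = Q diag(w):", np.allclose(A @ Q, Q * w))   # column j is scaled by w[j]
```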

We'll spend the rest of this lecture exploring the consequences of this remarkable theorem, before diving into applications over the next few lectures.

First, we work through a few simple examples to see this theorem in action.

:::{prf:example}
Consider $A = \begin{bmatrix} 3 & 1 \\ 1 & 3 \end{bmatrix}$, which we have seen in previous examples. It has eigenvalues $\lambda_1 = 4$ and $\lambda_2 = 2$ with corresponding eigenvectors $\vv v_1 = (1,1)$ and $\vv v_2 = (-1,1)$. We easily verify that $\vv v_1^T \vv v_2 = 0$, so the eigenvectors are orthogonal. We construct an orthonormal basis by dividing each eigenvector by its Euclidean norm:

$$
\vv u_1 = \frac{\vv v_1}{\|\vv v_1\|} = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 \\ 1 \end{bmatrix}
\quad \text{and} \quad
\vv u_2 = \frac{\vv v_2}{\|\vv v_2\|} = \frac{1}{\sqrt{2}} \begin{bmatrix} -1 \\ 1 \end{bmatrix}
$$

:::

:::{prf:example} 
Consider the symmetric matrix $A = \begin{bmatrix} 5 & -4 & 2 \\ -4 & 5 & 2 \\ 2 & 2 & -1 \end{bmatrix}$. Computing the eigenvalues/eigenvectors of $A$ (e.g., using `np.linalg.eig`) we see that

$$
\lambda_1 = 9, \vv v_1 = \begin{bmatrix} 1 \\ -1 \\ 0 \end{bmatrix}, \quad
\lambda_2 = 3, \vv v_2 = \begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix}, \quad \text{and} \quad
\lambda_3 = -3, \vv v_3 = \begin{bmatrix} 1 \\ 1 \\ -2 \end{bmatrix}.
$$

You can check that these vectors are pairwise orthogonal: $\vv v_i^T \vv v_j = 0$ for $i \neq j$, and hence form an orthogonal basis for $\mathbb{R}^3$. An orthonormal basis is given by the corresponding unit-norm eigenvectors:

$$
\vv u_1 = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 \\ -1 \\ 0 \end{bmatrix}, \quad
\vv u_2 = \frac{1}{\sqrt{3}} \begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix}, \quad \text{and} \quad
\vv u_3 = \frac{1}{\sqrt{6}} \begin{bmatrix} 1 \\ 1 \\ -2 \end{bmatrix}.
$$
:::
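The hand computations in the $3\times 3$ example can be checked numerically. The sketch below is one possible way to do so; the variable names (`eigenpairs`, `V`, `U`, `gram`) are illustrative choices, and `np.linalg.eigvalsh` is used here as an independent check on the eigenvalues.

```python
import numpy as np

A = np.array([[5.0, -4.0, 2.0],
              [-4.0, 5.0, 2.0],
              [2.0, 2.0, -1.0]])

# Eigenvalue/eigenvector pairs as stated in the example.
eigenpairs = [
    (9.0, np.array([1.0, -1.0, 0.0])),
    (3.0, np.array([1.0, 1.0, 1.0])),
    (-3.0, np.array([1.0, 1.0, -2.0])),
]
for lam, v in eigenpairs:
    print(lam, np.allclose(A @ v, lam * v))   # checks A v = lambda v

V = np.column_stack([v for _, v in eigenpairs])
gram = V.T @ V                      # off-diagonal entries are the products v_i^T v_j
print(gram)                         # diagonal: squared norms 2, 3, 6

U = V / np.linalg.norm(V, axis=0)   # divide each column by its Euclidean norm
print(np.allclose(U.T @ U, np.eye(3)))   # columns of U are orthonormal

w = np.linalg.eigvalsh(A)           # eigenvalues only, in ascending order
print(w)
```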


## The Spectral Theorem

The theorem above tells us that every real, symmetric matrix admits an eigenvector basis, and hence is diagonalizable. Furthermore, we can always choose eigenvectors that form an orthonormal basis—hence, the diagonalizing matrix takes a particularly simple form.

Remember that a matrix $Q \in \mathbb{R}^{n \times n}$ is **orthogonal** if and only if its columns form an orthonormal basis of $\mathbb{R}^n$. Alternatively, we can characterize orthogonal matrices by the condition that $Q^T Q = Q Q^T = I$, i.e., $Q^{-1} = Q^T$.
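To make the two characterizations concrete, here is a small sketch that produces an orthogonal matrix and tests both conditions. Obtaining $Q$ from the QR factorization of a random matrix is just a convenient choice for this illustration, as are the seed and the size.

```python
import numpy as np

rng = np.random.default_rng(1)     # arbitrary seed
M = rng.standard_normal((4, 4))    # a generic square matrix
Q, _ = np.linalg.qr(M)             # the Q factor has orthonormal columns

I = np.eye(4)
print(np.allclose(Q.T @ Q, I))              # columns are orthonormal
print(np.allclose(Q @ Q.T, I))              # rows are orthonormal as well
print(np.allclose(np.linalg.inv(Q), Q.T))   # the inverse equals the transpose

# A consequence of Q^T Q = I: multiplying by Q preserves Euclidean length.
x = rng.standard_normal(4)
print(np.isclose(np.linalg.norm(Q @ x), np.linalg.norm(x)))
```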

If we use this orthonormal eigenbasis when diagonalizing a symmetric matrix $A$, we obtain its _spectral factorization_:

:::{prf:theorem}
:label: spectral_thm
Let $A$ be a real symmetric matrix. Then there exists an orthogonal matrix $Q$ such that
\begin{equation}
\label{ST_eqn}
A = Q \Lambda Q^{-1} = Q \Lambda Q^T \qquad (\text{ST})
\end{equation}
where $\Lambda$ is a real diagonal matrix. The eigenvalues of $A$ appear on the diagonal of $\Lambda$, while the columns of $Q$ are the corresponding orthonormal eigenvectors.
:::
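A minimal sketch of how the spectral factorization might be computed in practice is given below. The helper name `spectral_factorization` and its `tol` parameter are hypothetical, introduced only for this illustration; the helper wraps `np.linalg.eigh`, which returns orthonormal eigenvectors for symmetric input.

```python
import numpy as np

def spectral_factorization(A, tol=1e-12):
    """Return (Q, Lam) with A = Q @ Lam @ Q.T for a real symmetric A.

    Hypothetical helper for illustration; it wraps np.linalg.eigh.
    """
    A = np.asarray(A, dtype=float)
    if not np.allclose(A, A.T, atol=tol):
        raise ValueError("A must be symmetric")
    w, Q = np.linalg.eigh(A)   # eigenvalues ascending, eigenvectors in columns
    return Q, np.diag(w)

A = np.array([[5.0, -4.0, 2.0],
              [-4.0, 5.0, 2.0],
              [2.0, 2.0, -1.0]])
Q, Lam = spectral_factorization(A)

print(np.round(np.diag(Lam), 6))         # eigenvalues on the diagonal of Lambda
print(np.allclose(Q @ Lam @ Q.T, A))     # A = Q Lambda Q^T
print(np.allclose(Q.T @ Q, np.eye(3)))   # Q is orthogonal, so Q^{-1} = Q^T
```

Note that `np.linalg.eigh` orders the eigenvalues from smallest to largest, so the columns of `Q` may appear in a different order (and with different signs) than in a hand computation; the product $Q \Lambda Q^T$ is unaffected.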

:::{note} Historical Remark
The term "spectrum" refers to the eigenvalues of a matrix, or more generally, a linear operator. This terminology originates in physics: the spectral energy lines of atoms, molecules, and nuclei are characterized as the eigenvalues of the governing quantum mechanical Schrödinger operator.
:::

:::{prf:example}
For $A = \begin{bmatrix} 3 & 1 \\ 1 & 3 \end{bmatrix}$ seen above, we build $Q = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 & -1 \\ 1 & 1 \end{bmatrix}$, and write

$$
\begin{bmatrix} 3 & 1 \\ 1 & 3 \end{bmatrix} = A = Q \Lambda Q^T = 
\begin{bmatrix} \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}} \\ 
\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \end{bmatrix}
\begin{bmatrix} 4 & 0 \\ 0 & 2 \end{bmatrix}
\begin{bmatrix} \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \\ 
-\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \end{bmatrix}.
$$
:::
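The factorization in this example is small enough to confirm directly. The following sketch enters $Q$ and $\Lambda$ exactly as written above (the names `Q` and `Lam` are just illustrative) and multiplies them out.

```python
import numpy as np

A = np.array([[3.0, 1.0], [1.0, 3.0]])
Q = np.array([[1.0, -1.0], [1.0, 1.0]]) / np.sqrt(2.0)   # columns u_1 and u_2
Lam = np.diag([4.0, 2.0])                                # lambda_1 and lambda_2

print(np.allclose(Q.T @ Q, np.eye(2)))   # Q is orthogonal
print(np.allclose(Q @ Lam @ Q.T, A))     # the product reproduces A
```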

## Geometric Interpretation 

You can always choose $Q$ so that $\det Q = 1$ (if $\det Q = -1$, flip the sign of one eigenvector); such a $Q$ represents a rotation. Thus, the diagonalization of a symmetric matrix can be interpreted as a rotation of the coordinate system that aligns the coordinate axes with the orthogonal eigenvectors. Consequently, when all eigenvalues of $A$ are positive, the linear transformation $L(\vv x) = A\vv x$ can be interpreted as a combination of stretches along $n$ mutually orthogonal directions.

One way to visualize this is to consider what $L$ does to the unit Euclidean sphere $S = \{ \vv x \in \mathbb{R}^n \mid \|\vv x\| = 1\}$: stretching it in orthogonal directions transforms it into an ellipsoid $E = L(S) = \{ A\vv x \mid \|\vv x\| = 1\}$ whose principal axes are the directions of stretch, i.e., the eigenvectors of $A$.

:::{figure} ../figures/09-ellipse.jpg
:label: ellipse
:alt: Ellipse
:width: 400px
:align: center
:::
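The geometric picture can be explored numerically in two dimensions. The sketch below, under the assumption that we reuse $A = \begin{bmatrix} 3 & 1 \\ 1 & 3 \end{bmatrix}$ from the earlier examples, maps sampled points of the unit circle through $A$ and compares the resulting ellipse with the eigenvalues and eigenvectors. The sampling density and all variable names are illustrative choices.

```python
import numpy as np

A = np.array([[3.0, 1.0], [1.0, 3.0]])
w, Q = np.linalg.eigh(A)           # eigenvalues ascending (2 and 4 for this A)

# Negating an eigenvector keeps it a unit eigenvector but flips the sign of
# det Q, so we can always arrange det Q = +1 (a rotation, not a reflection).
if np.linalg.det(Q) < 0:
    Q[:, 0] = -Q[:, 0]

theta = np.linspace(0.0, 2.0 * np.pi, 721)        # half-degree steps
S = np.vstack([np.cos(theta), np.sin(theta)])     # points on the unit circle
E = A @ S                                         # their images under L(x) = A x
radii = np.linalg.norm(E, axis=0)

# The semi-axis lengths of the ellipse match the eigenvalues ...
print(radii.min(), radii.max())
# ... and the longest axis points along the eigenvector of the largest eigenvalue.
i_max = np.argmax(radii)
print(abs(E[:, i_max] @ Q[:, 1]) / radii[i_max])  # |cosine of the angle|

# In the rotated coordinates y = Q^T (A x), the image satisfies the
# axis-aligned ellipse equation (y_1 / lambda_1)^2 + (y_2 / lambda_2)^2 = 1.
Y = Q.T @ E
print(np.allclose((Y[0] / w[0]) ** 2 + (Y[1] / w[1]) ** 2, 1.0))
```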


