---
title: 7.3 Matrix Exponential
subject: Dynamics
subtitle: 
short_title: 7.3 Matrix Exponential
authors:
  - name: Nikolai Matni
    affiliations:
      - Dept. of Electrical and Systems Engineering
      - University of Pennsylvania
    email: nmatni@seas.upenn.edu
license: CC-BY-4.0
keywords: linear systems, 
math:
  '\vv': '\mathbf{#1}'
  '\bm': '\begin{bmatrix}'
  '\em': '\end{bmatrix}'
  '\R': '\mathbb{R}'
---

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/nikolaimatni/ese-2030/HEAD?labpath=/06_Ch_7_Dynamics/083-Matrix_exponential.ipynb)

{doc}`Lecture notes <../lecture_notes/Lecture 13 - Complex and Repeated Eigenvalues Revisited, Jordan Blocks, Matrix Exponential.pdf>`

## Reading

Material related to this page, as well as additional exercises, can be found in ALA .

## Learning Objectives

By the end of this page, you should know:
- the matrix exponential expressed as a power series

## Matrix Exponential

We've seen four cases for eigenvalues/eigenvectors and their relationship to solutions of initial value problems defined by $\dot{\mathbf{x}} = A\mathbf{x}$ and $\mathbf{x}(0)$ given:

1) real distinct eigenvalues, solved by diagonalization;
2) real repeated eigenvalues with algebraic multiplicity = geometric multiplicity, also solved by diagonalization;
3) complex distinct eigenvalues, solved by diagonalization and applying Euler's formula to define real-valued eigenfunctions;
4) repeated eigenvalues with algebraic multiplicity > geometric multiplicity, solved by Jordan decomposition using generalized eigenvectors.

While correct, the fact that there are four different cases we need to consider is somewhat unsatisfying. In this section, we show that by appropriately defining a _matrix exponential_, we can provide a unified treatment of all the aforementioned settings.

We start by recalling the power series definition for the scalar exponential $e^x$, for $x \in \mathbb{R}$:

\begin{equation}
\label{ps}
e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots = \sum_{k=0}^{\infty} \frac{x^k}{k!}, \quad (\text{PS})
\end{equation}

where we recall that $k! = 1 \cdot 2 \cdots (k-1) \cdot k$. We know that for the scalar initial value problem $\dot{x} = ax$, the solution is $x(t) = e^{at}x(0)$, where $e^{at}$ can be computed via ([PS](#ps)) by setting $x = at$.

Wouldn't it be cool if we could do something similar for the vector valued initial value problem defined by $\dot{\mathbf{\vv x}} = A\mathbf{\vv x}$? Does there exist a function, call it $e^{At}$, so that $\mathbf{x}(t) = e^{At}\mathbf{x}(0)$? How would we even begin to define such a thing?

Let's do the "obvious" thing and start with the definition ([PS](#ps)), replacing the scalar $x$ with a square matrix $X$ (and the scalar $1$ with the identity matrix $I$) to obtain the _matrix exponential of $X$_:

\begin{equation}
\label{MPS}
e^X = I + X + \frac{X^2}{2!} + \frac{X^3}{3!} + \cdots = \sum_{k=0}^{\infty} \frac{X^k}{k!}, \quad (\text{MPS})
\end{equation}
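To make ([MPS](#MPS)) concrete, here is a minimal sketch in Python that approximates $e^X$ by summing finitely many terms of the series. The function name `expm_series` and the truncation length `terms` are our own choices for illustration, and a fixed truncation is only reasonable for matrices of modest size and norm.

```python
import numpy as np


def expm_series(X, terms=30):
    """Approximate e^X by summing the first `terms` terms of (MPS)."""
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    total = np.eye(n)  # k = 0 term: X^0 / 0! = I
    term = np.eye(n)
    for k in range(1, terms):
        term = term @ X / k  # term now holds X^k / k!
        total = total + term
    return total


# For a diagonal matrix, the series acts entry by entry on the diagonal,
# so e^D should be close to diag(e^1, e^2).
D = np.diag([1.0, 2.0])
E = expm_series(D)
print(E)
```

In practice, a library routine such as `scipy.linalg.expm` is likely a better choice than a hand-rolled truncated series.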

Although we won't prove it here, it can be shown that ([MPS](#MPS)) converges for any square matrix $X$, so this is a well-defined object. Does ([MPS](#MPS)) help with solving $\dot{\mathbf{\vv x}} = A\mathbf{\vv x}$? Let's try the test solution $\mathbf{\vv x}(t) = e^{At}\mathbf{\vv x}(0)$. This is exactly what we did for the scalar setting, with $e^{at}$ replaced by $e^{At}$. Is this a solution to $\dot{\mathbf{\vv x}} = A\mathbf{\vv x}$?

First, we compute $A\mathbf{\vv x}(t) = Ae^{At}\mathbf{\vv x}(0)$. Next, we need to compute $\frac{d}{dt}e^{At}\mathbf{\vv x}(0)$. But how do we do this? We will rely on ([MPS](#MPS)):

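\begin{equation}
\begin{aligned}
\frac{d}{dt} e^{At} \mathbf{x}(0) &= \frac{d}{dt} \left(I + At + \frac{(At)^2}{2!} + \frac{(At)^3}{3!} + \frac{(At)^4}{4!} + \cdots\right)\mathbf{x}(0) \\
&= \left(\frac{d}{dt}I + \frac{d}{dt}At + \frac{d}{dt}\frac{A^2t^2}{2!} + \frac{d}{dt}\frac{A^3t^3}{3!} + \frac{d}{dt}\frac{A^4t^4}{4!} + \cdots\right)\mathbf{x}(0) \\
&= \left(0 + A + \frac{2A^2t}{2!} + \frac{3A^3t^2}{3!} + \frac{4A^4t^3}{4!} + \cdots\right)\mathbf{x}(0) \\
&= \left(A + A^2t + \frac{A^3t^2}{2!} + \frac{A^4t^3}{3!} + \cdots\right)\mathbf{x}(0) \\
&= A\left(I + At + \frac{A^2t^2}{2!} + \frac{A^3t^3}{3!} + \cdots\right)\mathbf{x}(0) \\
&= A e^{At} \mathbf{x}(0).
\end{aligned}
\end{equation}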

This works, and we have found a general solution to $\dot{\mathbf{x}} = A\mathbf{x}$ defined in terms of the matrix exponential!

**Theorem:** Consider the initial value problem $\dot{\mathbf{x}} = A\mathbf{x}$, with $\mathbf{x}(0)$ specified. Its solution is given by $\mathbf{x}(t) = e^{At}\mathbf{x}(0)$, where $e^{At}$ is defined according to the matrix power series ([MPS](#MPS)).
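As a quick numerical sanity check of this result, the sketch below compares a centered finite-difference estimate of $\dot{\mathbf{x}}(t)$ with $A\mathbf{x}(t)$ for $\mathbf{x}(t) = e^{At}\mathbf{x}(0)$. The matrix `A`, the initial condition `x0`, the time `t`, and the step `h` are arbitrary values chosen for illustration, and `expm_series` is a hypothetical helper that truncates the matrix power series.

```python
import numpy as np


def expm_series(X, terms=40):
    """Truncated matrix power series for e^X (illustrative helper)."""
    X = np.asarray(X, dtype=float)
    total = np.eye(X.shape[0])
    term = np.eye(X.shape[0])
    for k in range(1, terms):
        term = term @ X / k  # X^k / k!
        total = total + term
    return total


# Example data (assumed, not from the notes).
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
x0 = np.array([1.0, 0.0])


def x(t):
    """Candidate solution x(t) = e^{At} x(0)."""
    return expm_series(A * t) @ x0


t, h = 0.7, 1e-5
dxdt = (x(t + h) - x(t - h)) / (2 * h)  # centered difference for dx/dt
residual = np.linalg.norm(dxdt - A @ x(t))
print(residual)  # should be small if x(t) solves dx/dt = A x
```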

This is very satisfying, as now our scalar and vector-valued problems have similar looking solutions defined in terms of appropriate exponential functions. The only thing that remains is to compute $e^{At}$. How do we do this? This is where all of the work we've done on diagonalization and Jordan forms really pays off!

### Case 1: Real eigenvalues, diagonalizable $A$

Suppose that $A \in \mathbb{R}^{n\times n}$ has eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_n$ with corresponding linearly independent eigenvectors $\mathbf{v}_1, \mathbf{v}_2, \ldots, \mathbf{v}_n$. Then we can write

\begin{equation}
A = V \Lambda V^{-1}, \quad \text{for } V = \begin{bmatrix} \mathbf{v}_1 & \mathbf{v}_2 & \cdots & \mathbf{v}_n \end{bmatrix} \text{ and } \Lambda = \text{diag}(\lambda_1, \lambda_2, \ldots, \lambda_n).
\end{equation}

To compute $e^{At}$ we need to compute the powers $(At)^k$. Let's work a few of these out using $A = V\Lambda V^{-1}$:

\begin{equation}
\begin{aligned}
(At)^0 &= I, \\
At &= V\Lambda V^{-1}t, \\
A^2t^2 &= (V\Lambda V^{-1})(V\Lambda V^{-1})t^2 = V\Lambda^2 V^{-1}t^2, \\
A^3t^3 &= (V\Lambda V^{-1})(V\Lambda^2 V^{-1})t^3 = V\Lambda^3 V^{-1}t^3.
\end{aligned}
\end{equation}

There is a pattern: $(At)^k = V \Lambda^k V^{-1} t^k$. This is nice, since computing powers of diagonal matrices is easy:

\begin{equation}
\Lambda^k = \begin{bmatrix}
\lambda_1 & & \\
& \ddots & \\
& & \lambda_n
\end{bmatrix}^k = \begin{bmatrix}
\lambda_1^k & & \\
& \ddots & \\
& & \lambda_n^k
\end{bmatrix}.
\end{equation}

Let's plug these expressions into ([MPS](#MPS)):

\begin{equation}
\begin{aligned}
e^{At} &= I + At + \frac{A^2t^2}{2!} + \frac{A^3t^3}{3!} + \cdots \\
&= VV^{-1} + V\Lambda V^{-1}t + V\Lambda^2 V^{-1}\frac{t^2}{2!} + V\Lambda^3 V^{-1}\frac{t^3}{3!} + \cdots \\
&= V\left(I + \Lambda t + \frac{\Lambda^2 t^2}{2!} + \frac{\Lambda^3 t^3}{3!} + \cdots\right)V^{-1} \quad \text{(factor out } V(\cdot)V^{-1}\text{)} \\
&= V\,\text{diag}\left(1+\lambda_1t+\frac{\lambda_1^2t^2}{2!}+\frac{\lambda_1^3t^3}{3!}+\cdots, \ \ldots, \ 1+\lambda_nt+\frac{\lambda_n^2t^2}{2!}+\frac{\lambda_n^3t^3}{3!}+\cdots\right)V^{-1} \\
&= V \begin{bmatrix}
e^{\lambda_1 t} & & \\
& \ddots & \\
& & e^{\lambda_n t}
\end{bmatrix} V^{-1} \quad \text{(we recognize } 1+\lambda t+\frac{\lambda^2t^2}{2!}+\cdots \text{ as (PS))}
\end{aligned}
\end{equation}

That's very nice! We diagonalize $A$, then exponentiate its eigenvalues to compute $e^{At}$. Let's plug this back into $\mathbf{x}(t) = e^{At}\mathbf{x}(0)$:

\begin{equation}
\mathbf{x}(t) = V \begin{bmatrix}
e^{\lambda_1 t} & & \\
& \ddots & \\
& & e^{\lambda_n t}
\end{bmatrix} V^{-1} \mathbf{x}(0).
\end{equation}

Now, if we let $\mathbf{c} = V^{-1}\mathbf{x}(0)$, we can write

\begin{equation}
\mathbf{x}(t) = \begin{bmatrix} \mathbf{v}_1 & \cdots & \mathbf{v}_n \end{bmatrix} \begin{bmatrix}
e^{\lambda_1 t} & & \\
& \ddots & \\
& & e^{\lambda_n t}
\end{bmatrix} \begin{bmatrix}
c_1 \\ \vdots \\ c_n
\end{bmatrix} = c_1 e^{\lambda_1 t} \mathbf{v}_1 + \cdots + c_n e^{\lambda_n t} \mathbf{v}_n,
\end{equation}

recovering our previous solution, with the exact formula $\mathbf{c} = V^{-1}\mathbf{x}(0)$ we saw previously for the coefficients $c_1, \ldots, c_n$.
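The diagonalization recipe can be sketched numerically as follows. This is a minimal illustration that assumes NumPy's `np.linalg.eig` for the eigendecomposition; the matrix `A` (with eigenvalues $-1$ and $-2$), the initial condition `x0`, and the time `t` are example values of our own choosing.

```python
import numpy as np

# Example data (assumed): A has real, distinct eigenvalues -1 and -2.
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
x0 = np.array([1.0, 0.0])
t = 0.5

# Diagonalize A = V Lambda V^{-1}; the columns of V are eigenvectors.
eigvals, V = np.linalg.eig(A)

# e^{At} = V diag(e^{lambda_i t}) V^{-1}
expAt = V @ np.diag(np.exp(eigvals * t)) @ np.linalg.inv(V)
x_t = expAt @ x0

# Same solution written as a sum of modes, with c = V^{-1} x(0).
c = np.linalg.solve(V, x0)
x_modes = sum(c[i] * np.exp(eigvals[i] * t) * V[:, i] for i in range(len(c)))

print(x_t, x_modes)  # the two vectors should agree
```

Solving $V\mathbf{c} = \mathbf{x}(0)$ with `np.linalg.solve` avoids forming $V^{-1}$ explicitly for the coefficients, which is usually preferable numerically.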

### Case 2: Imaginary eigenvalues

We focus on the $2 \times 2$ case with

\begin{equation}
A = \begin{bmatrix} 0 & \omega \\ -\omega & 0 \end{bmatrix} = \omega J, \quad \text{where } J = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix},
\end{equation}

which has purely imaginary eigenvalues $\pm i\omega$. In this case, we will compute the power series directly. The first four powers of $A$ are

\begin{equation}
\begin{aligned}
A &= \omega \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix} = \omega J, &
A^2 &= \omega^2 \begin{bmatrix} -1 & 0 \\ 0 & -1 \end{bmatrix} = \omega^2 J^2, \\
A^3 &= \omega^3 \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix} = \omega^3 J^3, &
A^4 &= \omega^4 \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} = \omega^4 J^4.
\end{aligned}
\end{equation}

Since $J^4 = I$, the powers of $J$ repeat with period four:

\begin{equation}
A^5 = \omega^5 J^5 = \omega^5 J, \quad A^6 = \omega^6 J^6 = \omega^6 J^2, \quad A^7 = \omega^7 J^7 = \omega^7 J^3, \quad A^8 = \omega^8 J^8 = \omega^8 J^4,
\end{equation}

etc. So putting this together in computing $e^{At}$ we get:

\begin{equation}
e^{At} = \begin{bmatrix} 1 - \frac{1}{2!}(\omega t)^2 + \cdots & \omega t - \frac{1}{3!}(\omega t)^3 + \cdots \\
-\omega t + \frac{1}{3!}(\omega t)^3 - \cdots & 1 - \frac{1}{2!}(\omega t)^2 + \cdots \end{bmatrix} = \begin{bmatrix} \cos \omega t & \sin \omega t \\ -\sin \omega t & \cos \omega t \end{bmatrix},
\end{equation}

where we used the power series for $\sin \omega t$ and $\cos \omega t$ in the last equality. As expected, the matrix $A = \omega \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}$ has a matrix exponential which defines a rotation, at rate $\omega$, so that

\begin{equation}
\mathbf{x}(t) = \begin{bmatrix} \cos \omega t & \sin \omega t \\ -\sin \omega t & \cos \omega t \end{bmatrix} \mathbf{x}(0).
\end{equation}
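A short numerical check of this rotation formula, as a sketch: we sum a truncated power series for $e^{At}$ and compare it with the closed-form matrix of cosines and sines. The values of `omega` and `t` are arbitrary, and `expm_series` is a hypothetical helper written for this illustration.

```python
import numpy as np


def expm_series(X, terms=40):
    """Truncated matrix power series for e^X (illustrative helper)."""
    X = np.asarray(X, dtype=float)
    total = np.eye(X.shape[0])
    term = np.eye(X.shape[0])
    for k in range(1, terms):
        term = term @ X / k  # X^k / k!
        total = total + term
    return total


omega, t = 2.0, 0.4  # example values (assumed)
A = omega * np.array([[0.0, 1.0], [-1.0, 0.0]])

series = expm_series(A * t)
closed_form = np.array([
    [np.cos(omega * t), np.sin(omega * t)],
    [-np.sin(omega * t), np.cos(omega * t)],
])

# A rotation preserves length: |x(t)| should equal |x(0)|.
x0 = np.array([3.0, 4.0])
length_after = np.linalg.norm(closed_form @ x0)
print(np.max(np.abs(series - closed_form)), length_after)
```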

### Case 3: Complex eigenvalues

Let's generalize our previous example to $A = \begin{bmatrix} 6 & \omega \\ -\omega & 6 \end{bmatrix}$. The matrix $A$ has complex conjugate eigenvalues $\lambda_1 = 6 + i\omega$ and $\lambda_2 = 6 - i\omega$. We will again compute the power series directly. To do so, we will use the following very useful fact:

**Fact:** If $AB = BA$, that is, if $A$ and $B$ commute, then $e^{A+B} = e^A e^B$. Conversely, $e^{(A+B)t} = e^{At} e^{Bt}$ holds for all $t$ only if $A$ and $B$ commute.
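To see why commutativity matters here, the sketch below evaluates both sides of $e^{A+B} = e^A e^B$ for one commuting pair and one non-commuting pair. All four matrices are examples of our own choosing, and `expm_series` is a hypothetical helper that truncates the matrix power series.

```python
import numpy as np


def expm_series(X, terms=40):
    """Truncated matrix power series for e^X (illustrative helper)."""
    X = np.asarray(X, dtype=float)
    total = np.eye(X.shape[0])
    term = np.eye(X.shape[0])
    for k in range(1, terms):
        term = term @ X / k  # X^k / k!
        total = total + term
    return total


# Commuting pair: a multiple of the identity commutes with everything.
A = 2.0 * np.eye(2)
B = np.array([[0.0, 1.0], [-1.0, 0.0]])
commuting_gap = np.linalg.norm(
    expm_series(A + B) - expm_series(A) @ expm_series(B)
)

# Non-commuting pair: C @ D differs from D @ C.
C = np.array([[0.0, 1.0], [0.0, 0.0]])
D = np.array([[0.0, 0.0], [1.0, 0.0]])
noncommuting_gap = np.linalg.norm(
    expm_series(C + D) - expm_series(C) @ expm_series(D)
)

print(commuting_gap)     # near zero
print(noncommuting_gap)  # clearly nonzero for this pair
```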

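Here is a sketch of why commuting matrices satisfy $e^{A+B} = e^A e^B$. It assumes, without proof, that the series involved converge absolutely so that their terms may be regrouped. When $AB = BA$, the binomial theorem applies to $(A+B)^k$ exactly as it does for scalars, so

\begin{equation}
\begin{aligned}
e^{A+B} &= \sum_{k=0}^{\infty} \frac{(A+B)^k}{k!} = \sum_{k=0}^{\infty} \frac{1}{k!} \sum_{j=0}^{k} \binom{k}{j} A^j B^{k-j} \\
&= \sum_{k=0}^{\infty} \sum_{j=0}^{k} \frac{A^j}{j!} \frac{B^{k-j}}{(k-j)!} = \left(\sum_{j=0}^{\infty} \frac{A^j}{j!}\right)\left(\sum_{m=0}^{\infty} \frac{B^m}{m!}\right) = e^A e^B.
\end{aligned}
\end{equation}

Without commutativity the binomial step already fails at $k = 2$, since in general $(A+B)^2 = A^2 + AB + BA + B^2$, which differs from $A^2 + 2AB + B^2$ unless $AB = BA$.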

We will use this fact strategically. First, with $J = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}$, we note that we can write $A = 6I + \omega J$. Importantly, $6I$ and $\omega J$ commute, as $(6I)(\omega J) = (\omega J)(6I) = 6\omega J$. Therefore,

\begin{equation}
e^{At} = e^{(6I + \omega J)t} = e^{6It} e^{\omega Jt} = \begin{bmatrix} e^{6t} & 0 \\ 0 & e^{6t} \end{bmatrix} \begin{bmatrix} \cos \omega t & \sin \omega t \\ -\sin \omega t & \cos \omega t \end{bmatrix} = e^{6t} \begin{bmatrix} \cos \omega t & \sin \omega t \\ -\sin \omega t & \cos \omega t \end{bmatrix},
\end{equation}

where $e^{\omega J t}$ is the rotation matrix computed in Case 2.
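The closed form above can be checked numerically with a short sketch. The values of `omega` and `t` are arbitrary examples, `expm_series` is a hypothetical helper that truncates the matrix power series, and `np.linalg.eigvals` is used only to confirm the eigenvalues $6 \pm i\omega$.

```python
import numpy as np


def expm_series(X, terms=60):
    """Truncated matrix power series for e^X (illustrative helper)."""
    X = np.asarray(X, dtype=float)
    total = np.eye(X.shape[0])
    term = np.eye(X.shape[0])
    for k in range(1, terms):
        term = term @ X / k  # X^k / k!
        total = total + term
    return total


omega, t = 2.0, 0.3  # example values (assumed)
A = np.array([[6.0, omega], [-omega, 6.0]])

# Eigenvalues should be the complex conjugate pair 6 +/- i*omega.
eigs = np.linalg.eigvals(A)

# Growth factor e^{6t} times a rotation by angle omega*t.
rotation = np.array([
    [np.cos(omega * t), np.sin(omega * t)],
    [-np.sin(omega * t), np.cos(omega * t)],
])
closed_form = np.exp(6.0 * t) * rotation
series = expm_series(A * t)

print(eigs)
print(np.max(np.abs(series - closed_form)))  # should be near zero
```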

### Case 4: Jordan block

Assume $A = V \begin{bmatrix} \lambda & 1 \\ 0 & \lambda \end{bmatrix} V^{-1}$, where the columns of $V = \begin{bmatrix} \mathbf{v}_1 & \mathbf{v}_2 \end{bmatrix}$ are an eigenvector $\mathbf{v}_1$ and a generalized eigenvector $\mathbf{v}_2$ of $A$.

Then, following the same argument as in Case 1, we have that

\begin{equation}
e^{At} = V e^{\begin{bmatrix} \lambda & 1 \\ 0 & \lambda \end{bmatrix}t} V^{-1}.
\end{equation}

To compute $e^{\begin{bmatrix} \lambda & 1 \\ 0 & \lambda \end{bmatrix}t}$, we note that

\begin{equation}
\begin{bmatrix} \lambda & 1 \\ 0 & \lambda \end{bmatrix}t = \lambda It + t\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix},
\end{equation}

and that these two terms commute. Hence

\begin{equation}
e^{\begin{bmatrix} \lambda & 1 \\ 0 & \lambda \end{bmatrix}t} = e^{\lambda It} e^{t\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}}.
\end{equation}

We note that

\begin{equation}
e^{\lambda It} = \begin{bmatrix} e^{\lambda t} & 0 \\ 0 & e^{\lambda t} \end{bmatrix} \quad \text{and} \quad e^{t\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}} = I + t\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} = \begin{bmatrix} 1 & t \\ 0 & 1 \end{bmatrix},
\end{equation}

where the second series terminates because all higher powers of $\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}$ are zero. This allows us to conclude that

\begin{equation}
e^{\begin{bmatrix} \lambda & 1 \\ 0 & \lambda \end{bmatrix}t} = \begin{bmatrix} e^{\lambda t} & te^{\lambda t} \\ 0 & e^{\lambda t} \end{bmatrix},
\end{equation}

and, letting $\mathbf{c} = V^{-1}\mathbf{x}(0)$, that

\begin{equation}
\begin{aligned}
\mathbf{x}(t) = e^{At} \mathbf{x}(0) &= \begin{bmatrix} \mathbf{v}_1 & \mathbf{v}_2 \end{bmatrix} \begin{bmatrix} e^{\lambda t} & te^{\lambda t} \\ 0 & e^{\lambda t} \end{bmatrix} V^{-1} \mathbf{x}(0) \\
&= \begin{bmatrix} \mathbf{v}_1 & \mathbf{v}_2 \end{bmatrix} \begin{bmatrix} c_1 e^{\lambda t} + c_2 te^{\lambda t} \\ c_2 e^{\lambda t} \end{bmatrix} = (c_1 e^{\lambda t} + c_2 te^{\lambda t})\mathbf{v}_1 + c_2 e^{\lambda t} \mathbf{v}_2,
\end{aligned}
\end{equation}

which we recognize from our previous section on Jordan blocks.
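The Jordan block formulas can also be checked numerically. In the sketch below, the eigenvalue `lam`, the matrix `V`, the initial condition `x0`, and the time `t` are made-up example values, and `expm_series` is a hypothetical helper that truncates the matrix power series.

```python
import numpy as np


def expm_series(X, terms=40):
    """Truncated matrix power series for e^X (illustrative helper)."""
    X = np.asarray(X, dtype=float)
    total = np.eye(X.shape[0])
    term = np.eye(X.shape[0])
    for k in range(1, terms):
        term = term @ X / k  # X^k / k!
        total = total + term
    return total


lam, t = -0.5, 1.2  # example values (assumed)
jordan = np.array([[lam, 1.0], [0.0, lam]])
N = np.array([[0.0, 1.0], [0.0, 0.0]])  # nilpotent part, N @ N = 0

# Closed form: e^{Jt} = e^{lam t} (I + t N)
closed_form = np.exp(lam * t) * (np.eye(2) + t * N)
series = expm_series(jordan * t)

# Build A = V J V^{-1}; column 0 of V is an eigenvector v1,
# column 1 is a generalized eigenvector v2.
V = np.array([[1.0, 0.0], [1.0, 1.0]])
A = V @ jordan @ np.linalg.inv(V)
x0 = np.array([1.0, 2.0])

c = np.linalg.solve(V, x0)  # c = V^{-1} x(0)
v1, v2 = V[:, 0], V[:, 1]
e = np.exp(lam * t)
x_formula = (c[0] * e + c[1] * t * e) * v1 + c[1] * e * v2
x_series = expm_series(A * t) @ x0

print(x_formula, x_series)  # the two vectors should agree
```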
