---
title: 2.5 Kernel and Image
subject: Vector Spaces and Bases
subtitle: For our next trick, we'll make this vector disappear
short_title: 2.5 Kernel and Image
authors:
  - name: Nikolai Matni
    affiliations:
      - Dept. of Electrical and Systems Engineering
      - University of Pennsylvania
    email: nmatni@seas.upenn.edu
license: CC-BY-4.0
keywords: null spaces, column spaces, solution of linear systems, superposition principle
math:
  '\vv': '\mathbf{#1}'
  '\bm': '\begin{bmatrix}'
  '\em': '\end{bmatrix}'
  '\R': '\mathbb{R}'
---

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/nikolaimatni/ese-2030/HEAD?labpath=/01_Vector_Spaces_and_Bases/035-kernel_image.ipynb)

{doc}`Lecture notes <../lecture_notes/Lecture 04 - The Fundamental Matrix Subspaces (Kernel, Image, CoKernel, CoImage), Fundamental Theorem of Linear Algebra, and a brief interlude on the Matrix Transpose.pdf>`

## Reading

Material related to this page, as well as additional exercises, can be found in ALA Ch. 2.5 and LAA 4.2.

## Learning Objectives

By the end of this page, you should know:
- what the kernel (null space) and image (column space) of a matrix are
- how the null space and column space relate to solutions of linear systems
- how to apply the superposition principle to solve linear systems with different right-hand sides

## Introduction

The following two subspaces of $\mathbb{R}^n$ (or generic vector spaces $V$) typically arise in applications of linear algebra and are closely related to systems of linear equations.

1. **Null space**: the set of all solutions to a system of linear equations of the form $A \vv x = \vv 0$, called a _homogeneous_ linear system.
2. **Span**: the set of all linear combinations of certain specified vectors. One common example is the **column space** of a matrix $A$, which is detailed on this page.

## Null Space of a Matrix


:::{prf:definition} Null space
:label: null_defn
The _null space_ of a matrix $A \in \mathbb{R}^{m \times n}$ is the subspace Null$(A) \subset \mathbb{R}^n$ consisting of all vectors that are annihilated by $A$:
\begin{equation}
\label{null_eqn}
\textrm{Null}(A) = \{ \vv x \in \mathbb{R}^n \ | \ A \vv x = \vv 0 \} \subset \mathbb{R}^n.
\end{equation}
:::

If we think of the function $f(\vv x) = A \vv x$ that maps $\vv x \mapsto A \vv x$, then Null$(A)$ is the subset of $\mathbb{R}^n$ that $f$ maps to $\vv 0$.
```{note}
The _Null space_ is also known as the _kernel_ of the matrix. We will use Null$(A)$ in our notes even though ALA uses _kernel_, because _null space_ is more descriptive of what it actually is: $\{ \vv x \in \mathbb{R}^n \ | \ A \vv x = \vv 0 \}$. 
```

:::{prf:example}
:label:null_eg

Consider the following system of homogeneous equations
\begin{equation}
\label{null_eg_eqn}
x_1 + 2x_2 + x_3 &= 0, \\
-x_1 + 4x_2 + 2x_3 &= 0, 
\end{equation}
or in matrix form $A \vv x = \vv 0$, where $A = \bm1 & 2 & 1 \\ -1 & 4 & 2 \em$.
The set of $\vv x$ satisfying $A \vv x = \vv 0$ is the [solution set](../Linear_Algebraic_Systems/026-linsys-general.ipynb#solution_set) of [](#null_eg_eqn). Our goal is to relate the solution set to the matrix $A$ which will give us a geometric interpretation. 

Is $\vv u = \bm 0 \\ 1 \\ -2 \em$ in Null$(A)$? 

Evaluating $A \vv u = \bm1 & 2 & 1 \\ -1 & 4 & 2 \em\bm 0 \\ 1 \\ -2 \em = \bm 0 \\ 0\em \Rightarrow A \vv u = \vv 0 \Rightarrow \vv u \in $Null$(A)$. 
:::
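The membership test in the example above is a matrix-vector product followed by a comparison with zero. Here is a minimal pure-Python sketch of that check; the helper names `matvec` and `in_null_space` and the contrast vector `w` are illustrative choices, not part of the notes.

```python
def matvec(A, x):
    """Return the product A x, with A stored as a list of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def in_null_space(A, x):
    """True when A x is exactly the zero vector."""
    return all(entry == 0 for entry in matvec(A, x))

A = [[1, 2, 1],
     [-1, 4, 2]]
u = [0, 1, -2]   # the vector tested in the example
w = [1, 0, 0]    # hypothetical extra vector, added only for contrast

print(matvec(A, u), in_null_space(A, u))   # [0, 0] True
print(matvec(A, w), in_null_space(A, w))   # [1, -1] False
```

Integer entries keep the comparison with zero exact; with floating-point data one would compare against a small tolerance instead.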

We call Null$(A)$ the null _space_ because it is a [subspace](./032-subspaces.ipynb#sub_def), which we verify as follows. Suppose $\vv u, \vv v \in $Null$(A)$ and $c, d \in \mathbb{R}$. We need to check that $c \vv u + d \vv v \in$ Null$(A)$:
\begin{equation}
\label{null_space_check}
A(c \vv u + d \vv v) &= c A \vv u + d A \vv v, \ (\textrm{linearity of matrix multiplication}) \\
&= c \vv 0 + d \vv 0, \ (\vv u, \vv v \in \textrm{Null}(A)) \\
&= \vv 0
\end{equation}

From [](#null_space_check), if $A \in \mathbb{R}^{m \times n}$, then Null$(A)$ is a subspace of $\mathbb{R}^n$, which leads to the following principle.

```{prf:theorem} Superposition principle
:label: super_thm
If $\vv u_1, \vv u_2, \ldots, \vv u_k$ are each solutions to $A \vv u = \vv 0$, then so is every [linear combination](./033-span_lin_ind.ipynb#ln_comb) $c_1 \vv u_1 + c_2\vv u_2 + \ldots + c_k\vv u_k$.
```

```{warning}
The set of solutions to $A \vv x = \vv b$ where $\vv b \neq \vv 0$ is not a subspace! 
```
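To see why the warning above holds, recall that every subspace must contain $\vv 0$, yet
$$
A \vv 0 = \vv 0 \neq \vv b \quad \textrm{whenever} \ \vv b \neq \vv 0,
$$
so $\vv 0$ is never a solution of the inhomogeneous system. Closure under addition fails as well: if $A \vv x_1 = \vv b$ and $A \vv x_2 = \vv b$, then $A(\vv x_1 + \vv x_2) = 2 \vv b \neq \vv b$.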

```{important}
[Superposition](#super_thm) has the following practical implication. We only need to find a few specific solutions to $A \vv u = \vv 0$ to construct every possible solution via linear combinations.
```

```{note}
The same ideas of solving linear systems with a matrix representation extend to more general linear systems with infinite dimensional vector spaces such as linear differential equations, which we will see later in the course. 
```

## Describing the Null Space

An explicit description of Null$(A)$ can be obtained by solving $A \vv x = \vv 0$, for example, using Gaussian elimination.

:::{prf:example}
:label:null_eg_GE
Let us find a [basis](./034-basis_dim.ipynb#basis_defn) for Null$(A)$, where 
$A = \bm -3 & 6 & -1 & 1 & -7 \\
1 & -2 & 2 & 3 & -1 \\
2 & -4 & 5 & 8 & -4
\em$. We reduce $A$ to reduced row echelon form
\begin{equation}
\label{null_GE_eqn}
A \leftrightarrow \bm 1 & -2 & 0 & -1 & 3 \\
0 & 0 & 1 & 2 & -2 \\
0 & 0 & 0 & 0 & 0
\em \leftrightarrow x_1 = 2x_2 + x_4 - 3x_5, \ x_3 = -2x_4 + 2x_5.
\end{equation}
From [](#null_GE_eqn), the free variables are $x_2, x_4, x_5$ and the basic variables are $x_1, x_3$, since the pivots are at positions $(1, 1)$ and $(2, 3)$. We can decompose the general solution as
\begin{equation}
\label{null_soln}
\bm x_1 \\ x_2 \\ x_3 \\ x_4 \\ x_5\em = \bm 2x_2 + x_4 - 3x_5 \\ x_2 \\ -2x_4 + 2x_5 \\ x_4 \\ x_5\em = x_2 \bm 2 \\ 1 \\ 0 \\ 0 \\ 0\em + x_4 \bm 1 \\ 0 \\ -2 \\ 1 \\ 0\em + x_5 \bm -3 \\ 0 \\ 2 \\ 0 \\ 1\em = x_2 \vv u_1 + x_4 \vv u_2 + x_5 \vv u_3, \\
\textrm{where}, \vv u_1 = \bm 2 \\ 1 \\ 0 \\ 0 \\ 0\em, \ \vv u_2 = \bm 1 \\ 0 \\ -2 \\ 1 \\ 0\em, \ \vv u_3 = \bm -3 \\ 0 \\ 2 \\ 0 \\ 1\em.
\end{equation}
From [](#null_soln), every linear combination of $\vv u_1, \vv u_2, \vv u_3$ is in Null$(A)$. Also, $\vv u_1, \vv u_2, \vv u_3$ are linearly independent (think: when does the combination in [](#null_soln) equal $\vv 0$, and why?). Hence, $\vv u_1, \vv u_2, \vv u_3$ form a basis for Null$(A)$.

We conclude that Null$(A) \subset \mathbb{R}^5$ is a subspace of [dimension](./034-basis_dim.ipynb#dim_defn) 3.

:::
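The procedure in the example above (row reduce, identify the free columns, read off one basis vector per free variable) can be sketched in pure Python with exact rational arithmetic. The function names `rref` and `null_space_basis` are hypothetical helpers written for this illustration, and column indices are 0-based, so pivot columns `[0, 2]` correspond to $x_1, x_3$.

```python
from fractions import Fraction

def matvec(A, x):
    """Return the product A x, with A stored as a list of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def rref(A):
    """Return (R, pivot_cols): the reduced row echelon form of A and its pivot columns."""
    R = [[Fraction(v) for v in row] for row in A]
    m, n = len(R), len(R[0])
    pivot_cols, r = [], 0
    for c in range(n):
        if r == m:
            break
        # Look for a row at or below r with a nonzero entry in column c.
        p = next((i for i in range(r, m) if R[i][c] != 0), None)
        if p is None:
            continue                      # no pivot here: column c is free
        R[r], R[p] = R[p], R[r]
        pivot = R[r][c]
        R[r] = [v / pivot for v in R[r]]  # scale the pivot to 1
        for i in range(m):                # eliminate above and below the pivot
            if i != r and R[i][c] != 0:
                factor = R[i][c]
                R[i] = [vi - factor * vr for vi, vr in zip(R[i], R[r])]
        pivot_cols.append(c)
        r += 1
    return R, pivot_cols

def null_space_basis(A):
    """One basis vector per free variable: set it to 1, the other free variables to 0."""
    R, pivots = rref(A)
    n = len(A[0])
    basis = []
    for f in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[f] = Fraction(1)
        for row, p in zip(R, pivots):     # basic variable x_p = -(coefficient of x_f)
            v[p] = -row[f]
        basis.append(v)
    return basis

A = [[-3, 6, -1, 1, -7],
     [1, -2, 2, 3, -1],
     [2, -4, 5, 8, -4]]

R, pivots = rref(A)
basis = null_space_basis(A)
print(pivots)
for u in basis:
    print([str(v) for v in u], matvec(A, u) == [0, 0, 0])

# Superposition: an arbitrary linear combination is again annihilated by A.
combo = [2 * a - b + 3 * c for a, b, c in zip(*basis)]
print(matvec(A, combo) == [0, 0, 0])
```

Using `Fraction` avoids round-off, so the zero tests are exact; a floating-point version would need a tolerance when deciding whether an entry is a pivot.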

## The Column Space of $A$

We have seen previously that the matrix-vector product $A \vv x$,
$$
A \vv x = x_1 \vv a_1 + x_2 \vv a_2 + \ldots + x_n \vv a_n,
$$
is the linear combination of the columns of $A = \bm \vv a_1 & \vv a_2 & \ldots & \vv a_n\em$ weighted by the entries of $\vv x$. By letting the entries of $\vv x$ vary, we define the _column space_ of $A$.

:::{prf:definition} Column space
The column space of an $m \times n$ matrix $A$, written as Col$(A)$, is the set of all linear combinations of the columns of $A =  \bm \vv a_1 & \vv a_2 & \ldots & \vv a_n\em$:
\begin{equation}
\label{col_eqn}
\textrm{Col}(A) &= \{\vv b \in \mathbb{R}^m : \vv b = A \vv x \ \textrm{for some} \ \vv x \in \mathbb{R}^n\}, \\
&= \textrm{span}(\vv a_1, \vv a_2, \ldots, \vv a_n).
\end{equation}
Col$(A)$ is also sometimes called the _image_ or _range space_ of $A$.
:::

Since Col$(A)$ is defined as the span of some vectors, it is immediate that Col$(A)$ is a subspace. However, Col$(A) \subset \mathbb{R}^m$ (where $\vv b$ lives), not $\mathbb{R}^n$ (where Null$(A)$ and $\vv x$ live).

```{important}
It is immediate that $A \vv x = \vv b$ has at least one solution if and only if $\vv b \in $Col$(A)$. 
```

:::{exercise}
:label:col_ex
Find a matrix $A$ so that the set 
\begin{equation}
\label{col_ex_eqn}
W = \left\{ \bm a + b - 3c \\ b + c + a \\ -a-b - 6c \\ 7a + 7b + 2c\em : a, b, c \in \mathbb{R}\right\}
\end{equation}
is equal to Col$(A)$.

```{solution} col_ex
:class: dropdown 

First, we write $W$ as a linear combination
\begin{equation}
\label{col_ex_soln1}
W = \left\{ a \bm 1 \\ 1 \\ -1 \\ 7\em + b \bm 1 \\ 1 \\ -1 \\ 7\em + c \bm -3 \\ 1 \\ -6 \\ 2\em : a, b, c \in \mathbb{R}\right\}
\end{equation}
Note that in [](#col_ex_soln1), the first two vectors are the same. Hence, we can rewrite [](#col_ex_soln1) as
\begin{equation}
\label{col_ex_soln2}
W &= \left\{ (a+b) \bm 1 \\ 1 \\ -1 \\ 7\em + c \bm -3 \\ 1 \\ -6 \\ 2\em = a' \bm 1 \\ 1 \\ -1 \\ 7\em + c \bm -3 \\ 1 \\ -6 \\ 2\em : a', c \in \mathbb{R}\right\}, \\
&= \textrm{span}\left\{\bm 1 \\ 1 \\ -1 \\ 7\em, \bm -3 \\ 1 \\ -6 \\ 2\em\right\}
\end{equation}
We can set the vectors in [](#col_ex_soln2) as the columns of $A$: $A = \bm 1 & -3 \\ 1 & 1 \\ -1 & -6 \\ 7 & 2\em \Rightarrow W = $Col$(A)$.

``` 
:::
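The conclusion of the exercise above can be spot-checked numerically. The sketch below compares the generic element of $W$ with $A \bm a + b \\ c \em$ on a small integer grid; this gives evidence rather than a proof, and `w_vector` and `matvec` are hypothetical helper names.

```python
from itertools import product

def matvec(A, x):
    """Return the product A x, with A stored as a list of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def w_vector(a, b, c):
    """The generic element of W, copied entry by entry from the exercise."""
    return [a + b - 3 * c, b + c + a, -a - b - 6 * c, 7 * a + 7 * b + 2 * c]

A = [[1, -3],
     [1, 1],
     [-1, -6],
     [7, 2]]

# Every sampled element of W equals A times (a + b, c), so it lies in Col(A).
grid = list(product(range(-2, 3), repeat=3))
w_in_col = all(w_vector(a, b, c) == matvec(A, [a + b, c]) for a, b, c in grid)

# Conversely, A times (a', c) is the element of W with (a, b, c) = (a', 0, c).
col_in_w = all(matvec(A, [ap, c]) == w_vector(ap, 0, c)
               for ap, c in product(range(-2, 3), repeat=2))

print(w_in_col, col_in_w)   # True True
```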

## The Complete Solution to $A \vv x = \vv b$

```{prf:theorem}
:label: soln_thm
The linear system $A \vv x = \vv b$ has at least one solution $\vv x^*$ if and only if $\vv b \in $Col$(A)$. If this occurs, then $\vv x$ is a solution to $A \vv x = \vv b$ if and only if 
$$
\vv x = \vv x^* + \vv n,
$$
where $\vv n \in$ Null$(A)$ is an element of the null space of $A$.
:::{prf:proof} Proof of [](#soln_thm)
:label: proof-soln_thm
:class: dropdown
The first part of the theorem was already discussed [here](./033-span_lin_ind.ipynb#lin_dep_thm). For the second part, suppose $\vv x$ and $\vv x^*$ are both solutions, so that $A \vv x = A \vv x^* = \vv b$. Then their difference $\vv n = \vv x - \vv x^*$ satisfies
$$
A \vv n = A (\vv x - \vv x^*) = A \vv x - A \vv x^* = \vv b - \vv b = \vv 0
$$
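so that $\vv n \in$ Null$(A)$. This means that $\vv x = \vv x^* + (\vv x - \vv x^*) = \vv x^* + \vv n$. Conversely, if $\vv n \in$ Null$(A)$, then $A(\vv x^* + \vv n) = A \vv x^* + A \vv n = \vv b + \vv 0 = \vv b$, so $\vv x^* + \vv n$ is also a solution.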
:::
```

```{note} Consequences of [](#soln_thm)
1. [](#soln_thm) tells us that to construct the most general solution to $A \vv x = \vv b$, we only need to know a _particular solution $\vv x^*$_ and the general solution to $A \vv n = \vv 0$.
2. Recall that inhomogeneous linear ordinary differential equations are solved in the same way as in [](#soln_thm): a particular solution plus the general solution to the homogeneous equation. We will see later that linear algebraic systems and linear ordinary differential equations are both examples of _general linear systems_.
3. Computing the general solution to $A \vv x = \vv b$ amounts to applying Gaussian Elimination (GE) to $\bm A | \vv b\em$ to get a particular solution, and to $\bm A | \vv 0 \em$ to characterize the null space. Both use the same row operations on $A$, so a single reduction of $\bm A | \vv b\em$ yields both.
```

:::{exercise}
:label:thm_ex

Find all solutions to the linear system

\begin{align*}
    x_1 + 2x_2 + x_3 - 2x_4 &= 9\\
    -2x_1 - 3x_2 - x_3 + x_4 &= -7\\
    x_1 + x_2 - x_3 - x_4 &= 2
\end{align*}

```{solution} thm_ex
:class: dropdown 

First, let's form the augmented matrix:

\begin{align*}
    \left[ \begin{array}{cccc|c}
        1 & 2 & 1 & -2 & 9\\
        -2 & -3 & -1 & 1 & -7\\
        1 & 1 & -1 & -1 & 2
    \end{array} \right]
\end{align*}

Putting the coefficient matrix in row echelon form, we get the equivalent system

\begin{align*}
    \left[ \begin{array}{cccc|c}
        1 & 2 & 1 & -2 & 9\\
        0 & 1 & 1 & -3 & 11\\
        0 & 0 & -1 & -2 & 4
    \end{array} \right]
\end{align*}

Putting the coefficient matrix in reduced row echelon form (i.e., eliminating above the pivots, then scaling the pivots to equal 1), we get the equivalent system

\begin{align*}
    \left[ \begin{array}{cccc|c}
        1 & 0 & 0 & 6 & -17\\
        0 & 1 & 0 & -5 & 15\\
        0 & 0 & 1 & 2 & -4
    \end{array} \right]
\end{align*}

This is equivalent to the linear system

\begin{align*}
    x_1 + 6x_4 &= -17\\
    x_2 - 5x_4 &= 15\\
    x_3 + 2x_4 &= -4
\end{align*}

whose solutions are given by:

\begin{align*}
    \bm x_1 \\ x_2 \\ x_3 \\ x_4 \em &= \bm -17 - 6x_4 \\ 15 + 5x_4 \\ -4 - 2x_4 \\ x_4 \em \\
    &= \underbrace{\bm -17 \\ 15 \\ -4 \\ 0 \em}_{\text{particular solution}} + \underbrace{x_4 \bm -6 \\ 5 \\ -2 \\1 \em}_{\text{null space}}\tag{for all $x_4 \in \mathbb R$}
\end{align*}

```

:::
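The structure "particular solution plus null space" found in the exercise above can be checked directly: $\vv x^* + x_4 \vv n$ should satisfy the system for every value of $x_4$. Here is a minimal sketch, where the sampled values of $x_4$ are arbitrary choices and the names `x_star`, `n_vec`, and `solution` are illustrative.

```python
from fractions import Fraction

def matvec(A, x):
    """Return the product A x, with A stored as a list of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

A = [[1, 2, 1, -2],
     [-2, -3, -1, 1],
     [1, 1, -1, -1]]
b = [9, -7, 2]

x_star = [-17, 15, -4, 0]   # particular solution from the exercise
n_vec = [-6, 5, -2, 1]      # spans the null space

def solution(t):
    """The member of the solution family with free variable x_4 = t."""
    return [xs + t * ni for xs, ni in zip(x_star, n_vec)]

particular_ok = matvec(A, x_star) == b
null_ok = matvec(A, n_vec) == [0, 0, 0]
family_ok = all(matvec(A, solution(t)) == b
                for t in (-3, 0, Fraction(1, 2), 7))

print(particular_ok, null_ok, family_ok)   # True True True
```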

```{prf:theorem} Summary
:label: sum_thm
If $A \in \mathbb{R}^{m \times n}$, then the following conditions are equivalent (any one implies the others):
1. Null$(A) = \{\vv 0\}$, i.e., $A \vv x = \vv 0$ if and only if $\vv x = \vv 0$
2. rank$(A) = n$
3. The linear system $A \vv x = \vv b$ has no [free variables](../Linear_Algebraic_Systems/026-linsys-general.ipynb#free_basic_defn)
4. The system $A \vv x = \vv b$ has a unique solution for each $\vv b \in $Col$(A)$.
```

We can specialize [](#sum_thm) to square matrices, which allows us to characterize whether $A$ is invertible via either its null space or its column space.

```{prf:theorem} Invertible Matrices
:label: inv_thm
If $A \in \mathbb{R}^{n \times n}$, then the following conditions are equivalent:
1. $A$ is nonsingular
2. rank$(A) = n$
3. Null$(A) = \{\vv 0\}$
4. Col$(A) = \mathbb{R}^n$
5. $A \vv x = \vv b$ has a unique solution for all $\vv b \in \mathbb{R}^n$.
```

## Applying the Superposition Principle

For homogeneous systems $A \vv x = \vv 0$, [superposition](./035-kernel_image.ipynb#super_thm) lets us generate new solutions by combining known solutions. For _inhomogeneous systems_ $A \vv x = \vv b$, superposition lets us combine solutions for different $\vv b$ vectors.

Suppose we have solutions $\vv x_1^*$ and $\vv x_2^*$ to $A \vv x = \vv b_1$ and $A \vv x = \vv b_2$, respectively. Can we build a solution to $A \vv x = c_1 \vv b_1 + c_2 \vv b_2$ for some $c_1, c_2 \in \mathbb{R}$? The answer is [superposition](./035-kernel_image.ipynb#super_thm)! Let's try $\vv x^* = c_1 \vv x_1^* + c_2 \vv x_2^*$:
\begin{equation}
\label{super_eqn}
A \vv x^* = A (c_1 \vv x_1^* + c_2 \vv x_2^*) = c_1 (A \vv x_1^*) + c_2 (A \vv x_2^*) = c_1 \vv b_1 + c_2 \vv b_2.
\end{equation}
From [](#super_eqn), $\vv x^* = c_1 \vv x_1^* + c_2 \vv x_2^*$ is a solution to $A \vv x = c_1 \vv b_1 + c_2 \vv b_2$. 

:::{prf:example}
:label:sup_eg

The system
$$
\bm 4 & 1 \\ 1 & 4 \em \bm x_1 \\ x_2 \em = \bm f_1 \\ f_2 \em
$$
models the mechanical response of a pair of masses connected by springs subject to external forcing. 

The solution $\vv x = \bm x_1 \\ x_2 \em$ is the displacement of the masses, and the right-hand side $\vv f = \bm f_1 \\ f_2 \em$ contains the applied forces.

For $\vv f = \vv e_1 = \bm 1 \\ 0 \em$, the solution is $\vv x_1^* = \bm \frac{4}{15} \\ -\frac{1}{15}\em$; and for $\vv f = \vv e_2 = \bm 0 \\ 1 \em$, it is $\vv x_2^* = \bm -\frac{1}{15} \\ \frac{4}{15}\em$.

Hence, we can write the solution for a general forcing $\vv f = \bm f_1 \\ f_2 \em = f_1 \vv e_1 + f_2 \vv e_2$ as $\vv x^* = f_1 \vv x_1^* + f_2 \vv x_2^*$.
:::
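The spring example above can be reproduced in a few lines: combine the two unit-force responses with weights $f_1, f_2$ and confirm that the result satisfies the system. The forcing `f = [3, -2]` is a made-up value chosen only to exercise the formula, and `superpose` is a hypothetical helper name.

```python
from fractions import Fraction

def matvec(A, x):
    """Return the product A x, with A stored as a list of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

A = [[4, 1],
     [1, 4]]

# Responses to the unit forces e_1 and e_2, taken from the example.
x1_star = [Fraction(4, 15), Fraction(-1, 15)]
x2_star = [Fraction(-1, 15), Fraction(4, 15)]

def superpose(f1, f2):
    """Displacement for forcing f = f1*e_1 + f2*e_2, built by superposition."""
    return [f1 * a + f2 * b for a, b in zip(x1_star, x2_star)]

f = [3, -2]                      # hypothetical forcing
x = superpose(*f)

print([str(v) for v in x])       # ['14/15', '-11/15']
print(matvec(A, x) == f)         # True
```

Once the two unit responses are known, any new forcing costs only a weighted sum, with no further elimination.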

### The General Form

We can generalize the above idea to more than two solutions. 

If $\vv x_1^*, \vv x_2^*, \ldots, \vv x_k^*$ are solutions to $A \vv x = \vv b_1, A \vv x = \vv b_2, \ldots, A \vv x = \vv b_k$, then, for any choice of $c_1, c_2, \ldots, c_k \in \mathbb{R}$, a particular solution to
\begin{equation}
\label{gen}
A \vv x = c_1 \vv b_1 + c_2 \vv b_2 + \ldots + c_k \vv b_k
\end{equation}
is given by $\vv x^* = c_1 \vv x_1^* + c_2 \vv x_2^* + \ldots + c_k \vv x_k^*$. The general solution to [](#gen) is
\begin{equation}
\label{gen_n}
\vv x = \vv x^* + \vv n = c_1 \vv x_1^* + c_2 \vv x_2^* + \ldots + c_k \vv x_k^* + \vv n,
\end{equation}
where $\vv n \in $Null$(A)$.

:::{important}
:label: sup_imp
If we know the particular solutions $\vv x_1^*, \vv x_2^*, \ldots, \vv x_m^*$ to $A \vv x = \vv e_i$ for $i = 1, 2, \ldots, m$, where $\vv e_1, \ldots, \vv e_m$ are the standard basis vectors of $\mathbb{R}^m$, then, we can construct a particular solution $\vv x^*$
 to $A \vv x = \vv b$ by first writing 
 $$
\vv b = b_1 \vv e_1 + b_2 \vv e_2  +\ldots + b_m \vv e_m
 $$
 to conclude that $\vv x^* = b_1 \vv x_1^* + b_2 \vv x_2^*  +\ldots + b_m \vv x_m^*$ is a solution to $A \vv x = \vv b$. This tells us how the elements $b_i$ of $\vv b$ affect our solution $\vv x^*$.
:::

```{note}
If $A$ is square and nonsingular, then [this fact](#sup_imp) gives another way of computing $A^{-1}$. The vectors $\vv x_1^*, \vv x_2^*, \ldots, \vv x_m^*$ are the columns of $A^{-1}$ (what are the $m$ linear systems?), and $\vv x^* = A^{-1}\vv b$.
```
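As a sketch of the remark above, the code below solves $A \vv x = \vv e_j$ for each standard basis vector and stacks the solutions as columns. It assumes $A$ is square and nonsingular; `solve_unique` is a hypothetical Gauss-Jordan helper written for this illustration, not a library routine, and the matrix is the one from the spring example.

```python
from fractions import Fraction

def solve_unique(A, b):
    """Solve A x = b by Gauss-Jordan elimination; assumes A is square and nonsingular."""
    n = len(A)
    # Augmented matrix [A | b] with exact rational entries.
    M = [[Fraction(v) for v in row] + [Fraction(bi)] for row, bi in zip(A, b)]
    for c in range(n):
        p = next((i for i in range(c, n) if M[i][c] != 0), None)
        if p is None:
            raise ValueError("matrix is singular")
        M[c], M[p] = M[p], M[c]
        pivot = M[c][c]
        M[c] = [v / pivot for v in M[c]]
        for i in range(n):
            if i != c and M[i][c] != 0:
                factor = M[i][c]
                M[i] = [vi - factor * vc for vi, vc in zip(M[i], M[c])]
    return [row[n] for row in M]

A = [[4, 1],
     [1, 4]]
n = len(A)

# Column j of the inverse is the solution of A x = e_j.
columns = [solve_unique(A, [1 if i == j else 0 for i in range(n)])
           for j in range(n)]
A_inv = [[columns[j][i] for j in range(n)] for i in range(n)]

# Check that A times A_inv is the identity.
product = [[sum(A[i][k] * A_inv[k][j] for k in range(n)) for j in range(n)]
           for i in range(n)]

print([[str(v) for v in row] for row in A_inv])
print(product == [[1, 0], [0, 1]])   # True
```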


