# 4. Numerical Linear Algebra/Eigenproblems (Prof. Peterson)

Let $A$ be an $n \times n$ full rank matrix. We are interested in solving the linear system $Ax = b$ using an iterative method.

**a) Give reasons why you would choose to use an iterative method instead of a direct solver.**

* Iterative methods are often faster when a good initial guess is available. This is common when solving time-dependent PDEs, where the solution at the previous time step serves as the initial guess for the next one.
* Iterative methods often need less storage, particularly for sparse matrices: only the nonzero entries are stored, and the method needs only matrix-vector products with $A$. Sparse direct methods exist as well, but their factors can fill in with additional nonzeros and they are more complicated than iterative methods.

**b)** Let $x^0$ be given and consider the preconditioner form of an iterative method given by
$$ x^{k+1} = x^k + Q^{-1} r_k, \qquad k=0,1,2,... $$
where $Q$ is an $n \times n$ invertible matrix and $r_k$ is the residual
$$ r_k = b - Ax^k $$ 

**b) i) If $x^k$ converges to a vector $y$, prove that this vector is the unique solution to $Ax = b$.**

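Assume $x^k \to y$. Taking the limit $k \to \infty$ on both sides of the iteration (the right-hand side is continuous in $x^k$) shows that $y$ satisfies
$$y = y + Q^{-1}(b - Ay)$$
$$0 = Q^{-1}(b - Ay)$$
Since $Q^{-1}$ is invertible, multiplying both sides by $Q$ gives $b - Ay = 0$, that is, $Ay = b$.
Because $A$ has full rank, it is invertible, so $Ax = b$ has exactly one solution, and $y = A^{-1}b$ is that unique solution.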

**b) ii) The motivation for this form of the iterative method can be seen by taking $Q = A$. In this case, what is $x^1$? What does this suggest we could choose for $Q$?**

Let $Q = A$. Then our iterative method looks like:
$$ x^{k+1} = x^k + A^{-1} (b-Ax^k)$$
$$ x^{k+1} = x^k + A^{-1}b - A^{-1}Ax^k$$
$$ x^{k+1} = A^{-1}b $$
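Setting $k = 0$ gives $x^1 = A^{-1}b = x$, so the method reaches the exact solution in one iteration for any $x^0$. However, applying $A^{-1}$ costs as much as solving the original system directly.

This suggests choosing $Q$ as an approximation of $A$ whose inverse is cheap to apply, for example the diagonal of $A$ (Jacobi) or the lower triangular part of $A$ (Gauss-Seidel).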

**c)** Let $e_k = x^k - x$ be the error vector.

**c) i) Show that $e_k$ satisfies
$$ e_k = B^k e_0 $$
for an appropriately chosen matrix $B$; explicitly give $B$.**

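Because $x$ is the exact solution, $b - Ax = 0$, so we may subtract $Q^{-1}(b-Ax)$ without changing the value:
$$e_{k+1} = x^{k+1} - x = x^k + Q^{-1}(b-Ax^k) - x - Q^{-1}(b-Ax)$$
$$= x^k - x + Q^{-1}b - Q^{-1}Ax^k - Q^{-1}b + Q^{-1}Ax$$
$$= x^k - x - Q^{-1}Ax^k + Q^{-1}Ax$$
$$= x^k - x - Q^{-1}A(x^k - x)$$
$$= (I-Q^{-1}A)(x^k - x)$$
Applying this recursively gives
$$x^{k+1} - x = (I-Q^{-1}A)^{k+1} (x^0 - x)$$
Setting $B = I-Q^{-1}A$, we have shown $e_{k+1} = B^{k+1}e_0$ for all $k \geq 0$. Together with the trivial case $k = 0$, this is
$e_k = B^k e_0$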
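
The identity $e_k = B^k e_0$ with $B = I - Q^{-1}A$ can be checked numerically. The following is a minimal sketch under assumptions that are not part of the original problem: a small hand-picked $3 \times 3$ matrix `A`, the Jacobi choice `Q = diag(A)`, and NumPy as the tool.

```python
import numpy as np

# Assumed example data (not from the original problem).
A = np.array([[4.0, 1.0, 0.0],
              [1.0, 4.0, 1.0],
              [0.0, 1.0, 4.0]])
b = np.array([1.0, 2.0, 3.0])
Q = np.diag(np.diag(A))                  # Jacobi choice, for illustration only

x_exact = np.linalg.solve(A, b)          # reference solution, used only to form the error
B = np.eye(3) - np.linalg.solve(Q, A)    # B = I - Q^{-1} A

x = np.zeros(3)                          # initial guess x^0
e0 = x - x_exact
k = 5
for _ in range(k):
    r = b - A @ x                        # residual r_k
    x = x + np.linalg.solve(Q, r)        # x^{k+1} = x^k + Q^{-1} r_k

e_k = x - x_exact                        # error after k iterations
predicted = np.linalg.matrix_power(B, k) @ e0
print(np.allclose(e_k, predicted))
```

The error measured from the iteration and the error predicted by $B^k e_0$ should agree up to rounding.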

**c) ii) Use your result in (i) to give a sufficient condition for convergence of the iterative method in (b) in terms of the norm of $B$. Justify your answer. Is this true for any matrix norm? Relate your answer in terms of the norm of $B$ to the eigenvalues of $B$.**

We have
$$ x^k - x = (I-Q^{-1}A)^k (x^0-x) = B^k e_0 $$
Taking the norm of both sides, with a vector norm and a matrix norm that is submultiplicative and compatible with it (in particular any induced norm), we get
$$ ||x^k - x|| = ||B^k (x^0-x)|| $$
$$ \leq ||B^k|| \quad ||x^0-x|| \leq ||B||^k \quad ||x^0-x|| $$
Notice that $||x^0-x||$ is constant. If $||B|| < 1$, then $||B||^k \to 0$ as $k \to \infty$, so
$$ \lim_{k \to \infty} ||x^k - x|| = 0 $$
for every initial guess $x^0$. Hence $||B|| < 1$ is a sufficient condition for convergence.

The argument uses only submultiplicativity and compatibility with the vector norm, so the result holds for every induced matrix norm. The condition is not necessary, though: $||B||$ can exceed 1 in a given norm while the iteration still converges.

To relate this to the eigenvalues of $B$: for every submultiplicative matrix norm, $\rho(B) \leq ||B||$, where $\rho(B)$ is the spectral radius (the largest eigenvalue of $B$ in absolute value). Therefore $||B|| < 1$ implies $\rho(B) < 1$. The condition $\rho(B) < 1$ is the sharper one: it is both necessary and sufficient for convergence from every $x^0$.
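
A small example helps separate the norm condition from the spectral radius condition. The sketch below uses a hand-picked $2 \times 2$ matrix (an assumption chosen for illustration, not taken from the problem) whose infinity norm is larger than 1 while its spectral radius is smaller than 1, so $B^k$ still tends to zero.

```python
import numpy as np

# Assumed illustrative matrix: large off-diagonal entry, small eigenvalues.
B = np.array([[0.5, 2.0],
              [0.0, 0.5]])

rho = max(abs(np.linalg.eigvals(B)))          # spectral radius of B
norm_inf = np.linalg.norm(B, np.inf)          # induced infinity norm (max row sum)
norm_Bk = np.linalg.norm(np.linalg.matrix_power(B, 60), np.inf)

print(rho, norm_inf, norm_Bk)
```

Here the norm test $||B|| < 1$ fails in the infinity norm, yet the powers of $B$ decay, which is consistent with $||B|| < 1$ being sufficient but not necessary.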

**c) iii) Show that the residual $r_k$ satisfies
$$ r_k = C^k r_0 $$
for an appropriately chosen matrix $C$; explicitly give $C$.**

$$r_{k+1} = b-Ax^{k+1} = b-A(x^k + Q^{-1}(b-Ax^k))$$
$$ = b-Ax^k - AQ^{-1}(b-Ax^k)$$
$$ = r_k - AQ^{-1}r_k $$
$$ = (I - AQ^{-1})r_k $$
Applying this recursively gives
$$r_{k+1} = (I - AQ^{-1})^{k+1} r_0 $$
Setting $C = I - AQ^{-1}$, we have shown that
$$r_{k+1} = C^{k+1} r_0 $$
for all $k \geq 0$, which together with the trivial case $k = 0$ is $r_k = C^k r_0$.

**d)** Write pseudo-code to efficiently implement the iterative method in **(b)** assuming you are given the matrix $Q$.
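
One possible implementation is sketched below in Python with NumPy rather than pseudo-code. The function name `preconditioned_iteration`, the callable `solve_Q`, and the stopping rule based on the relative residual are all assumptions made for this sketch. The key efficiency point is that $Q^{-1}$ is never formed: each iteration solves $Qz = r_k$, and for a general $Q$ one would factor $Q$ once before the loop and reuse the factors inside `solve_Q`.

```python
import numpy as np

def preconditioned_iteration(A, b, solve_Q, x0, tol=1e-10, max_iter=500):
    """Iterate x <- x + Q^{-1} (b - A x) until the residual is small.

    solve_Q(r) must return the solution z of Q z = r; Q^{-1} is never formed.
    Returns the final iterate and the number of updates performed.
    """
    x = np.array(x0, dtype=float)
    b_norm = np.linalg.norm(b)
    for k in range(max_iter):
        r = b - A @ x                          # residual r_k
        if np.linalg.norm(r) <= tol * b_norm:
            return x, k                        # converged after k updates
        x = x + solve_Q(r)                     # x^{k+1} = x^k + Q^{-1} r_k
    return x, max_iter

# Assumed demo data: diagonally dominant A with the Jacobi choice Q = diag(A),
# for which solving Q z = r is an elementwise division.
A = np.array([[4.0, 1.0, 0.0],
              [1.0, 4.0, 1.0],
              [0.0, 1.0, 4.0]])
b = np.array([1.0, 2.0, 3.0])
d = np.diag(A)
x, iters = preconditioned_iteration(A, b, lambda r: r / d, np.zeros(3))
print(iters, np.linalg.norm(b - A @ x))
```

Passing the solve as a callable keeps the loop independent of the structure of $Q$: a dense, tridiagonal, or diagonal $Q$ only changes what `solve_Q` does.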

**d) i) Give the operation count for computing $p$ iterations if $Q$ is a full matrix.**

**d) ii) Give the operation count for computing $p$ iterations if $Q$ is a tridiagonal matrix.**
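
A rough count for both cases can be sketched as follows, under assumptions the original does not state: $A$ is dense, one flop is one addition or one multiplication, $Q$ is factored once before the loop, and lower-order terms are dropped.

$$ \text{full } Q: \quad \underbrace{\tfrac{2}{3}n^3}_{\text{LU of } Q \text{, once}} + p\,\bigl(\underbrace{2n^2}_{r_k = b - Ax^k} + \underbrace{2n^2}_{\text{two triangular solves}} + \underbrace{n}_{\text{update}}\bigr) \approx \tfrac{2}{3}n^3 + 4pn^2 $$
$$ \text{tridiagonal } Q: \quad \underbrace{O(n)}_{\text{factor } Q \text{, once}} + p\,\bigl(\underbrace{2n^2}_{r_k = b - Ax^k} + \underbrace{O(n)}_{\text{tridiagonal solve}} + \underbrace{n}_{\text{update}}\bigr) \approx 2pn^2 $$

With a tridiagonal $Q$ the solve costs only $O(n)$ per iteration (Thomas algorithm), so the matrix-vector product with $A$ dominates. If $A$ is also sparse with $O(n)$ nonzeros, the whole loop costs $O(pn)$.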

**e) If $B$ and $C$ are similar matrices, explain how this affects the behaviour of $e_k$ and $r_k$.**

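If $B$ and $C$ are similar matrices, they have the same eigenvalues and hence the same spectral radius. This means $e_k$ and $r_k$ either both converge to zero for every starting guess or both fail to, and they decay at the same asymptotic rate. In fact, since $r_k = b - Ax^k = -Ae_k$, the two matrices always satisfy $C = ABA^{-1}$, so they are always similar. This is useful because in practice we do not know the true solution, so we cannot calculate $e_k$. We can track $r_k$, and if $r_k$ converges to zero, then we know $e_k$ does too.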
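
The similarity can also be checked numerically. The sketch below assumes a small hand-picked matrix `A` and the Jacobi choice `Q = diag(A)` (neither is from the original problem), and takes $C = I - AQ^{-1}$, which is what substituting the update $x^{k+1} = x^k + Q^{-1}r_k$ into $r_{k+1} = b - Ax^{k+1}$ gives. The code first confirms that this $C$ propagates the residual of an actual iteration step, then compares the spectra of $B$ and $C$.

```python
import numpy as np

# Assumed example data (not from the original problem).
A = np.array([[4.0, 1.0, 0.0],
              [1.0, 5.0, 1.0],
              [0.0, 1.0, 6.0]])
b = np.array([1.0, 2.0, 3.0])
Q = np.diag(np.diag(A))                      # Jacobi choice, for illustration only
I = np.eye(3)

B = I - np.linalg.solve(Q, A)                # error propagation:    I - Q^{-1} A
C = I - A @ np.linalg.inv(Q)                 # residual propagation: I - A Q^{-1}

# One step of the iteration, to confirm r_1 = C r_0 for this choice of C.
x0 = np.zeros(3)
r0 = b - A @ x0
x1 = x0 + np.linalg.solve(Q, r0)
r1 = b - A @ x1
step_ok = np.allclose(r1, C @ r0)

# Similarity C = A B A^{-1} and matching eigenvalues.
similar = np.allclose(C, A @ B @ np.linalg.inv(A))
eig_B = np.sort(np.linalg.eigvals(B).real)
eig_C = np.sort(np.linalg.eigvals(C).real)
same_spectrum = np.allclose(eig_B, eig_C)

print(step_ok, similar, same_spectrum)
```

Taking real parts before sorting is safe for this particular example because $B$ here is similar to a symmetric matrix, so its eigenvalues are real; for a general $Q$ the eigenvalues may be complex and should be compared as such.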