# Example: Iterative Solvers for Linear Algebraic Equations (LAEs)
This example will familiarize students with developing and using iterative solvers for systems of Linear Algebraic Equations (LAEs). We'll consider two iterative solvers: the [Jacobi](https://en.wikipedia.org/wiki/Jacobi_method) and [Gauss-Seidel](https://en.wikipedia.org/wiki/Gauss%E2%80%93Seidel_method) methods.

* [Jacobi's method](https://en.wikipedia.org/wiki/Jacobi_method) updates the estimated solution for all variables at the same time. Let the estimate of the value of $x_{i}$ at iteration $k$ be $\hat{x}_{i,k}$. Then, the solution at the next iteration $\hat{x}_{i,k+1}$ is given by:
$$
\begin{equation*}
\hat{x}_{i,k+1}=\frac{1}{a_{ii}}\bigl(b_{i}-\sum_{j=1,j\neq{i}}^{n}a_{ij}\hat{x}_{j,k}\bigr)\qquad{i=1,2,\cdots,n}
\end{equation*}
$$
* The [Gauss-Seidel method](https://en.wikipedia.org/wiki/Gauss%E2%80%93Seidel_method) updates the best estimate of $\hat{x}_{i}$ while processing equations $i=1,\cdots,n$. Let the estimate for variable $i$ at iteration $k$ be $\hat{x}_{i,k}$. Then, the solution at the next iteration $\hat{x}_{i,k+1}$ is given by:
$$
\begin{equation*}
\hat{x}_{i,k+1}=\frac{1}{a_{ii}}\bigl(b_{i}-\sum_{j=1}^{i-1}a_{ij}\hat{x}_{j,k+1}-\sum_{j=i+1}^{n}a_{ij}\hat{x}_{j,k}\bigr)\qquad{i=1,2,\cdots,n}
\end{equation*}
$$
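The two update rules above can be sketched as single sweeps over the equations. This is a minimal illustration, not the course implementation in `src/Solvers.jl`; the function names `jacobi_sweep` and `gauss_seidel_sweep` and the small test system `A_small`, `b_small` are assumptions made for this example.

```julia
# One Jacobi sweep: every component is computed from the previous iterate x only.
function jacobi_sweep(A::AbstractMatrix, b::AbstractVector, x::AbstractVector)
    n = length(b)
    xnew = similar(x)
    for i ∈ 1:n
        σ = 0.0
        for j ∈ 1:n
            if j ≠ i
                σ += A[i,j]*x[j]      # always the old value x[j]
            end
        end
        xnew[i] = (b[i] - σ)/A[i,i]
    end
    return xnew
end

# One Gauss-Seidel sweep: components 1..i-1 already hold their updated values.
function gauss_seidel_sweep(A::AbstractMatrix, b::AbstractVector, x::AbstractVector)
    n = length(b)
    xnew = copy(x)
    for i ∈ 1:n
        σ = 0.0
        for j ∈ 1:n
            if j ≠ i
                σ += A[i,j]*xnew[j]   # j < i: updated value, j > i: previous value
            end
        end
        xnew[i] = (b[i] - σ)/A[i,i]
    end
    return xnew
end

# small hypothetical test system
A_small = [4.0 1.0; 2.0 5.0]
b_small = [1.0, 2.0]
x_start = [0.0, 0.0]
x_jacobi = jacobi_sweep(A_small, b_small, x_start)
x_gs = gauss_seidel_sweep(A_small, b_small, x_start)
```

After one sweep from a zero guess, the two methods agree on the first component and differ on the second, because Gauss-Seidel already uses the updated first component when processing the second equation.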

### Learning objectives
* __Task 1__: Random Diagonally Dominant $\mathbf{A}$ and right-hand-side vector $\mathbf{b}$. In this task, we'll generate a random system matrix $\mathbf{A}$ that is diagonally dominant and a random right-hand side vector $\mathbf{b}$.
* __Task 2__: Solve the LAEs using the Jacobi and the Gauss-Seidel methods. In this task, we'll solve our system of random linear algebraic equations using the [Jacobi](https://en.wikipedia.org/wiki/Jacobi_method) and [Gauss-Seidel](https://en.wikipedia.org/wiki/Gauss%E2%80%93Seidel_method) methods.
* __Task 3__: In this task, we'll compare the runtime performance of the different iterative approaches against the Gaussian elimination method implemented by the LinearAlgebra.jl package included with Julia, using the BenchmarkTools.jl package.

## Setup
This example may use external third-party packages. In [the `Include.jl` file](Include.jl), we load our codes to access them in the notebook, set some required paths for this example, and load any required external packages.

In [3]:
include("Include.jl");

[32m[1m  Activating[22m[39m project at `~/Desktop/julia_work/CHEME-4800-5800-Examples-Fall-2024/lecture/week-7/L7a`
[32m[1m  No Changes[22m[39m to `~/Desktop/julia_work/CHEME-4800-5800-Examples-Fall-2024/lecture/week-7/L7a/Project.toml`
[32m[1m  No Changes[22m[39m to `~/Desktop/julia_work/CHEME-4800-5800-Examples-Fall-2024/lecture/week-7/L7a/Manifest.toml`
[32m[1m    Updating[22m[39m registry at `~/.julia/registries/General.toml`
[32m[1m  No Changes[22m[39m to `~/Desktop/julia_work/CHEME-4800-5800-Examples-Fall-2024/lecture/week-7/L7a/Project.toml`
[32m[1m  No Changes[22m[39m to `~/Desktop/julia_work/CHEME-4800-5800-Examples-Fall-2024/lecture/week-7/L7a/Manifest.toml`


## Task 1: Random Diagonally Dominant $\mathbf{A}$ and right-hand-side vector $\mathbf{b}$
In this task, we'll generate a random system matrix $\mathbf{A}$ that is diagonally dominant and a random right-hand side vector $\mathbf{b}$. Strict diagonal dominance is a sufficient (but not necessary) condition for the convergence of the Jacobi and Gauss-Seidel iterations. A strictly diagonally dominant system matrix $\mathbf{A}$ has the feature:
$$
\begin{equation*}
\sum_{j=1,j\neq{i}}^{n}\lvert{a_{ij}}\rvert<\lvert{a_{ii}}\rvert\qquad\forall{i}
\end{equation*}
$$

* Strict diagonal dominance is a matrix property where, in every row, the absolute value of the diagonal element is strictly greater than the sum of the absolute values of the other elements in that row. A matrix that satisfies this property is said to be strictly diagonally dominant.
* Because the condition is sufficient but not necessary, an iteration may still converge for a matrix that fails it. The condition also says nothing about the rate of convergence.

Let's start by specifying how many rows we have in the _square_ system matrix $\mathbf{A}$ in the `number_of_rows::Int64` variable:

In [5]:
number_of_rows = 5000;

Then we generate an $n\times{n}$ random system matrix $\mathbf{A}$ and an $n\times{1}$ random vector $\mathbf{b}$ [using the `randn(...)` method](https://docs.julialang.org/en/v1/stdlib/Random/#Base.randn). We add a large random positive term to each diagonal element of the test system matrix $\mathbf{A}$ to push it toward diagonal dominance.

In [7]:
A = randn(number_of_rows, number_of_rows) .+ 10*(number_of_rows)*diagm(rand(number_of_rows));
b = randn(number_of_rows);
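Because `rand` draws uniformly from $[0,1)$, the scaled diagonal term in the cell above can occasionally be smaller than a row's off-diagonal sum. If a guarantee is wanted, one option is to set each diagonal entry from its own row sum. The following is a minimal sketch under that assumption; the helper `random_strictly_dd_matrix` and its `margin` keyword are hypothetical names, not part of the course codes.

```julia
# Hypothetical helper: builds an n×n random matrix whose diagonal entry exceeds
# the sum of the absolute off-diagonal entries in every row.
function random_strictly_dd_matrix(n::Int64; margin::Float64 = 1.0)
    M = randn(n, n)
    for i ∈ 1:n
        offdiag = sum(abs, M[i, :]) - abs(M[i, i])  # sum of |a_ij| for j ≠ i
        M[i, i] = offdiag + margin                   # margin > 0 makes the inequality strict
    end
    return M
end

A_dd = random_strictly_dd_matrix(50)
```

The trade-off is that every diagonal entry is now positive and tied to its row, so the matrix is less "random" than the one generated in the notebook cell.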

In [8]:
A

5000×5000 Matrix{Float64}:
 24877.4           -1.23692        0.29121    …     -0.0783251      0.0319248
    -1.17067    31963.6            0.148463          1.04083       -0.289523
     1.84314        0.964158   34528.2               0.733883       0.165201
     1.09757       -0.828713      -1.62513          -0.0863846     -0.655511
    -0.336371      -1.43681       -0.615674          1.16371       -0.233884
     1.66643        0.490444      -0.964125   …     -1.40086       -0.526244
    -0.760686      -0.0836007      0.888143          1.10601       -0.545177
     1.60928        0.479189      -1.00491           0.383731       0.846152
    -0.129733       1.38374       -0.220397          0.523212      -0.449456
    -1.77307        2.90176        0.0935623         1.6678         0.841787
    -0.132576      -1.88773        0.133993   …      0.694049       0.275796
     0.153477      -1.30855        1.78692          -0.850957      -0.203889
    -0.92109       -2.11822        0.594018     

### Check: Is the system matrix $\mathbf{A}$ strictly diagonally dominant?
Before we continue to the solvers, let's verify the randomly generated system matrix $\mathbf{A}$ is actually diagonally dominant. We check every row of the matrix $\mathbf{A}$ and store the result of each test in the `ddcondition::Dict{Int64, Bool}` variable.

In [10]:
ddcondition = Dict{Int64,Bool}()
for i ∈ 1:number_of_rows
    aii = abs(A[i,i]);
    σ = 0.0;
    for j ∈ 1:number_of_rows
        if (i ≠ j)
            σ += abs(A[i,j]);
        end
    end
    ddcondition[i] = (aii > σ) ? true : false;
end

If any of the entries of the `ddcondition::Dict{Int64, Bool}` are `false`, then we fail this test:

In [12]:
(findall(x-> x == 0, ddcondition) |> isempty) == true

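false

The check returns `false`, so at least one row of $\mathbf{A}$ is not strictly diagonally dominant. The diagonal term is scaled by `rand(...)`, which draws from $[0,1)$, so some diagonal entries can be smaller than the sum of the off-diagonal magnitudes in their row. Diagonal dominance is sufficient but not necessary for convergence, so we continue with this matrix.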

## Task 2: Solve the LAEs using the Jacobi and the Gauss-Seidel methods
In this task, we'll solve our system of random linear algebraic equations using the [Jacobi](https://en.wikipedia.org/wiki/Jacobi_method) and [Gauss-Seidel](https://en.wikipedia.org/wiki/Gauss%E2%80%93Seidel_method) methods. First, we set an overall error tolerance (the stopping condition), the maximum number of iterations that we allow, and an initial solution guess.

In [14]:
xₒ = rand(number_of_rows); # initial solution guess
maxiterations = 100;
ϵ = 1e-6;
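To make the roles of the tolerance, the iteration limit, and the initial guess concrete, here is a minimal sketch of an iterative driver. It is not the course `solve(...)` method: the function name `jacobi_iterate`, the stopping rule (largest change between successive iterates), and the small test system are all assumptions made for illustration. Only the `Dict{Int64, Vector{Float64}}` archive keyed by iteration index mirrors the output shown in this notebook.

```julia
using LinearAlgebra

# Hypothetical driver: stores every iterate in a Dict keyed by iteration index,
# with key 0 holding the initial guess.
function jacobi_iterate(A, b, x0; ϵ::Float64 = 1e-6, maxiterations::Int64 = 100)
    D = diag(A)              # diagonal entries a_ii
    R = A - Diagonal(D)      # off-diagonal part of A
    archive = Dict{Int64, Vector{Float64}}(0 => copy(x0))
    k = 0
    while k < maxiterations
        xnew = (b - R*archive[k]) ./ D   # Jacobi update for all components at once
        k += 1
        archive[k] = xnew
        # assumed stopping rule: largest change between successive iterates
        if norm(archive[k] - archive[k-1], Inf) < ϵ
            break
        end
    end
    return archive
end

# small hypothetical test system
A_small = [4.0 1.0; 2.0 5.0]
b_small = [1.0, 2.0]
archive = jacobi_iterate(A_small, b_small, zeros(2))
x_final = archive[maximum(keys(archive))]
```

The loop stops at whichever comes first: the change between iterates dropping below `ϵ`, or `maxiterations` sweeps. A small change between iterates does not by itself guarantee a small residual, which is why the residual is checked separately.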

### Jacobi
We call [the `solve(...)` method](src/Solvers.jl) with the appropriate data, including the solver type we wish to use, which in this case is [the Jacobi method](https://en.wikipedia.org/wiki/Jacobi_method). We indicate this choice by passing [a `MyJacobiMethod` instance](src/Types.jl) to the solve routine.

In [16]:
dJM = solve(A,b,xₒ, ϵ = ϵ, maxiterations = maxiterations, algorithm = MyJacobiMethod())

Dict{Int64, Vector{Float64}} with 12 entries:
  5  => [-3.62035e-5, -1.85552e-5, 5.89392e-5, -1.03793e-5, -1.84137e-5, 4.8433…
  8  => [-3.6208e-5, -1.84088e-5, 5.89579e-5, -1.04884e-5, -1.84573e-5, 4.84945…
  1  => [-0.000703225, 0.000139073, 0.000129635, -0.00134302, -0.00171552, -0.0…
  0  => [0.319502, 0.609319, 0.223855, 0.128712, 0.189751, 0.169528, 0.0555595,…
  6  => [-3.62012e-5, -1.83921e-5, 5.89686e-5, -1.04692e-5, -1.84615e-5, 4.8493…
  11 => [-3.62079e-5, -1.84086e-5, 5.89579e-5, -1.04882e-5, -1.84581e-5, 4.8494…
  9  => [-3.62079e-5, -1.84086e-5, 5.89579e-5, -1.04882e-5, -1.84581e-5, 4.8494…
  3  => [1.09578e-5, -4.91236e-5, 4.81914e-5, -2.93489e-5, -8.72812e-5, -5.7997…
  7  => [-3.62058e-5, -1.8407e-5, 5.89547e-5, -1.04897e-5, -1.84664e-5, 4.84972…
  4  => [-3.84843e-5, -1.51789e-5, 6.10037e-5, -1.07521e-5, -1.49333e-5, 5.1693…
  2  => [-9.49517e-5, -7.10126e-5, 5.64345e-5, -0.000255346, 0.000348108, 0.000…
  10 => [-3.62079e-5, -1.84086e-5, 5.89579e-5, -1.04882e-5, -1.

#### Check: Did we meet the error condition for Jacobi?
Let's check if [the Jacobi method](https://en.wikipedia.org/wiki/Jacobi_method) met the desired error criterion. In this case, we'll check the _maximum error at the last iteration_. We compute the error for each equation and then find the worst case by magnitude. If this worst-case error is smaller than our error tolerance, we pass the test:

In [18]:
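residual = A*dJM[maximum(keys(dJM))] - b # error of each equation at the last stored iterate
@assert maximum(abs.(residual)) < ϵ       # compare magnitudes, so large negative errors also fail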

### Gauss-Seidel method
Similar to above, we call [the `solve(...)` method](src/Solvers.jl) with the appropriate data, including the solver type we wish to use, which in this case is [the Gauss-Seidel method](https://en.wikipedia.org/wiki/Gauss%E2%80%93Seidel_method). We indicate this choice by passing [a `MyGaussSeidelMethod` instance](src/Types.jl) to the solve routine.

In [20]:
dGSM = solve(A,b,xₒ, ϵ = ϵ, maxiterations = maxiterations, algorithm = MyGaussSeidelMethod())

Dict{Int64, Vector{Float64}} with 9 entries:
  0 => [0.319502, 0.609319, 0.223855, 0.128712, 0.189751, 0.169528, 0.0555595, …
  4 => [-3.57954e-5, -1.87865e-5, 5.91465e-5, -1.04242e-5, -1.82358e-5, 4.76695…
  5 => [-3.61985e-5, -1.84049e-5, 5.89636e-5, -1.04824e-5, -1.84365e-5, 4.84857…
  6 => [-3.62072e-5, -1.84082e-5, 5.89575e-5, -1.04875e-5, -1.84595e-5, 4.84948…
  2 => [-0.000218526, -7.43282e-5, 2.81591e-6, -0.000179436, 7.85241e-5, 0.0002…
  7 => [-3.62079e-5, -1.84086e-5, 5.89578e-5, -1.04882e-5, -1.84581e-5, 4.84947…
  8 => [-3.62079e-5, -1.84086e-5, 5.89579e-5, -1.04882e-5, -1.84581e-5, 4.84947…
  3 => [-3.82239e-5, -2.04218e-5, 6.22426e-5, -1.29131e-5, -2.81758e-5, 4.76947…
  1 => [-0.000703225, 0.000127345, 0.000163739, -0.00136237, -0.00179259, -0.00…

#### Check: Did we meet the error condition for Gauss-Seidel?
Let's check if [the Gauss-Seidel method](https://en.wikipedia.org/wiki/Gauss%E2%80%93Seidel_method) met the desired error criterion. In this case, we'll check the _maximum error at the last iteration_. We compute the error for each equation and then find the worst case by magnitude. If this worst-case error is smaller than our error tolerance, we pass the test:

In [22]:
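residual = A*dGSM[maximum(keys(dGSM))] - b # error of each equation at the last stored iterate
@assert maximum(abs.(residual)) < ϵ         # compare magnitudes, so large negative errors also fail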
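Since the solvers return every iterate, the same worst-case error can be tracked across iterations, not only at the last one. The sketch below assumes the archive is a `Dict{Int64, Vector{Float64}}` keyed by iteration index, as in the outputs above; the helper name `residual_history` and the hand-built three-entry archive are hypothetical.

```julia
using LinearAlgebra

# Hypothetical helper: worst-case (infinity-norm) residual for each stored iterate,
# returned in iteration order.
function residual_history(A, b, archive::Dict{Int64, Vector{Float64}})
    iterations = sort(collect(keys(archive)))   # Dict keys are unordered, so sort them
    return [norm(A*archive[k] - b, Inf) for k ∈ iterations]
end

# small hypothetical system with two hand-computed Jacobi iterates from a zero guess
A_small = [4.0 1.0; 2.0 5.0]
b_small = [1.0, 2.0]
archive_small = Dict(0 => [0.0, 0.0], 1 => [0.25, 0.4], 2 => [0.15, 0.3])
history = residual_history(A_small, b_small, archive_small)
```

Calling `residual_history(A, b, dJM)` and `residual_history(A, b, dGSM)` on the notebook's results would give one curve per method, which makes the difference in iteration counts (12 versus 9 stored entries) easier to interpret.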

## Task 3: How well do these algorithms scale?
In this task, we'll compare the runtime performance of the different iterative approaches against [the Gaussian elimination method](https://en.wikipedia.org/wiki/Gaussian_elimination) implemented by the [LinearAlgebra.jl package included with Julia](https://docs.julialang.org/en/v1/stdlib/LinearAlgebra/#man-linalg) using [the BenchmarkTools.jl package](https://github.com/JuliaCI/BenchmarkTools.jl). For this dense system, we expect [the Gaussian elimination method](https://en.wikipedia.org/wiki/Gaussian_elimination) to be faster than the two iterative methods.
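For reference, a direct solve with LinearAlgebra.jl can be sketched as follows. How `MyGaussianEliminationMethod` is implemented is not shown in this notebook, so treating it as a wrapper around one of these calls is an assumption; the small test system is also hypothetical.

```julia
using LinearAlgebra

# small hypothetical test system
A_small = [4.0 1.0; 2.0 5.0]
b_small = [1.0, 2.0]

# Backslash on a dense square matrix solves via LU factorization with partial pivoting.
x_backslash = A_small \ b_small

# The factorization can also be computed once and reused for several right-hand sides.
F = lu(A_small)
x_lu = F \ b_small
```

Reusing the factorization `F` matters when many right-hand sides share the same system matrix, since the expensive elimination step is done only once.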

#### Jacobi

In [25]:
let
    @benchmark solve(A,b,xₒ, ϵ = 1e-6, maxiterations = 100, algorithm = MyJacobiMethod()) setup=(A=$A,b=$b,xₒ=$xₒ)
end

BenchmarkTools.Trial: 5 samples with 1 evaluation.
 Range (min … max):  1.197 s … 1.215 s   ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     1.202 s             ┊ GC (median):    0.00%
 Time  (mean ± σ):   1.205 s ± 6.885 ms  ┊ GC (mean ± σ):  0.00% ± 0.00%

#### Gauss-Seidel

In [27]:
let
    @benchmark solve(A,b,xₒ, ϵ = 1e-6, maxiterations = 100, algorithm = MyGaussSeidelMethod()) setup=(A=$A,b=$b,xₒ=$xₒ)
end

BenchmarkTools.Trial: 6 samples with 1 evaluation.
 Range (min … max):  900.732 ms … 937.961 ms  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     911.434 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   914.288 ms ±  14.141 ms  ┊ GC (mean ± σ):  0.00% ± 0.00%

#### Gaussian elimination

In [29]:
let
    @benchmark solve(A,b,xₒ, ϵ = 1e-6, maxiterations = 100, algorithm = MyGaussianEliminationMethod()) setup=(A=$A,b=$b,xₒ=$xₒ)
end

BenchmarkTools.Trial: 17 samples with 1 evaluation.
 Range (min … max):  295.172 ms … 359.439 ms  ┊ GC (min … max): 0.00% … 15.63%
 Time  (median):     308.343 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   310.776 ms ±  13.764 ms  ┊ GC (mean ± σ):  1.91% ±  3.73%