## Why do I need an additional library?

Let's start by using plain Python lists (lists of lists) to multiply two square matrices.

In [1]:
def list_dgemm(A, B):
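    """Double-precision general matrix-matrix multiply (DGEMM) on nested lists
    A : m x n
    B : n x p
    Returns C : m x p
    """
    # Preallocate the m x p result: len(A) rows, each with len(B[0]) columns
    C = [[0.0]*len(B[0]) for _ in range(len(A))]

    for i in range(len(A)):             # rows of A
        for j in range(len(B[0])):      # columns of B
            for k in range(len(A[0])):  # shared dimension n
                C[i][j] += A[i][k]*B[k][j]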
        
    return C            
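
The docstring describes general `m x n` by `n x p` shapes, so it is worth exercising the triple loop on a small non-square case as well. The sketch below is self-contained; the name `matmul_lists` and the sample matrices are illustrative assumptions, not part of the notebook.

```python
def matmul_lists(A, B):
    """Multiply A (m x n) by B (n x p), both stored as nested lists.

    Hypothetical helper written for this example only.
    """
    m, n, p = len(A), len(B), len(B[0])
    # Result has m rows and p columns
    C = [[0.0] * p for _ in range(m)]
    for i in range(m):
        for j in range(p):
            for k in range(n):
                C[i][j] += A[i][k] * B[k][j]
    return C

A = [[1.0, 2.0, 3.0],
     [4.0, 5.0, 6.0]]   # 2 x 3
B = [[1.0, 0.0],
     [0.0, 1.0],
     [1.0, 1.0]]        # 3 x 2

C = matmul_lists(A, B)
print(C)  # [[4.0, 5.0], [10.0, 11.0]]
```

A square test such as the 50x50 one used here cannot tell rows and columns apart, so a rectangular input is a cheap way to catch swapped dimensions.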

Let's create the basic data structures required for this case.

In [2]:
# Problem size
N = 50

import random
# Generate a one-dimensional list of N numbers (replaced by the 2D version below)
A = [random.uniform(1.5, 1.9) for _ in range(N)]

# Generate two-dimensional N x N lists (each inner list is one row)
A = [[random.uniform(1.5, 1.9) for _ in range(N)] for _ in range(N)]
B = [[random.uniform(1.5, 1.9) for _ in range(N)] for _ in range(N)]

# Print dimensions
print("The matrix size is {0}x{1}".format(len(A), len(A[0])))

The matrix size is 50x50


### First let's verify that we (or `numpy`) get the right answers.

Create the data for `numpy` first.

In [3]:
import numpy as np
np_A = np.array(A)
np_B = np.array(B)

In [4]:
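# Multiply once with numpy and once with the list-based version
np_C = np_A @ np_B
C = list_dgemm(A, B)
# Infinity norm of the difference (largest absolute row sum)
np.linalg.norm(np.array(C) - np_C, np.inf)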

6.252776074688882e-13
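
The number above is an absolute error, so its size depends on how large the matrix entries are. A scale-independent check can be sketched with a relative error or with `np.allclose`. The example below is a minimal sketch on its own small 4x4 data; the names `X`, `Y`, `ref`, and `loop` are assumptions made for illustration.

```python
import numpy as np

# Small reproducible test matrices in the same value range as the notebook
rng = np.random.default_rng(0)
X = rng.uniform(1.5, 1.9, size=(4, 4))
Y = rng.uniform(1.5, 1.9, size=(4, 4))

# Reference result from numpy
ref = X @ Y

# Same product computed with plain Python loops over nested lists
Xl, Yl = X.tolist(), Y.tolist()
loop = [[sum(Xl[i][k] * Yl[k][j] for k in range(4)) for j in range(4)]
        for i in range(4)]

# Absolute and relative error in the infinity norm
abs_err = np.linalg.norm(np.array(loop) - ref, np.inf)
rel_err = abs_err / np.linalg.norm(ref, np.inf)

# allclose compares element-wise with default relative/absolute tolerances
print(np.allclose(loop, ref))
```

Small differences between the two results are expected, since the two implementations need not add the terms in the same order.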

Now let's time the list-based version.

In [5]:
%timeit C = list_dgemm(A,B)

26.2 ms ± 1.52 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
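
`%timeit` is an IPython magic, so it only works inside a notebook or IPython shell. In a plain Python script, a similar measurement can be sketched with the standard-library `timeit` module. The helper name `matmul_lists` and the repeat counts below are assumptions chosen for this example, and the timings will differ from machine to machine.

```python
import random
import timeit

def matmul_lists(A, B):
    """Triple-loop matrix multiply on nested lists (hypothetical helper)."""
    m, n, p = len(A), len(B), len(B[0])
    C = [[0.0] * p for _ in range(m)]
    for i in range(m):
        for j in range(p):
            for k in range(n):
                C[i][j] += A[i][k] * B[k][j]
    return C

N = 50
A = [[random.uniform(1.5, 1.9) for _ in range(N)] for _ in range(N)]
B = [[random.uniform(1.5, 1.9) for _ in range(N)] for _ in range(N)]

# 3 runs of 5 calls each; timeit.repeat returns the total seconds per run
number = 5
times = timeit.repeat(lambda: matmul_lists(A, B), repeat=3, number=number)

# Convert to milliseconds per call and report the fastest run
per_call_ms = [1e3 * t / number for t in times]
best_ms = min(per_call_ms)
print(f"best of {len(times)} runs: {best_ms:.2f} ms per call")
```

Reporting the minimum rather than the mean is a common choice here, because slower runs usually reflect other activity on the machine rather than the code itself.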


That's **frustratingly** slow for a 50x50 problem. Let's look at `numpy` magic!

Let's see how long the same product takes with `numpy`.

In [6]:
%timeit np_C = np_A@np_B

12.9 µs ± 153 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

That is roughly a 2000x speedup (26.2 ms versus 12.9 µs) for the same 50x50 product.


Show in class: https://jekel.me/2017/Python-with-Numba-faster-than-fortran/