# Numpy (Numerical Python)
- A library for fast array computing and linear algebra in Python
- Numpy is the basis of many Python libraries
- Its core is implemented in C, so array operations are fast

## The Numpy library is extensive; here are a few of its methods and functions
- Numpy arrays: creation methods
- Numpy arrays: manipulation methods
- Mathematical operations on Numpy arrays
- Matrix and vector operations
- Sorting methods
- Searching methods
- Statistical methods

In [279]:
import numpy as np

# Numpy Arrays Creation Methods

### We can cast a python list into a 1D numpy array

In [280]:
#array
my_list = [0,1,2]
print(type(my_list))
arr = np.array(my_list)
print(type(arr))
arr

<class 'list'>
<class 'numpy.ndarray'>


array([0, 1, 2])

A multidimensional numpy array can be made from a python list of lists

In [60]:
my_mat = [[1,2,3],[4,5,6],[7,8,9]]
print(type(my_mat))
my_mat

<class 'list'>


[[1, 2, 3], [4, 5, 6], [7, 8, 9]]

In [61]:
arr_multi = np.array(my_mat)
print(type(arr_multi))
arr_multi

<class 'numpy.ndarray'>


array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

**Double** brackets indicate it is a 2D array. **Single** brackets indicate 1D array.
The data type can also be set with the **dtype** argument:

In [62]:
arr_multi = np.array(my_mat,dtype = np.float32)
arr_multi

array([[1., 2., 3.],
       [4., 5., 6.],
       [7., 8., 9.]], dtype=float32)
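The data type of an existing array can also be changed after creation. A minimal sketch, assuming the **astype** method is what you want (the array values here are made up for illustration):

```python
import numpy as np

floats = np.array([1.7, 2.2, -0.5])   # illustrative values
print(floats.dtype)                   # float64

# astype returns a new array; converting float to int truncates toward zero.
ints = floats.astype(np.int32)
print(ints)                           # [1 2 0]
print(ints.dtype)                     # int32
```

Note that the original array is left unchanged; **astype** always hands back a converted copy by default.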

### Copying a numpy array

In [256]:
#start here
arr = np.arange(1,11)
arr

array([ 1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [258]:
arr_new = arr
arr_new

array([ 1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [259]:
arr_new[0:3]= -1
arr_new

array([-1, -1, -1,  4,  5,  6,  7,  8,  9, 10])

In [260]:
arr

array([-1, -1, -1,  4,  5,  6,  7,  8,  9, 10])

Assigning an array to a new name does not copy it: both names refer to the same array, which saves memory with large arrays. Any modification made through the new name therefore also applies to the original array. To make a second, independent array, use the **copy** method.

In [261]:
arr_new = arr.copy()
arr_new

array([-1, -1, -1,  4,  5,  6,  7,  8,  9, 10])

In [262]:
arr_new[0:3] = 0
print('arr_new: ', arr_new)
print('arr: ', arr)

arr_new:  [ 0  0  0  4  5  6  7  8  9 10]
arr:  [-1 -1 -1  4  5  6  7  8  9 10]
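The same sharing applies to slices. A short sketch, assuming you want to check whether two arrays share memory; it uses **np.shares_memory**, and the variable names are only for illustration:

```python
import numpy as np

arr = np.arange(1, 11)

# Basic slicing returns a view: it shares memory with the original array.
view = arr[0:3]
print(np.shares_memory(arr, view))         # True

# copy() allocates new memory, so the two arrays are independent.
independent = arr[0:3].copy()
print(np.shares_memory(arr, independent))  # False

view[:] = -1          # also changes arr
independent[:] = 99   # leaves arr untouched
print(arr)            # [-1 -1 -1  4  5  6  7  8  9 10]
```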


### Create 0's and 1's

In [63]:
np.zeros(3)

array([0., 0., 0.])

In [64]:
np.zeros((2,3)) #(row,col)

array([[0., 0., 0.],
       [0., 0., 0.]])

In [65]:
np.ones(3)

array([1., 1., 1.])

In [66]:
3*np.ones(3)

array([3., 3., 3.])

### Identity matrices:

In [67]:
np.eye(3)

array([[1., 0., 0.],
       [0., 1., 0.],
       [0., 0., 1.]])

### Equally spaced arrays
Create equally spaced array with a fixed step size using **arange**
>arange(start, stop, step_size)

By default start = 0, step_size = 1, and stop value is not included in the final array



Array from 0 to 9 with step size of 1 (the stop value 10 is not included)

In [68]:
np.arange(10)

array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

Array from 5 to 10 with step size 1 (the stop value 11 is not included)

In [69]:
np.arange(5,11)

array([ 5,  6,  7,  8,  9, 10])

array from 2 to 20 with step size = 2

In [70]:
np.arange(2,21,2)

array([ 2,  4,  6,  8, 10, 12, 14, 16, 18, 20])

### Equally spaced array with specific size
Another useful method is **linspace**, which creates a specified number of equally spaced points between start and stop (unlike arange, the stop value is included by default)
> linspace(start,stop,npoints)

In [71]:
np.linspace(0,5,10) 

array([0.        , 0.55555556, 1.11111111, 1.66666667, 2.22222222,
       2.77777778, 3.33333333, 3.88888889, 4.44444444, 5.        ])
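A small sketch of two optional **linspace** arguments that may be useful, **endpoint** and **retstep**; the variable names are only illustrative:

```python
import numpy as np

# By default the stop value is included.
with_stop = np.linspace(0, 5, 10)

# endpoint=False excludes it, which mirrors the half-open behaviour of arange.
without_stop = np.linspace(0, 5, 10, endpoint=False)

# retstep=True also returns the spacing between consecutive points.
points, step = np.linspace(0, 5, 10, retstep=True)

print(with_stop[-1])     # 5.0
print(without_stop[-1])  # 4.5
print(step)              # spacing is 5/9 when the endpoint is included
```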

### Random Numbers

Numpy has many methods for generating random numbers. The methods are accessible from numpy.random

- rand: uniform distribution on [0, 1)
- randn: standard normal distribution (mean 0, standard deviation 1)
- randint: random integer from [start,stop) *stop value not included*
- many more

In [72]:
np.random.rand(5)

array([0.80208552, 0.1477598 , 0.32110476, 0.64703195, 0.1319932 ])

In [73]:
np.random.rand(5,5)

array([[0.24288963, 0.24088645, 0.49078052, 0.40815923, 0.02164457],
       [0.57375798, 0.83427503, 0.06259815, 0.67427884, 0.84619799],
       [0.30220699, 0.35654746, 0.62982726, 0.46864688, 0.2431337 ],
       [0.85919018, 0.03167595, 0.27427865, 0.57697677, 0.39457703],
       [0.38320467, 0.83516202, 0.33393762, 0.49968275, 0.5897799 ]])

In [74]:
np.random.randn(5)

array([ 0.19220863,  0.55772632,  0.71025322,  0.22301859, -0.83222692])

In [75]:
np.random.randn(5,5)

array([[-5.78624669e-01, -6.99372023e-01,  1.06460744e-01,
        -2.28612136e-01,  1.32324414e+00],
       [ 1.24146305e-01,  3.17491812e-01, -5.45996846e-01,
        -2.90761484e-01, -7.19369573e-05],
       [-5.79642022e-01, -1.72368835e+00,  7.67087194e-01,
         9.16134046e-02, -4.47872157e-01],
       [-2.95606602e-02, -6.25805553e-01, -1.69708074e-01,
         4.20267099e-01,  8.66917140e-01],
       [ 2.77074209e-01, -7.02982611e-01,  2.08967956e-01,
         3.73537185e-01,  3.21926448e-01]])

In [76]:
np.random.randint(1,100,3) #100 not included

array([18, 79, 88])
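Newer Numpy versions (1.17 and later) also offer a generator object, created with **np.random.default_rng**, as an alternative to the functions above. A hedged sketch of the rough equivalents; the seed value 42 is arbitrary:

```python
import numpy as np

rng = np.random.default_rng(seed=42)      # seed value is arbitrary

uniform = rng.random(5)                   # uniform on [0, 1), like rand
normal = rng.standard_normal((2, 3))      # standard normal, like randn
integers = rng.integers(1, 100, size=3)   # 100 not included, like randint

# A generator created with the same seed reproduces the same numbers.
repeat = np.random.default_rng(seed=42).random(5)
print(np.array_equal(uniform, repeat))    # True
```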

# Numpy Array Manipulation Methods
### Shape

Determine the shape of a numpy array using
> np.shape()

In [77]:
a = np.ones( (2,3) )
print(a)
print('shape of a (row, col): ',np.shape(a))


[[1. 1. 1.]
 [1. 1. 1.]]
shape of a (row, col):  (2, 3)


### Reshape

We can reshape our arrays, i.e. change the number of rows and columns. We just need to keep the product of rows and columns equal to the original array size.

In [78]:
arr = np.arange(12)
arr

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11])

In [79]:
arr.reshape(3,4)

array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11]])

In [80]:
arr.reshape(4,3)

array([[ 0,  1,  2],
       [ 3,  4,  5],
       [ 6,  7,  8],
       [ 9, 10, 11]])

In [81]:
arr.reshape(4,5) 

ValueError: cannot reshape array of size 12 into shape (4,5)
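One dimension can be left for Numpy to work out by passing -1. A minimal sketch (variable names are illustrative):

```python
import numpy as np

arr = np.arange(12)

# -1 asks Numpy to infer that dimension from the array size.
three_rows = arr.reshape(3, -1)
six_cols = arr.reshape(-1, 6)

print(three_rows.shape)  # (3, 4)
print(six_cols.shape)    # (2, 6)

# A shape that cannot hold exactly arr.size elements still raises ValueError.
try:
    arr.reshape(5, -1)
except ValueError as err:
    print("reshape failed:", err)
```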

You can transpose an array with the **transpose** method:

In [281]:
arr.reshape(4,3).transpose()

array([[ 0,  3,  6,  9],
       [ 1,  4,  7, 10],
       [ 2,  5,  8, 11]])

### Concatenate Multiple numpy arrays

In [None]:
## Concatenate Row-wise
a = np.array([[1, 2], [3, 4]])
b = np.array([[5, 6]])
np.concatenate((a, b), axis=0)

In [None]:
## Concatenate Column-wise
np.concatenate((a, b.transpose()), axis=1)

In [83]:
## Concatenate to generate a flat NumPy Array

a = np.array([[1, 2], [3, 4]])
b = np.array([[5, 6]])
np.concatenate((a, b), axis=None)

array([1, 2, 3, 4, 5, 6])
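For reference, a sketch of what the row-wise and column-wise concatenations produce for the same `a` and `b`; the names `rows` and `cols` are only for illustration:

```python
import numpy as np

a = np.array([[1, 2], [3, 4]])   # shape (2, 2)
b = np.array([[5, 6]])           # shape (1, 2)

rows = np.concatenate((a, b), axis=0)    # b is appended as a new row
cols = np.concatenate((a, b.T), axis=1)  # b.T has shape (2, 1): a new column

print(rows.shape)  # (3, 2)
print(cols.shape)  # (2, 3)
print(rows)
print(cols)
```

All dimensions except the one being joined must match, which is why `b` has to be transposed for the column-wise case.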

### Flatten numpy array
- use **flatten( )** to collapse an np.array into a single dimension

In [86]:
a = np.array([[1,2], [3,4]])
print(a)

[[1 2]
 [3 4]]


In [88]:
a.flatten()

array([1, 2, 3, 4])

### Determine unique elements of numpy array
- Use **np.unique( )**

In [89]:
a = np.array([[1, 2], [2, 3]])
np.unique(a)

array([1, 2, 3])

In [91]:
#Unique rows
a = np.array([[1, 2, 3], [1, 2, 3], [2, 3, 4]])
print(a)

np.unique(a, axis=0)

[[1 2 3]
 [1 2 3]
 [2 3 4]]


array([[1, 2, 3],
       [2, 3, 4]])

In [95]:
#unique columns
a = np.array([[1, 1, 3], [1, 1, 3], [1, 1, 4]])
print(a)

np.unique(a, axis=1)

[[1 1 3]
 [1 1 3]
 [1 1 4]]


array([[1, 3],
       [1, 3],
       [1, 4]])
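**np.unique( )** can also report how often each value occurs via **return_counts**. A short sketch with made-up data:

```python
import numpy as np

a = np.array([3, 1, 2, 3, 3, 1])   # illustrative data

values, counts = np.unique(a, return_counts=True)
print(values)  # [1 2 3]
print(counts)  # [2 1 3]

# The most frequent value is the one with the largest count.
most_common = values[np.argmax(counts)]
print(most_common)  # 3
```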

### Numpy array to Python lists
- Use **tolist( )**

In [99]:
a = np.array([[1, 1, 3], [1, 1, 3], [1, 1, 4]])
print(type(a))

aa = a.tolist()
print(type(aa))

<class 'numpy.ndarray'>
<class 'list'>


# Numpy Operations

Numpy provides many mathematical functions (see https://numpy.org/doc/1.23/reference/routines.math.html)

### Trig. Functions 

In [100]:
a = np.array([1,2,3])
print("Trigonometric Sine   :", np.sin(a))
print("Trigonometric Cosine :", np.cos(a))
print("Trigonometric Tangent:", np.tan(a))

Trigonometric Sine   : [0.84147098 0.90929743 0.14112001]
Trigonometric Cosine : [ 0.54030231 -0.41614684 -0.9899925 ]
Trigonometric Tangent: [ 1.55740772 -2.18503986 -0.14254654]


### Rounding arrays

Each element of an array can be rounded:
- up using np.ceil( )
- down using np.floor( )
- to the nearest integer using np.rint( )
- to a specific decimal place using np.round( )

In [109]:
a = np.linspace(1, 2, 7)
print(a)
print(np.ceil(a))
print(np.floor(a))
print(np.rint(a))
print(np.round(a,2)) #round to 2 decimal places (np.round_ was removed in Numpy 2.0)

[1.         1.16666667 1.33333333 1.5        1.66666667 1.83333333
 2.        ]
[1. 2. 2. 2. 2. 2. 2.]
[1. 1. 1. 1. 1. 1. 2.]
[1. 1. 1. 2. 2. 2. 2.]
[1.   1.17 1.33 1.5  1.67 1.83 2.  ]
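One detail worth knowing: values exactly halfway between two integers are rounded to the nearest even integer, not always up. A small sketch with illustrative values; the `floor(x + 0.5)` line is one possible workaround if "round half up" is wanted, not a Numpy convention:

```python
import numpy as np

halves = np.array([0.5, 1.5, 2.5, 3.5])

# rint and round both send exact halves to the nearest even integer.
print(np.rint(halves))         # [0. 2. 2. 4.]
print(np.round(halves))        # [0. 2. 2. 4.]

# One way to round halves up instead.
print(np.floor(halves + 0.5))  # [1. 2. 3. 4.]
```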


### Exponents and Logs
- The meaning of $log$ is not standardized: depending on the field it can denote the base-10 or the natural logarithm. In numpy
> np.log()

is the natural log, $ln$

- Calculate the element-wise natural log with **np.log( )**
- Calculate the element-wise exponential with **np.exp( )**

In [211]:
a = np.arange(1,6)
print(a)

print(np.log(a).round(2))
print(np.exp(a).round(2))

[1 2 3 4 5]
[0.   0.69 1.1  1.39 1.61]
[  2.72   7.39  20.09  54.6  148.41]
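For other bases, Numpy has **np.log10( )** and **np.log2( )**; an arbitrary base can be obtained with the change-of-base rule. A minimal sketch (the base 3 and the input values are just examples):

```python
import numpy as np

x = np.array([1.0, 10.0, 100.0])
print(np.log10(x))                          # [0. 1. 2.]
print(np.log2(np.array([1.0, 2.0, 8.0])))   # [0. 1. 3.]

# Change of base: log_b(x) = ln(x) / ln(b). Base 3 is an arbitrary example.
base = 3.0
log_base3 = np.log(np.array([1.0, 3.0, 9.0])) / np.log(base)
print(log_base3.round(2))

# exp and log are inverses of each other (up to floating point error).
print(np.allclose(np.log(np.exp(x)), x))    # True
```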


### Sum of array elements
Calculate array element sums using **np.sum( )**

In [217]:
a = np.array([[1, 2], [3, 4]])
print(a)

print('sum along columns: ',a.sum(axis=0))
print('sum along rows: ',a.sum(axis=1))
print('sum along columns and rows: ', a.sum())

[[1 2]
 [3 4]]
sum along columns:  [4 6]
sum along rows:  [3 7]
sum along columns and rows:  10


### Product of array elements
Calculate the product of array elements using **np.prod( )**

In [218]:
a = np.array([[1, 2], [3, 4]])
print(a)

print('product along columns: ',a.prod(axis=0))
print('product along rows: ',a.prod(axis=1))
print('product along columns and rows: ', a.prod())

[[1 2]
 [3 4]]
product along columns:  [3 8]
product along rows:  [ 2 12]
product along columns and rows:  24


### Square Root of array elements
Calculate the element-wise square root using **np.sqrt( )**

In [221]:
a = np.array([[1, 2], [3, 4]])
print(a)

print('square root of a: ', np.sqrt(a))

[[1 2]
 [3 4]]
square root of a:  [[1.         1.41421356]
 [1.73205081 2.        ]]


# Matrix and Vector Operations

### Dot Product (Scalar Product)
- np.dot( )

In [226]:
a = np.array([1, 2, 3])
b = np.array([1, 1, 1])
print(a)
print(b)
np.dot(a, b)

[1 2 3]
[1 1 1]


6

### Cross Product
- np.cross( )

In [228]:
print(a)
print(b)
print(np.cross(a,b))
print(np.cross(b,a))

[1 2 3]
[1 1 1]
[-1  2 -1]
[ 1 -2  1]


### Matrix Multiplication
- np.matmul( )

In [230]:
a = np.array([[1, 2], [3, 4]])
b = np.array([[1, 1], [1, 1]])
print(a)
print(b)
np.matmul(a, b)

[[1 2]
 [3 4]]
[[1 1]
 [1 1]]


array([[3, 3],
       [7, 7]])
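The `@` operator is shorthand for matrix multiplication on arrays, while `*` multiplies element by element. A short sketch contrasting the two with the same `a` and `b`:

```python
import numpy as np

a = np.array([[1, 2], [3, 4]])
b = np.array([[1, 1], [1, 1]])

matrix_product = a @ b        # same result as np.matmul(a, b)
elementwise_product = a * b   # multiplies matching entries only

print(matrix_product)
print(elementwise_product)
```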

### Vector Norms
- np.linalg.norm( )

In [233]:
a = np.arange(-4, 5)
print(a)
print(np.linalg.norm(a)) ## L2 norm: sqrt(sum(|x|^2))

print(np.linalg.norm(a, 1)) ## L1 norm: sum(|x|)


[-4 -3 -2 -1  0  1  2  3  4]
7.745966692414834
20.0
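The norm is the ingredient needed to normalize a vector, i.e. scale it to unit length. A minimal sketch, assuming the L2 norm is the one wanted; `normalize` is a hypothetical helper, not a Numpy function:

```python
import numpy as np

def normalize(vec):
    """Hypothetical helper: return vec scaled to unit L2 norm."""
    length = np.linalg.norm(vec)
    if length == 0:
        return vec  # the zero vector has no direction; returned unchanged
    return vec / length

v = np.array([3.0, 4.0])       # illustrative vector with L2 norm 5
unit = normalize(v)
print(unit)                    # [0.6 0.8]
print(np.linalg.norm(unit))    # approximately 1
```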


# Sorting Methods

### Sort a Numpy array
- Use **np.sort( )**, which returns a sorted copy (the **ndarray.sort( )** method sorts the array in place instead)

In [238]:
a = np.array([[1,4],[3,1]])

print('a: ',a)

print('1: ',np.sort(a)) ## default: sort along the last axis (within each row)

print('2: ',np.sort(a, axis=None)) ## sort the flattened array

print('3: ',np.sort(a, axis=0)) ## sort within each column

print('4: ',np.sort(a, axis=1)) ## sort within each row


a:  [[1 4]
 [3 1]]
1:  [[1 4]
 [1 3]]
2:  [1 1 3 4]
3:  [[1 1]
 [3 4]]
4:  [[1 4]
 [1 3]]


### Order of indices in sorted Numpy array
- Return the order of indices that would sort the array using **np.argsort( )**

In [239]:
x = np.array([3, 1, 2])
np.argsort(x)

array([1, 2, 0])
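A typical use of **np.argsort( )** is to reorder one array according to the values of another. A short sketch; the `scores` and `names` arrays are made up for illustration:

```python
import numpy as np

scores = np.array([3, 1, 2])
names = np.array(["c", "a", "b"])   # illustrative labels, one per score

order = np.argsort(scores)          # indices that sort scores ascending

print(scores[order])                # [1 2 3]
print(names[order])                 # ['a' 'b' 'c']

# Reversing the index order gives a descending sort.
print(scores[order[::-1]])          # [3 2 1]
```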

In [165]:
np.random.seed(10) #any int
np.random.randint(0,10,10)

array([9, 4, 0, 1, 9, 0, 1, 8, 9, 0])

# Searching Methods

### Indices corresponding to maximum values
- np.argmax( ) returns the index of the first occurrence of the maximum value, either in the flattened array or along a particular axis

In [242]:
a = np.random.randint(1, 20, 10).reshape(2,5)
print(a)

print(np.argmax(a)) ## index in a flattened array


print(np.argmax(a, axis=0)) ## row index of the maximum in each column

print(np.argmax(a, axis=1)) ## column index of the maximum in each row

[[19 14 12 11 10]
 [16 19 17  8 12]]
0
[0 1 1 0 1]
[0 1]
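The flattened index can be converted back to a (row, column) position with **np.unravel_index( )**. A minimal sketch using a small made-up array rather than the random one above:

```python
import numpy as np

a = np.array([[3, 7, 1],
              [9, 2, 9]])   # illustrative data; the maximum 9 appears twice

flat_index = np.argmax(a)   # first occurrence in the flattened array
row, col = np.unravel_index(flat_index, a.shape)

print(int(flat_index), int(row), int(col))  # 3 1 0
print(a[row, col])                          # 9
```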


### Find indices corresponding to minimum values
- Similar to argmax, use **np.argmin( )**

In [243]:
a = np.random.randint(1, 20, 10).reshape(2,5)
print(a)

print(np.argmin(a)) ## index in a flattened array


print(np.argmin(a, axis=0)) ## row index of the minimum in each column

print(np.argmin(a, axis=1)) ## column index of the minimum in each row

[[18 15  8 12  2]
 [ 1 13  6  5  8]]
5
[1 1 1 1 0]
[4 0]


### Search based on condition
- **np.where( )** can be used to select between two arrays based on a condition

In [245]:
a = np.random.randint(-10, 10, 10)
print(a)

b = np.where(a < 0, 0, a)
print(b)

#if element < 0:
#    return 0
#else:
#    return element


[  6   2 -10   7   5   2  -2   0   3   1]
[6 2 0 7 5 2 0 0 3 1]
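When called with only the condition, **np.where( )** returns the indices where it holds instead of choosing between values. A short sketch, reusing the numbers printed above as fixed data:

```python
import numpy as np

a = np.array([6, 2, -10, 7, 5, 2, -2, 0, 3, 1])

# With a single argument, np.where returns a tuple of index arrays,
# one per dimension; a 1D array gives a tuple with one entry.
(negative_idx,) = np.where(a < 0)

print(negative_idx)      # [2 6]
print(a[negative_idx])   # the negative values themselves
```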


### More Conditionals

In [250]:
arr = np.arange(1,11)
arr

array([ 1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [251]:
bool_arr = arr > 5
bool_arr

array([False, False, False, False, False,  True,  True,  True,  True,
        True])

In [252]:
arr[bool_arr]

array([ 6,  7,  8,  9, 10])

In [253]:
arr

array([ 1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [254]:
arr[arr>5]

array([ 6,  7,  8,  9, 10])
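Several conditions can be combined with `&` (and), `|` (or) and `~` (not). A minimal sketch; note that each condition needs its own parentheses, because these operators bind more tightly than the comparisons:

```python
import numpy as np

arr = np.arange(1, 11)

between = arr[(arr > 3) & (arr < 8)]    # both conditions hold
outside = arr[(arr < 3) | (arr > 8)]    # either condition holds
not_large = arr[~(arr > 5)]             # condition does not hold

print(between)     # [4 5 6 7]
print(outside)     # [ 1  2  9 10]
print(not_large)   # [1 2 3 4 5]
```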

# Statistical Methods
- Mean
- Median
- Standard Deviation

### Mean

In [278]:
a = np.array([[1, 2], [3, 4]])
print(a)
np.mean(a)

[[1 2]
 [3 4]]


2.5

In [270]:
np.mean(a, axis = 1) ## along the row axis

array([1.5, 3.5])

In [271]:
np.mean(a, axis = 0) ## along the column axis

array([2., 3.])

### Median

In [272]:
np.median(a)

2.5

In [273]:
np.median(a,axis = 1) ## along the row axis

array([1.5, 3.5])

In [274]:
np.median(a, axis = 0) ## along the column axis

array([2., 3.])

### Standard Deviation

In [275]:
np.std(a)

1.118033988749895

In [276]:
np.std(a,axis = 1) # along the row axis

array([0.5, 0.5])

In [277]:
np.std(a, axis = 0)# along the column axis

array([1., 1.])
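By default **np.std( )** computes the population standard deviation (dividing by N). If the sample standard deviation (dividing by N - 1) is wanted, the **ddof** argument can be set to 1. A short sketch with the same `a`:

```python
import numpy as np

a = np.array([[1, 2], [3, 4]])

population_std = np.std(a)       # divides by N
sample_std = np.std(a, ddof=1)   # divides by N - 1

# The population value written out by hand, for comparison.
manual = np.sqrt(np.mean((a - a.mean()) ** 2))

print(population_std)
print(sample_std)
print(np.isclose(manual, population_std))  # True
```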