# NumPy 

NumPy (sometimes written Numpy) is a linear algebra library for Python. It is important for data science with Python because almost all of the libraries in the PyData ecosystem rely on NumPy as one of their main building blocks.

NumPy is also fast, because it has bindings to C libraries. For more on why you would use arrays instead of lists, see this [StackOverflow post](http://stackoverflow.com/questions/993984/why-numpy-instead-of-python-lists).
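As a rough illustration of how array code differs from list code, the sketch below doubles every element both ways. The variable names are made up for this example, and no timings are shown because they depend on the machine.

```python
import numpy as np

# Hypothetical data, for illustration only
values = list(range(5))

# Plain Python: visit the list one element at a time
doubled_list = [v * 2 for v in values]

# NumPy: the multiplication is applied to the whole array at once
doubled_arr = np.array(values) * 2

print(doubled_list)  # [0, 2, 4, 6, 8]
print(doubled_arr)   # [0 2 4 6 8]
```

Both versions give the same numbers. The array version avoids an explicit Python loop.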

We will only cover the basics of NumPy. To get started, we need to import it (install it first if it is not already available).

In [1]:
# To use numpy 
import numpy as np

## NumPy Arrays

NumPy arrays essentially come in two flavors: vectors and matrices.
Vectors are strictly 1-d arrays and matrices are 2-d (although a matrix can still have only one row or one column).

In [2]:
# Creating Numpy Arrays
# From a python list
my_list = [1, 2, 3]
my_list

[1, 2, 3]

In [3]:
np.array(my_list)

array([1, 2, 3])

In [4]:
my_matrix = [[1, 2, 3], [4, 5, 6], [7, 8 , 9]]
my_matrix

[[1, 2, 3], [4, 5, 6], [7, 8, 9]]

In [5]:
np.array(my_matrix)

array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])
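One way to tell the two flavors apart is the `ndim` attribute, which reports the number of dimensions. This is a minimal sketch; the names `vector` and `matrix` are chosen for this example only.

```python
import numpy as np

vector = np.array([1, 2, 3])
matrix = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])

# ndim is the number of dimensions, shape is the size along each one
print(vector.ndim, vector.shape)  # 1 (3,)
print(matrix.ndim, matrix.shape)  # 2 (3, 3)
```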

## Built-in Methods

In [6]:
np.arange(0, 10)

array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

In [7]:
np.arange(0, 10, 2)

array([0, 2, 4, 6, 8])

In [8]:
np.arange(0, 50, 5)

array([ 0,  5, 10, 15, 20, 25, 30, 35, 40, 45])
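The calls above follow the pattern `np.arange(start, stop, step)`, where `stop` itself is never included. A small sketch to make that explicit (the name `evens` is hypothetical):

```python
import numpy as np

evens = np.arange(0, 10, 2)
print(evens)        # [0 2 4 6 8]
print(10 in evens)  # False, because the stop value is excluded

# With a single argument, that argument is the stop value and start is 0
print(np.arange(5))  # [0 1 2 3 4]
```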

## Zeros and Ones

In [9]:
# Generate an array of zeros
np.zeros(3)

array([0., 0., 0.])

In [10]:
# Generate 5 X 5 matrix with zeros
np.zeros((5,5))

array([[0., 0., 0., 0., 0.],
       [0., 0., 0., 0., 0.],
       [0., 0., 0., 0., 0.],
       [0., 0., 0., 0., 0.],
       [0., 0., 0., 0., 0.]])

In [11]:
np.ones(4)

array([1., 1., 1., 1.])

In [12]:
np.ones((3, 3))

array([[1., 1., 1.],
       [1., 1., 1.],
       [1., 1., 1.]])
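The trailing dots in the outputs above indicate floating-point values, which is the default for both functions. If integers are wanted, a `dtype` argument can be passed. A minimal sketch with hypothetical variable names:

```python
import numpy as np

float_zeros = np.zeros(3)              # default dtype is float64
int_ones = np.ones((2, 2), dtype=int)  # request integers instead

print(float_zeros.dtype)  # float64
print(int_ones)
# [[1 1]
#  [1 1]]
```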

## linspace
Returns evenly spaced numbers over a specified interval. Returns `num` evenly spaced samples (`num` defaults to 50), calculated over the interval [`start`, `stop`].

The endpoint of the interval can optionally be excluded.
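To see what excluding the endpoint does, here is a short sketch using the `endpoint` keyword of `np.linspace`. The variable names are made up for this example.

```python
import numpy as np

# Default: the stop value is the last sample
with_end = np.linspace(0, 10, 5)

# endpoint=False: same number of samples, but stop is left out
without_end = np.linspace(0, 10, 5, endpoint=False)

print(with_end.tolist())     # [0.0, 2.5, 5.0, 7.5, 10.0]
print(without_end.tolist())  # [0.0, 2.0, 4.0, 6.0, 8.0]
```

Note that the third argument is the number of samples, not the step size as in `np.arange`.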

In [13]:
np.linspace(0, 10)

array([ 0.        ,  0.20408163,  0.40816327,  0.6122449 ,  0.81632653,
        1.02040816,  1.2244898 ,  1.42857143,  1.63265306,  1.83673469,
        2.04081633,  2.24489796,  2.44897959,  2.65306122,  2.85714286,
        3.06122449,  3.26530612,  3.46938776,  3.67346939,  3.87755102,
        4.08163265,  4.28571429,  4.48979592,  4.69387755,  4.89795918,
        5.10204082,  5.30612245,  5.51020408,  5.71428571,  5.91836735,
        6.12244898,  6.32653061,  6.53061224,  6.73469388,  6.93877551,
        7.14285714,  7.34693878,  7.55102041,  7.75510204,  7.95918367,
        8.16326531,  8.36734694,  8.57142857,  8.7755102 ,  8.97959184,
        9.18367347,  9.3877551 ,  9.59183673,  9.79591837, 10.        ])

In [14]:
np.linspace(0, 10, 3)

array([ 0.,  5., 10.])

In [15]:
np.linspace(2, 10, 5)

array([ 2.,  4.,  6.,  8., 10.])

In [16]:
np.linspace(0, 10, 50)

array([ 0.        ,  0.20408163,  0.40816327,  0.6122449 ,  0.81632653,
        1.02040816,  1.2244898 ,  1.42857143,  1.63265306,  1.83673469,
        2.04081633,  2.24489796,  2.44897959,  2.65306122,  2.85714286,
        3.06122449,  3.26530612,  3.46938776,  3.67346939,  3.87755102,
        4.08163265,  4.28571429,  4.48979592,  4.69387755,  4.89795918,
        5.10204082,  5.30612245,  5.51020408,  5.71428571,  5.91836735,
        6.12244898,  6.32653061,  6.53061224,  6.73469388,  6.93877551,
        7.14285714,  7.34693878,  7.55102041,  7.75510204,  7.95918367,
        8.16326531,  8.36734694,  8.57142857,  8.7755102 ,  8.97959184,
        9.18367347,  9.3877551 ,  9.59183673,  9.79591837, 10.        ])

## eye

Creates an identity matrix.

In [17]:
np.eye(4)

array([[1., 0., 0., 0.],
       [0., 1., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 1.]])
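The defining property of an identity matrix is that multiplying by it leaves a matrix unchanged. A minimal sketch of that check, using a small hypothetical matrix `m`:

```python
import numpy as np

m = np.array([[1, 2], [3, 4]])
identity = np.eye(2)

# Matrix product with the identity returns the same values
product = m @ identity
print(np.array_equal(product, m))  # True
```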

## Random
NumPy also has many ways to create arrays of random numbers:

### Rand

Create an array of the given shape and populate it with
random samples from a uniform distribution
over `[0, 1)`.

In [18]:
np.random.rand(2)

array([0.74829785, 0.5653491 ])

In [19]:
# Generate 5 X 5 random numbers array
np.random.rand(5, 5)

array([[0.71411407, 0.28016586, 0.4287669 , 0.28609522, 0.93514606],
       [0.95346751, 0.31181671, 0.58047915, 0.90773127, 0.83629302],
       [0.47627779, 0.09000674, 0.93479394, 0.61415057, 0.07792417],
       [0.46945797, 0.89106574, 0.9279958 , 0.42216017, 0.01022835],
       [0.87089848, 0.31123415, 0.44355141, 0.23834846, 0.13410084]])
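Because the values are random, the numbers above will differ on every run. If repeatable results are needed, one option is to set a seed first with `np.random.seed`. The seed value 101 below is arbitrary and chosen only for this sketch.

```python
import numpy as np

np.random.seed(101)  # arbitrary seed, for illustration only
first = np.random.rand(3)

np.random.seed(101)  # resetting the same seed repeats the sequence
second = np.random.rand(3)

print(np.array_equal(first, second))       # True
print(((first >= 0) & (first < 1)).all())  # True, every value lies in [0, 1)
```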

### Randn
Returns a sample (or samples) from the "standard normal" distribution, unlike `rand`, which samples from a uniform distribution:

In [20]:
np.random.randn(2)

array([ 2.11101342, -0.39151313])

In [21]:
np.random.randn(5, 5)

array([[-0.05152892, -0.83109694, -0.8290442 ,  0.19803321, -0.06845641],
       [ 1.56273738,  0.35083493, -0.53087745,  0.63044211, -0.48675494],
       [ 0.69577338, -0.14969867, -0.86852209, -1.0816226 ,  0.43445085],
       [ 0.58856404, -0.13762915, -3.26949701, -0.44474437, -1.03409991],
       [-0.38129596, -1.71366274, -0.41459824,  0.84791644, -0.1321568 ]])
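A standard normal distribution has mean 0 and standard deviation 1, so a large sample should come out close to those values. The sketch below checks this. The seed and sample size are arbitrary choices for this example.

```python
import numpy as np

np.random.seed(0)  # arbitrary seed so the run is repeatable
samples = np.random.randn(100_000)

# Both statistics should be close to the theoretical values 0 and 1
print(samples.mean())
print(samples.std())

# Unlike rand, randn can produce negative values
print((samples < 0).any())  # True
```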

### Randint
Returns random integers from `low` (inclusive) to `high` (exclusive).

In [22]:
np.random.randint(1,100)

25

In [23]:
np.random.randint(1, 100, 10)

array([38, 40, 65, 38, 22, 35,  4, 25, 34, 31])
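Since `high` is exclusive, simulating a six-sided die needs `high=7`. A minimal sketch, with an arbitrary seed and a made-up variable name:

```python
import numpy as np

np.random.seed(0)  # arbitrary seed so the run is repeatable

# 1000 die rolls: low=1 is included, high=7 is excluded
rolls = np.random.randint(1, 7, 1000)

print(rolls.min(), rolls.max())  # 1 6
```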

## Array attributes and methods

In [24]:
arr = np.arange(25)
ranarr = np.random.randint(0, 50, 10)

In [25]:
arr

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15, 16,
       17, 18, 19, 20, 21, 22, 23, 24])

In [26]:
ranarr

array([12, 12, 40, 24, 18, 48,  6, 36, 38, 35])

### Reshape
Returns an array containing the same data with a new shape.

In [27]:
arr.reshape(5, 5)

array([[ 0,  1,  2,  3,  4],
       [ 5,  6,  7,  8,  9],
       [10, 11, 12, 13, 14],
       [15, 16, 17, 18, 19],
       [20, 21, 22, 23, 24]])
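Two details are worth a quick sketch: one dimension can be given as `-1` so that NumPy infers it, and the new shape must hold exactly the same number of elements. The name `grid` is hypothetical.

```python
import numpy as np

arr = np.arange(25)

# -1 asks NumPy to work out the missing dimension (25 / 5 = 5)
grid = arr.reshape(5, -1)
print(grid.shape)  # (5, 5)

# reshape returns a new array object; arr keeps its original shape
print(arr.shape)   # (25,)

# 4 * 6 = 24 does not match 25 elements, so this raises ValueError
try:
    arr.reshape(4, 6)
except ValueError as err:
    print("reshape failed:", err)
```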

### max,min,argmax,argmin

These are useful methods for finding the maximum or minimum values, and for finding their index locations with `argmax` or `argmin`.

In [28]:
ranarr

array([12, 12, 40, 24, 18, 48,  6, 36, 38, 35])

In [29]:
# Return the maximum value in the array.
ranarr.max()

48

In [30]:
# Return the minimum value in the array.
ranarr.min()

6

In [31]:
# Return the index of the maximum value.
ranarr.argmax()

5

In [32]:
# Return the index of the minimum value.
ranarr.argmin()

6
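The index returned by `argmax` can be used to look the maximum back up, and on a 2-d array these methods also accept an `axis` argument. The sketch below reuses the values printed for `ranarr` above. The 2 x 5 reshape is an assumption made only for this example.

```python
import numpy as np

# Same values as the ranarr output shown above
ranarr = np.array([12, 12, 40, 24, 18, 48, 6, 36, 38, 35])

# Indexing with argmax gives back the maximum value
print(ranarr[ranarr.argmax()] == ranarr.max())  # True

# On a 2-d array, axis=0 works down the columns and axis=1 along the rows
grid = ranarr.reshape(2, 5)
print(grid.max(axis=0))  # [48 12 40 38 35]
print(grid.max(axis=1))  # [40 48]
```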

### shape
Returns the shape of an array. `shape` is an attribute, not a method, so it is accessed without parentheses.

In [33]:
# Vector
arr.shape

(25,)

In [34]:
# Notice the two sets of brackets
arr.reshape(1, 25)

array([[ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15,
        16, 17, 18, 19, 20, 21, 22, 23, 24]])

In [35]:
arr.reshape(1,25).shape

(1, 25)

In [36]:
arr.reshape(25,1)

array([[ 0],
       [ 1],
       [ 2],
       [ 3],
       [ 4],
       [ 5],
       [ 6],
       [ 7],
       [ 8],
       [ 9],
       [10],
       [11],
       [12],
       [13],
       [14],
       [15],
       [16],
       [17],
       [18],
       [19],
       [20],
       [21],
       [22],
       [23],
       [24]])

In [37]:
arr.reshape(25, 1).shape

(25, 1)

In [38]:
arr.dtype

dtype('int64')
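Like `shape`, `dtype` is an attribute that describes the type of the elements. The exact integer width reported above can depend on the platform and NumPy version. As a small sketch, an array can be converted to another type with `astype` (the name `as_float` is made up for this example):

```python
import numpy as np

arr = np.arange(25)

# astype returns a converted copy; arr itself is unchanged
as_float = arr.astype(float)
print(as_float.dtype)  # float64

# Arrays built from Python floats are float64 by default
print(np.array([1.5, 2.5]).dtype)  # float64
```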