<a href="https://colab.research.google.com/github/shubham62025865/shubham1/blob/main/Lecture_4_NumPy.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>


<img src="https://user-images.githubusercontent.com/7065401/39118381-910eb0c2-46e9-11e8-81f1-a5b897401c23.jpeg"
    style="width:300px; float: right; margin: 0 40px 40px 40px;"></img>

# NumPy: Numeric computing library

NumPy (Numerical Python) is one of the core packages for numerical computing in Python. Pandas, Matplotlib, Statsmodels and many other scientific libraries rely on NumPy.

NumPy's major contributions are:

* Efficient numeric computation with C primitives
* Efficient collections with vectorized operations
* An integrated and natural Linear Algebra API
* A C API for connecting NumPy with libraries written in C, C++, or FORTRAN.



![purple-divider](https://user-images.githubusercontent.com/7065401/52071927-c1cd7100-2562-11e9-908a-dde91ba14e59.png)

## Hands on! 

In [None]:
import sys
import numpy as np

## Basic NumPy Arrays

In [None]:
import pandas as pd
pd.Series([1,2,3,4])

0    1
1    2
2    3
3    4
dtype: int64

In [None]:
np.array([1,2,3,4])

array([1, 2, 3, 4])

In [None]:
a = np.array([1, 2, 3, 4])

In [None]:
a

array([1, 2, 3, 4])

In [None]:
a[1]

2

In [None]:
a[-1]

4

In [None]:
b = np.array([1.0, 2.5, 3.1,4.7])
b

array([1. , 2.5, 3.1, 4.7])

In [None]:
b.dtype

dtype('float64')

In [None]:
a.dtype

dtype('int64')

In [None]:
a = np.array([1, 2, 3, 4])

In [None]:
b = np.array([0, .5, 1, 1.5, 2])

In [None]:
a[0], a[1]

(1, 2)

In [None]:
a[0:3]

array([1, 2, 3])

In [None]:
a[1:3]

array([2, 3])

In [None]:
a

array([1, 2, 3, 4])

In [None]:
a[0:-1]

array([1, 2, 3])

In [None]:
a[::2]

array([1, 3])

In [None]:
b

array([0. , 0.5, 1. , 1.5, 2. ])

In [None]:
b[0], b[2], a[-1]

(0.0, 1.0, 4)

In [None]:
b[[0, 2, -1]]

array([0., 1., 2.])

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Array Types

In [None]:
a

array([1, 2, 3, 4])

In [None]:
a.dtype

dtype('int64')

In [None]:
c = np.array([1,2,3,4,"hello"])


In [None]:
c

array(['1', '2', '3', '4', 'hello'], dtype='<U21')

In [None]:
b

array([0. , 0.5, 1. , 1.5, 2. ])

In [None]:
b.dtype

dtype('float64')

In [None]:
np.array([1, 2, 3, 4], dtype=float)

array([1., 2., 3., 4.])

In [None]:
np.array([1, 2, 3, 4], dtype=np.int8)

array([1, 2, 3, 4], dtype=int8)

In [None]:
c = np.array(['a', 'b', 'c'])

In [None]:
c.dtype

dtype('<U1')

In [None]:
d = np.array([{'a': 1}, sys])

In [None]:
d.dtype

dtype('O')
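
As the cells above suggest, NumPy picks a single dtype for the whole array. A minimal sketch of that promotion, plus explicit conversion with `astype` (the variable names are illustrative only):

```python
import numpy as np

# Mixed ints and floats are promoted to a float dtype
mixed = np.array([1, 2, 3.5])
print(mixed.dtype)                   # float64

# Mixing numbers and strings turns every element into a string
texty = np.array([1, 2, "hello"])
print(texty.dtype.kind)              # U (unicode string)

# astype returns a converted copy; the original array is unchanged
ints = np.array([1, 2, 3, 4])
small = ints.astype(np.int8)
print(small.dtype, small.itemsize)   # int8 1
```

Choosing a smaller dtype such as `int8` saves memory, at the cost of a narrower range of representable values.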

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Dimensions and shapes

In [None]:
a = np.array([
    [7,8,9],
    [4,5,6],
    [1,2,3]
])

In [None]:
a.shape

(3, 3)

In [None]:
b

array([0. , 0.5, 1. , 1.5, 2. ])

In [None]:
b.shape

(5,)

In [None]:
a.ndim

2

In [None]:
a.size

9

In [None]:
B = np.array([
    [
        [12, 11, 10],
        [9, 8, 7],
    ],
    [
        [6, 5, 4],
        [3, 2, 1]
    ]
])

In [None]:
B

array([[[12, 11, 10],
        [ 9,  8,  7]],

       [[ 6,  5,  4],
        [ 3,  2,  1]]])

In [None]:
B.shape

(2, 2, 3)

In [None]:
B.ndim

3

In [None]:
B.size

12

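If the shape isn't consistent, NumPy can't build a regular multidimensional array. Older versions fell back to an array of Python objects and emitted a deprecation warning; since NumPy 1.24 this raises a `ValueError` unless `dtype=object` is passed explicitly:

In [None]:
C = np.array([
    [
        [12, 11, 10],
        [9, 8, 7],
    ],
    [
        [6, 5, 4],
    ]
], dtype=object)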

In [None]:
type(C)

numpy.ndarray

In [None]:
C

array([list([[12, 11, 10], [9, 8, 7]]), list([[6, 5, 4]])], dtype=object)

In [None]:
C.dtype

dtype('O')

In [None]:
C.shape

(2,)

In [None]:
C.size

2

In [None]:
type(C[0])

list

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Indexing and Slicing of Matrices

In [None]:
import numpy as np
# Square matrix
A = np.array([
#    0  1  2
    [1, 2, 3], # 0
    [4, 5, 6], # 1
    [7, 8, 9]  # 2
])

In [None]:
A

array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

In [None]:
A[2][0]

7

In [None]:
a_list = [7, 8, 9]

In [None]:
a_list[2]

9

In [None]:
A[2][2]

9

In [None]:
A[0][1]

2

In [None]:
A[0][0], A[2][1]

(1, 8)

In [None]:
A[1]

array([4, 5, 6])

In [None]:
A[1][0]

4

In [None]:
# A[d1, d2, d3, d4]

In [None]:
A

array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

In [None]:
A[1, 0]

4

In [None]:
A

array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

In [None]:
A[2:3, 0:1]

array([[7]])

In [None]:
A[1:3, 1:3]

array([[5, 6],
       [8, 9]])

In [None]:
A[2][0]

7

In [None]:
A[:, 1:2]

array([[2],
       [5],
       [8]])

In [None]:
A[0:, 1].shape

(3,)

In [None]:
A[1:,:2]

array([[4, 5],
       [7, 8]])

In [None]:
A[0:2,1:]

array([[2, 3],
       [5, 6]])

In [None]:
A[0][2]

3

In [None]:
A[:3,2:3]

array([[3],
       [6],
       [9]])

In [None]:
A[0:2,2]

array([3, 6])

In [None]:
A[0:2]

array([[1, 2, 3],
       [4, 5, 6]])

In [None]:
A[:, :2]

array([[1, 2],
       [4, 5],
       [7, 8]])

In [None]:
A[:2, :2]

array([[1, 2],
       [4, 5]])

In [None]:
A[:2, 2:]

array([[3],
       [6]])

In [None]:
A

array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

In [None]:
A[1] = np.array([10, 10, 10])

In [None]:
A

array([[ 1,  2,  3],
       [10, 10, 10],
       [ 7,  8,  9]])

In [None]:
A[2] = 99

In [None]:
A

array([[ 1,  2,  3],
       [10, 10, 10],
       [99, 99, 99]])

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Summary statistics

In [None]:
a = np.array([1, 2, 3, 4])

In [None]:
a.sum()

10

In [None]:
a.mean()

2.5

In [None]:
a.std()

1.118033988749895

In [None]:
a.var()

1.25

In [None]:
A = np.array([
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9]
])

In [None]:
A[0:2].sum()

21

In [None]:
A.max()

9

In [None]:
A.min()

1

In [None]:
A.sum()

45

In [None]:
A.mean()

5.0

In [None]:
A.std()

2.581988897471611

In [None]:
A

array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

In [None]:
A.sum(axis=0)

array([12, 15, 18])

In [None]:
A.sum(axis=1)

array([ 6, 15, 24])

In [None]:
A.mean(axis=0)

array([4., 5., 6.])

In [None]:
A.mean(axis=1)

array([2., 5., 8.])

In [None]:
A.std(axis=0)

array([2.44948974, 2.44948974, 2.44948974])

In [None]:
A.std(axis=1)

array([0.81649658, 0.81649658, 0.81649658])
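
The `axis` argument names the dimension that gets collapsed. A minimal sketch with the same matrix, where `axis=0` aggregates down each column and `axis=1` across each row:

```python
import numpy as np

A = np.array([
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
])

col_sums = A.sum(axis=0)   # collapse the rows: one value per column
row_sums = A.sum(axis=1)   # collapse the columns: one value per row

print(col_sums)            # [12 15 18]
print(row_sums)            # [ 6 15 24]

# Summing either result gives the grand total
print(col_sums.sum() == row_sums.sum() == A.sum())   # True
```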

And [many more](https://docs.scipy.org/doc/numpy-1.13.0/reference/arrays.ndarray.html#array-methods)...

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Broadcasting and Vectorized operations

In [None]:
a = np.arange(4)

In [None]:
a

array([0, 1, 2, 3])

In [None]:
a + 10

array([10, 11, 12, 13])

In [None]:
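a * 10

array([ 0, 10, 20, 30])

In [None]:
# Same idea with a plain Python scalar, using a separate name so `a` stays an array
x = 5
x = x + 4

In [None]:
x

9

In [None]:
a

array([0, 1, 2, 3])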

In [None]:
a += 100

In [None]:
a

array([100, 101, 102, 103])

In [None]:
l = [0, 1, 2, 3]

In [None]:
l_new = []
for i in l:
  l_new.append(i * 10)

In [None]:
l_new

[0, 10, 20, 30]

In [None]:
[i * 10 for i in l]

[0, 10, 20, 30]

In [None]:
a = np.arange(4)

In [None]:
a

array([0, 1, 2, 3])

In [None]:
b = np.array([10, 10, 10, 10])

In [None]:
b

array([10, 10, 10, 10])

In [None]:
a + b

array([10, 11, 12, 13])

In [None]:
a * b

array([ 0, 10, 20, 30])
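
Broadcasting also applies when the shapes differ but are compatible. A sketch, assuming a 3x3 matrix and a length-3 vector (the names are illustrative): the vector is stretched across every row, and a column of shape `(3, 1)` is stretched across every column.

```python
import numpy as np

M = np.arange(9).reshape(3, 3)            # [[0 1 2], [3 4 5], [6 7 8]]
row = np.array([10, 20, 30])              # shape (3,)
col = np.array([[100], [200], [300]])     # shape (3, 1)

# row is added to every row of M
print(M + row)
# [[10 21 32]
#  [13 24 35]
#  [16 27 38]]

# col is added to every column of M
print(M + col)
# [[100 101 102]
#  [203 204 205]
#  [306 307 308]]
```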

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Boolean arrays
_(Also called masks)_

In [None]:
a = np.arange(4)

In [None]:
a

array([0, 1, 2, 3])

In [None]:
a[0], a[-1]

(0, 3)

In [None]:
a[[0, -1]]

array([0, 3])

In [None]:
a

array([0, 1, 2, 3])

In [None]:
a[[True, False, True, True]]

array([0, 2, 3])

In [None]:
a[[False, True, False, True]]

array([1, 3])

In [None]:
a

array([0, 1, 2, 3])

In [None]:
a >= 2

array([False, False,  True,  True])

In [None]:
a[a >= 2]

array([2, 3])

In [None]:
a.mean()

1.5

In [None]:
a[a > a.mean()]

array([2, 3])

In [None]:
a> a.mean()

array([False, False,  True,  True])

In [None]:
~(a > a.mean())

array([ True,  True, False, False])

In [None]:
a[~(a > a.mean())]

array([0, 1])

In [None]:
a[(a == 0) | (a == 1)]

array([0, 1])

In [None]:
a = np.array([0,1,2,3,4])
a

array([0, 1, 2, 3, 4])

In [None]:
a[(a <= 2) & (a % 2 == 0)]

array([0, 2])

In [None]:
A = np.random.randint(100, size=(3, 3))

In [None]:
A

array([[64, 88, 60],
       [ 6, 84, 99],
       [50, 35, 17]])

In [None]:
A[np.array([
    [True, False, True],
    [False, True, False],
    [True, False, True]
])]

array([64, 60, 84, 50, 17])

In [None]:
A

array([[64, 88, 60],
       [ 6, 84, 99],
       [50, 35, 17]])

In [None]:
A > 30

array([[ True,  True,  True],
       [False,  True,  True],
       [ True,  True, False]])

In [None]:
A[A>30]

array([64, 88, 60, 84, 99, 50, 35])
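
Masks can also be used to count and to assign. A minimal sketch using the same values as the matrix above (hard-coded here, since the original was random):

```python
import numpy as np

A = np.array([
    [64, 88, 60],
    [ 6, 84, 99],
    [50, 35, 17],
])

mask = A > 30

# True counts as 1, so summing a mask counts the matches
print(mask.sum())         # 7

# Assigning through a mask changes only the selected elements
B = A.copy()              # copy so A itself is left untouched
B[mask] = 0
print(B)
# [[ 0  0  0]
#  [ 6  0  0]
#  [ 0  0 17]]
```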

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Linear Algebra

In [None]:
A = np.array([
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9]
])

In [None]:
A.size

9

In [None]:
A.shape

(3, 3)

In [None]:
B = np.array([
    [6, 5],
    [4, 3],
    [2, 1]
])

In [None]:
A.shape,B.shape

((3, 3), (3, 2))

In [None]:
print(A)
print(B)

[[1 2 3]
 [4 5 6]
 [7 8 9]]
[[6 5]
 [4 3]
 [2 1]]


In [None]:
A.dot(B)

array([[20, 14],
       [56, 41],
       [92, 68]])

In [None]:
A @ B

array([[20, 14],
       [56, 41],
       [92, 68]])

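In [None]:
# B is (3, 2) and A is (3, 3): the inner dimensions (2 and 3) differ, so this fails
B @ A

ValueError: matmul: Input operand 1 has a mismatch in its core dimension 0, with gufunc signature (n?,k),(k,m?)->(n?,m?) (size 3 is different from 2)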

In [None]:
B

array([[6, 5],
       [4, 3],
       [2, 1]])

In [None]:
B.T

array([[6, 4, 2],
       [5, 3, 1]])

In [None]:
B.T.shape

(2, 3)

In [None]:
B.T @ A

array([[36, 48, 60],
       [24, 33, 42]])
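
The rule behind these results: `X @ Y` needs the number of columns of `X` to equal the number of rows of `Y`, and the result has shape `(rows of X, columns of Y)`. A sketch that checks this and catches the error for the incompatible order:

```python
import numpy as np

A = np.arange(1, 10).reshape(3, 3)          # shape (3, 3)
B = np.array([[6, 5], [4, 3], [2, 1]])      # shape (3, 2)

print((A @ B).shape)      # (3, 2): (3, 3) @ (3, 2)
print((B.T @ A).shape)    # (2, 3): (2, 3) @ (3, 3)

try:
    B @ A                 # (3, 2) @ (3, 3): 2 != 3
except ValueError as err:
    print("ValueError:", err)
```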

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Size of objects in Memory

### Int, floats

In [None]:
# A Python int object takes more than 24 bytes
sys.getsizeof(1)

28

In [None]:
10**100

10000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000

In [None]:
# Large integers take even more space
sys.getsizeof(10**100)

72

In [None]:
# A NumPy integer element is much smaller: 8 bytes for the default int
np.dtype(int).itemsize

8

In [None]:
# With int8, each element takes a single byte
np.dtype(np.int8).itemsize

1

In [None]:
np.dtype(float).itemsize

8

### Lists are even larger

In [None]:
A

array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

In [None]:
l = [[1,2,3],[4,5,6],[7,8,9]]
l

[[1, 2, 3], [4, 5, 6], [7, 8, 9]]

In [None]:
# Size of the outer list only: getsizeof does not count the nested lists or their ints
sys.getsizeof(l)

80

In [None]:
sys.getsizeof(A)

200

In [None]:
A.nbytes

72

In [None]:
# An array of one element in numpy
np.array([1]).nbytes

8
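
`sys.getsizeof` on a list counts only the list's own pointer table. A rough sketch of a fairer comparison adds the size of each element; exact byte counts depend on the Python build, so none are shown here.

```python
import sys
import numpy as np

n = 1000
py_list = list(range(n))
np_arr = np.arange(n, dtype=np.int64)

# List container plus every int object it points to
list_bytes = sys.getsizeof(py_list) + sum(sys.getsizeof(x) for x in py_list)

# Raw data buffer of the array: 8 bytes per int64 element
array_bytes = np_arr.nbytes

print(list_bytes, array_bytes)
print(list_bytes > array_bytes)   # True on CPython
```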

### And performance is also important

In [None]:
l = list(range(100000))

In [None]:
a = np.arange(100000)

In [None]:
%time np.sum(a ** 2)

CPU times: user 1.77 ms, sys: 0 ns, total: 1.77 ms
Wall time: 4.08 ms


333328333350000

In [None]:
total = 0
for x in l:
  total = total + x ** 2

In [None]:
%time sum([x ** 2 for x in l])

CPU times: user 31.6 ms, sys: 2.96 ms, total: 34.5 ms
Wall time: 39.3 ms


333328333350000
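
`%time` only works inside IPython or Jupyter. A sketch of the same comparison with the standard-library `timeit` module, which runs anywhere; timings vary by machine, so only the results are compared:

```python
import timeit
import numpy as np

n = 100_000
py_list = list(range(n))
np_arr = np.arange(n, dtype=np.int64)

def list_sum_squares():
    # Pure-Python loop over the list
    return sum(x ** 2 for x in py_list)

def numpy_sum_squares():
    # Vectorized: square every element, then sum
    return int(np.sum(np_arr ** 2))

# Both approaches compute the same number
assert list_sum_squares() == numpy_sum_squares()

# Run each 10 times and report the total elapsed seconds
print("list :", timeit.timeit(list_sum_squares, number=10))
print("numpy:", timeit.timeit(numpy_sum_squares, number=10))
```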

![green-divider](https://user-images.githubusercontent.com/7065401/52071924-c003ad80-2562-11e9-8297-1c6595f8a7ff.png)

## Useful NumPy functions

### `random` 

In [None]:
np.random.random(size=(2,3))

array([[0.58898973, 0.574433  , 0.03143499],
       [0.28198128, 0.32565634, 0.19442542]])

In [None]:
np.random.normal(size=(5,6))

array([[-0.84057722, -0.59024273,  0.55548696,  1.1726968 ,  1.20521045,
        -0.48614122],
       [ 0.60468936,  0.67279471,  1.00267783,  0.84192216, -0.22178095,
         3.13658169],
       [ 0.68723809,  0.94525619, -0.56621844,  0.20030403,  1.10170478,
        -0.33699378],
       [ 1.77670015, -0.00655044, -0.59067784,  1.58365128, -0.94401531,
        -0.12003463],
       [-0.23553438, -1.28728761,  1.13855694,  0.02432497,  0.36663582,
        -0.94309475]])

In [None]:
np.random.rand(2, 4)

array([[0.72850766, 0.78762947, 0.66898155, 0.48236568],
       [0.77490774, 0.80350347, 0.52459634, 0.19196585]])
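
The outputs above change on every run. If reproducible numbers are needed, one option (not used in the original cells) is NumPy's `Generator` API with a fixed seed; the seed value 42 is arbitrary:

```python
import numpy as np

rng = np.random.default_rng(42)            # seeded generator

uniform = rng.random(size=(2, 3))          # floats in [0, 1)
normal = rng.normal(size=(2, 3))           # standard normal samples
ints = rng.integers(0, 100, size=(3, 3))   # ints in [0, 100)

# Re-creating the generator with the same seed repeats the sequence
rng_again = np.random.default_rng(42)
print(np.array_equal(uniform, rng_again.random(size=(2, 3))))   # True
```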

---
### `arange`

In [None]:
np.arange(10)

array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

In [None]:
np.arange(5, 10)

array([5, 6, 7, 8, 9])

In [None]:
np.arange(0, 1, .1)

array([0. , 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9])

---
### `reshape`

In [None]:
np.arange(20)

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15, 16,
       17, 18, 19])

In [None]:
np.arange(20).shape

(20,)

In [None]:
np.arange(20).reshape(5,4)

array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15],
       [16, 17, 18, 19]])

In [None]:
np.arange(20).reshape(5,2,2)

array([[[ 0,  1],
        [ 2,  3]],

       [[ 4,  5],
        [ 6,  7]],

       [[ 8,  9],
        [10, 11]],

       [[12, 13],
        [14, 15]],

       [[16, 17],
        [18, 19]]])
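
`reshape` only rearranges existing elements, so the product of the new dimensions must equal the array's size. A small sketch of that rule, including `-1`, which asks NumPy to infer one dimension:

```python
import numpy as np

v = np.arange(20)

# The product of the new shape must equal the number of elements (20)
print(v.reshape(5, 4).shape)      # (5, 4)

# -1 lets NumPy work out that dimension from the others
print(v.reshape(-1, 10).shape)    # (2, 10)

try:
    v.reshape(3, 7)               # 3 * 7 = 21, which is not 20
except ValueError as err:
    print("ValueError:", err)
```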

In [None]:
temp = np.arange(20).reshape(2, 5, 2)
temp

array([[[ 0,  1],
        [ 2,  3],
        [ 4,  5],
        [ 6,  7],
        [ 8,  9]],

       [[10, 11],
        [12, 13],
        [14, 15],
        [16, 17],
        [18, 19]]])

In [None]:
temp[0][2][1]

5

In [None]:
temp[1]

array([[10, 11],
       [12, 13],
       [14, 15],
       [16, 17],
       [18, 19]])

In [None]:
temp[1][2]

array([14, 15])

In [None]:
temp[1][2][1]

15

In [None]:
z = np.arange(60).reshape(2, 2, 5, 3)

In [None]:
z.shape

(2, 2, 5, 3)

In [None]:
z.ndim

4

In [None]:
z

array([[[[ 0,  1,  2],
         [ 3,  4,  5],
         [ 6,  7,  8],
         [ 9, 10, 11],
         [12, 13, 14]],

        [[15, 16, 17],
         [18, 19, 20],
         [21, 22, 23],
         [24, 25, 26],
         [27, 28, 29]]],


       [[[30, 31, 32],
         [33, 34, 35],
         [36, 37, 38],
         [39, 40, 41],
         [42, 43, 44]],

        [[45, 46, 47],
         [48, 49, 50],
         [51, 52, 53],
         [54, 55, 56],
         [57, 58, 59]]]])

In [None]:
b = np.array([1.0, 2.5, 3.1, 4.7])
b

array([1. , 2.5, 3.1, 4.7])

In [None]:
b.shape

(4,)

In [None]:
b.ndim

1

In [None]:
a = np.array([[7, 8, 9], [4, 5, 6], [1, 2, 3]])
a

array([[7, 8, 9],
       [4, 5, 6],
       [1, 2, 3]])

In [None]:
a.shape

(3, 3)

In [None]:
a.ndim

2

In [None]:
np.arange(10).reshape(5, 2)

array([[0, 1],
       [2, 3],
       [4, 5],
       [6, 7],
       [8, 9]])

---
### `linspace`

In [None]:
np.linspace(0, 1, 5)

array([0.  , 0.25, 0.5 , 0.75, 1.  ])

In [None]:
np.linspace(0, 1, 20)

array([0.        , 0.05263158, 0.10526316, 0.15789474, 0.21052632,
       0.26315789, 0.31578947, 0.36842105, 0.42105263, 0.47368421,
       0.52631579, 0.57894737, 0.63157895, 0.68421053, 0.73684211,
       0.78947368, 0.84210526, 0.89473684, 0.94736842, 1.        ])

In [None]:
np.linspace(0, 1, 20, endpoint=False)

array([0.  , 0.05, 0.1 , 0.15, 0.2 , 0.25, 0.3 , 0.35, 0.4 , 0.45, 0.5 ,
       0.55, 0.6 , 0.65, 0.7 , 0.75, 0.8 , 0.85, 0.9 , 0.95])

---
### `zeros`, `ones`, `empty`

In [None]:
np.zeros(5)

array([0., 0., 0., 0., 0.])

In [None]:
np.zeros((3, 3))

array([[0., 0., 0.],
       [0., 0., 0.],
       [0., 0., 0.]])

In [None]:
np.zeros((3, 3), dtype=int)

array([[0, 0, 0],
       [0, 0, 0],
       [0, 0, 0]])

In [None]:
np.ones(5)

array([1., 1., 1., 1., 1.])

In [None]:
np.ones((3, 3))

array([[1., 1., 1.],
       [1., 1., 1.],
       [1., 1., 1.]])

In [None]:
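# np.empty allocates memory without initializing it, so the values shown
# are leftovers from earlier use and will differ between runs
np.empty(5)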

array([1., 1., 1., 1., 1.])

In [None]:
np.empty((3, 3))

array([[1., 0., 0.],
       [0., 1., 0.],
       [0., 0., 1.]])

---
### `identity` and `eye`

In [None]:
np.identity(6)

array([[1., 0., 0., 0., 0., 0.],
       [0., 1., 0., 0., 0., 0.],
       [0., 0., 1., 0., 0., 0.],
       [0., 0., 0., 1., 0., 0.],
       [0., 0., 0., 0., 1., 0.],
       [0., 0., 0., 0., 0., 1.]])

In [None]:
np.eye(3, 3)

array([[1., 0., 0.],
       [0., 1., 0.],
       [0., 0., 1.]])

In [None]:
np.eye(8, 4)

array([[1., 0., 0., 0.],
       [0., 1., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 1.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.]])

In [None]:
np.eye(4, 4, k=0)

array([[1., 0., 0., 0.],
       [0., 1., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 1.]])

In [None]:
np.eye(4, 4, k=1)

array([[0., 1., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 1.],
       [0., 0., 0., 0.]])

In [None]:
np.eye(4, 4, k=-1)

array([[0., 0., 0., 0.],
       [1., 0., 0., 0.],
       [0., 1., 0., 0.],
       [0., 0., 1., 0.]])

![purple-divider](https://user-images.githubusercontent.com/7065401/52071927-c1cd7100-2562-11e9-908a-dde91ba14e59.png)