# NumPy
The NumPy package provides the "ndarray" object, an array that holds data of a uniform type in an arbitrary number of dimensions. NumPy also provides the basic mathematical and array routines that form the foundation of the entire SciPy ecosystem. The following import statement is the generally accepted convention for NumPy.

In [1]:
import numpy as np

## Array Creation
There are several ways to make NumPy arrays. Every array has three attributes that can be queried: "shape", "size" and "ndim" (the number of dimensions).

In [2]:
a = np.array([1, 2, 3])
print(a.shape)
print(a.size)
print(a.ndim)

(3,)
3
1


In [3]:
x = np.arange(100)
print(x.shape)
print(x.size)
print(x.ndim)

(100,)
100
1


In [4]:
y = np.random.rand(5, 80)
print(y.shape)
print(y.size)
print(y.ndim)

(5, 80)
400
2
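The cells above use "np.array", "np.arange" and "np.random.rand". As a rough sketch of a few other common constructors, the block below builds arrays with "np.zeros", "np.ones" and "np.linspace" and prints the same three attributes plus the dtype. The variable names are illustrative only and are not used elsewhere in this notebook.

```python
import numpy as np

# Arrays pre-filled with a constant; the shape is passed as a tuple.
zeros = np.zeros((2, 3))                 # float64 by default
ones = np.ones((2, 3), dtype=np.int64)   # dtype can be set explicitly

# Five evenly spaced samples from 0 to 1, both endpoints included.
grid = np.linspace(0.0, 1.0, 5)

# A nested list becomes a 2-D array.
nested = np.array([[1, 2, 3], [4, 5, 6]], dtype=np.float64)

for name, arr in [("zeros", zeros), ("ones", ones),
                  ("grid", grid), ("nested", nested)]:
    print(name, arr.shape, arr.size, arr.ndim, arr.dtype)
```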


## Array Manipulation
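Assigning to the "shape" attribute changes the shape of an array in place, without copying its data. Note that the NumPy documentation discourages setting "shape" directly and recommends "reshape", which also avoids a copy where possible.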

In [5]:
x.shape = (20, 5)
print(x)

[[ 0  1  2  3  4]
 [ 5  6  7  8  9]
 [10 11 12 13 14]
 [15 16 17 18 19]
 [20 21 22 23 24]
 [25 26 27 28 29]
 [30 31 32 33 34]
 [35 36 37 38 39]
 [40 41 42 43 44]
 [45 46 47 48 49]
 [50 51 52 53 54]
 [55 56 57 58 59]
 [60 61 62 63 64]
 [65 66 67 68 69]
 [70 71 72 73 74]
 [75 76 77 78 79]
 [80 81 82 83 84]
 [85 86 87 88 89]
 [90 91 92 93 94]
 [95 96 97 98 99]]
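One way to check the "without a copy" claim is to compare the reshaped array with the original using "np.shares_memory". The sketch below uses "reshape" on a fresh array rather than assigning to "shape"; the names "base" and "view" are illustrative.

```python
import numpy as np

base = np.arange(100)

# reshape returns a new array object; for a contiguous input like this one
# it is a view onto the same buffer, so no element data is copied.
view = base.reshape(20, 5)

print(np.shares_memory(base, view))  # reports whether the buffers overlap

# Writing through the view is visible in the original array.
view[0, 0] = -1
print(base[0])

# The original keeps its own 1-D shape.
print(base.shape, view.shape)
```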


NumPy can also infer the size of a dimension for you: give -1 for at most one dimension, and its size is computed from the total number of elements.

In [6]:
y.shape = (4, 20, -1)
print(y.shape)

(4, 20, 5)
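The same -1 placeholder works with "reshape". The following sketch, with illustrative names, shows the inferred dimension, a flatten via "reshape(-1)", and the error raised when the size cannot be inferred.

```python
import numpy as np

data = np.arange(400)

# One -1 is allowed; NumPy computes 400 / (4 * 20) = 5 for it.
cube = data.reshape(4, 20, -1)
print(cube.shape)

# reshape(-1) flattens back to one dimension.
flat = cube.reshape(-1)
print(flat.shape)

# If the known dimensions do not divide the total size evenly,
# NumPy raises ValueError instead of guessing.
try:
    data.reshape(3, 7, -1)
except ValueError as err:
    print("cannot infer:", err)
```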


## Array Indexing

In [7]:
# Scalar Indexing
print(x[2])

[10 11 12 13 14]


In [8]:
# Slicing
print(x[2:5])

[[10 11 12 13 14]
 [15 16 17 18 19]
 [20 21 22 23 24]]


In [9]:
# Advanced slicing
print("First 5 rows\n", x[:5])
print("Row 18 to the end\n", x[18:])
print("Last 5 rows\n", x[-5:])
print("Reverse the rows\n", x[::-1])

First 5 rows
 [[ 0  1  2  3  4]
 [ 5  6  7  8  9]
 [10 11 12 13 14]
 [15 16 17 18 19]
 [20 21 22 23 24]]
Row 18 to the end
 [[90 91 92 93 94]
 [95 96 97 98 99]]
Last 5 rows
 [[75 76 77 78 79]
 [80 81 82 83 84]
 [85 86 87 88 89]
 [90 91 92 93 94]
 [95 96 97 98 99]]
Reverse the rows
 [[95 96 97 98 99]
 [90 91 92 93 94]
 [85 86 87 88 89]
 [80 81 82 83 84]
 [75 76 77 78 79]
 [70 71 72 73 74]
 [65 66 67 68 69]
 [60 61 62 63 64]
 [55 56 57 58 59]
 [50 51 52 53 54]
 [45 46 47 48 49]
 [40 41 42 43 44]
 [35 36 37 38 39]
 [30 31 32 33 34]
 [25 26 27 28 29]
 [20 21 22 23 24]
 [15 16 17 18 19]
 [10 11 12 13 14]
 [ 5  6  7  8  9]
 [ 0  1  2  3  4]]


In [10]:
# Boolean Indexing
print(x[(x % 2) == 0])

[ 0  2  4  6  8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42 44 46
 48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80 82 84 86 88 90 92 94
 96 98]


In [11]:
# Fancy Indexing -- Note the use of a list, not tuple!
print(x)
print(x[[1, 3, 8, 9, 2]])

[[ 0  1  2  3  4]
 [ 5  6  7  8  9]
 [10 11 12 13 14]
 [15 16 17 18 19]
 [20 21 22 23 24]
 [25 26 27 28 29]
 [30 31 32 33 34]
 [35 36 37 38 39]
 [40 41 42 43 44]
 [45 46 47 48 49]
 [50 51 52 53 54]
 [55 56 57 58 59]
 [60 61 62 63 64]
 [65 66 67 68 69]
 [70 71 72 73 74]
 [75 76 77 78 79]
 [80 81 82 83 84]
 [85 86 87 88 89]
 [90 91 92 93 94]
 [95 96 97 98 99]]
[[ 5  6  7  8  9]
 [15 16 17 18 19]
 [40 41 42 43 44]
 [45 46 47 48 49]
 [10 11 12 13 14]]
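The indexing styles above differ in whether they copy data: basic slicing returns a view, while boolean and fancy indexing return copies. The sketch below checks this with "np.shares_memory" on a separate array, so the notebook's "x" is left untouched; all names are illustrative.

```python
import numpy as np

grid = np.arange(100).reshape(20, 5)

rows_slice = grid[2:5]        # basic slicing: a view
rows_fancy = grid[[2, 3, 4]]  # fancy indexing: a copy
evens = grid[grid % 2 == 0]   # boolean indexing: a copy, flattened to 1-D

print(np.shares_memory(grid, rows_slice))
print(np.shares_memory(grid, rows_fancy))
print(evens.shape)

# Assignment through a boolean mask still modifies the original in place.
grid[grid % 2 == 0] = 0
print(grid[0])
```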


## Broadcasting
Broadcasting is a very useful feature of NumPy that lets arrays with different shapes be used together in a single operation. In most cases, broadcasting is faster and more memory efficient than first expanding the smaller array to full size.
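Whether two shapes can be combined follows a simple rule: shapes are aligned at the trailing dimension, and each pair of sizes must be equal or contain a 1. A small sketch of that rule, assuming NumPy 1.20 or newer for "np.broadcast_shapes":

```python
import numpy as np

# Shapes are compared from the trailing dimension backwards; two sizes are
# compatible when they are equal or when one of them is 1. Missing leading
# dimensions are treated as size 1.
shape_a = np.broadcast_shapes((20, 5), (4, 20, 5))
shape_b = np.broadcast_shapes((64, 1), (1, 100))
print(shape_a)
print(shape_b)

# Incompatible trailing dimensions (5 vs 4) raise ValueError.
try:
    np.broadcast_shapes((20, 5), (20, 4))
except ValueError as err:
    print("not broadcastable:", err)
```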

In [12]:
print("Shape of X:", x.shape)
print("Shape of Y:", y.shape)
y


Shape of X: (20, 5)
Shape of Y: (4, 20, 5)


array([[[0.99006561, 0.9088096 , 0.31670581, 0.52471287, 0.52187106],
        [0.1063357 , 0.14227933, 0.4056292 , 0.21333014, 0.13200134],
        [0.7452258 , 0.70066455, 0.35681715, 0.85850643, 0.29509828],
        [0.5289299 , 0.5089104 , 0.07442997, 0.70152296, 0.55268921],
        [0.97373484, 0.62955893, 0.73574208, 0.56426242, 0.57265027],
        [0.79155005, 0.11877572, 0.59597486, 0.59114219, 0.84728759],
        [0.32491074, 0.56883829, 0.97885024, 0.31504281, 0.33656886],
        [0.25047432, 0.92454502, 0.21709229, 0.41889616, 0.63816653],
        [0.68763018, 0.71004542, 0.17480189, 0.40546584, 0.11481524],
        [0.10098263, 0.09330102, 0.14514827, 0.86113608, 0.46028402],
        [0.36042197, 0.27887421, 0.15275147, 0.55282398, 0.21838237],
        [0.72218656, 0.15667834, 0.95256282, 0.17569191, 0.35795382],
        [0.31185247, 0.01424091, 0.28352908, 0.8913361 , 0.72061344],
        [0.69684312, 0.92761812, 0.58426071, 0.6588935 , 0.62038695],
        ...

Now, here are three equivalent assignments. The first takes full advantage of broadcasting by letting NumPy automatically add a new dimension on the *left*. The second adds that dimension explicitly with the special NumPy alias "np.newaxis". Both create a singleton dimension without allocating a new array. That singleton dimension is then implicitly tiled to match the other operand of the addition, much as the third example does explicitly with "np.tile". Unlike the third example, however, broadcasting merely re-uses the existing data in memory.

In [13]:
a = x + y
print(a.shape)
b = x[np.newaxis, :, :] + y
print(b.shape)
c = np.tile(x, (4, 1, 1)) + y
print(c.shape)
print("Are a and b identical?", np.all(a == b))
print("Are a and c identical?", np.all(a == c))

(4, 20, 5)
(4, 20, 5)
(4, 20, 5)
Are a and b identical? True
Are a and c identical? True
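To make the memory argument concrete, "np.broadcast_to" exposes the implicitly tiled array that broadcasting works with. This is a sketch with illustrative names, not a description of what the addition does internally: it shows that the broadcast result is a view with a zero stride along the new axis, whereas "np.tile" allocates a full copy.

```python
import numpy as np

small = np.arange(100).reshape(20, 5)

# broadcast_to returns a read-only view; the new leading axis has stride 0,
# so all four "copies" point at the same 20 x 5 block of memory.
virtual = np.broadcast_to(small, (4, 20, 5))
print(virtual.strides[0])
print(np.shares_memory(small, virtual))

# np.tile, by contrast, allocates and fills a full-size array.
tiled = np.tile(small, (4, 1, 1))
print(np.shares_memory(small, tiled))
print(small.nbytes, tiled.nbytes)
```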


Here is another example: broadcasting two 1-D arrays against each other to make a 2-D array.

In [14]:
x = np.arange(-5, 5, 0.1)
y = np.arange(-8, 8, 0.25)
print(x.shape, y.shape)
z = x[np.newaxis, :] * y[:, np.newaxis]
print(z.shape)

(100,) (64,)
(64, 100)


In [15]:
# More concisely
y, x = np.ogrid[-8:8:0.25, -5:5:0.1]
print(x.shape, y.shape)
z = x * y
print(z.shape)

(1, 100) (64, 1)
(64, 100)
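When the coordinate arrays already exist, "np.meshgrid" with "sparse=True" gives a comparable pair of open grids. The sketch below uses illustrative names and checks that the result matches the explicit "np.newaxis" product.

```python
import numpy as np

xs = np.arange(-5, 5, 0.1)
ys = np.arange(-8, 8, 0.25)

# sparse=True returns grids with singleton axes instead of full 2-D arrays:
# gx has shape (1, len(xs)) and gy has shape (len(ys), 1).
gx, gy = np.meshgrid(xs, ys, sparse=True)
print(gx.shape, gy.shape)

# The broadcast product matches the explicit np.newaxis version.
z_sparse = gx * gy
z_manual = xs[np.newaxis, :] * ys[:, np.newaxis]
print(z_sparse.shape, np.array_equal(z_sparse, z_manual))
```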
