## NumPy
```
Q) What is the difference between a Python list and a NumPy array?
Ans) A NumPy array is a list-like container whose elements must all share one data type (dtype); a Python list can mix types.
```
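
As a rough illustration of the answer above, the sketch below builds a list with mixed types and then an array from the same values. The variable names are illustrative only.

```python
import numpy as np

# A Python list can hold objects of different types side by side.
mixed_list = [1, 2.5, 3]
list_types = [type(item).__name__ for item in mixed_list]
print(list_types)  # ['int', 'float', 'int']

# np.array picks one dtype that can represent every element,
# so the integers are stored as floats here.
mixed_array = np.array(mixed_list)
print(mixed_array.dtype)  # float64
```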

In [1]:
import numpy as np

In [2]:
print(np.__doc__)


NumPy
=====

Provides
  1. An array object of arbitrary homogeneous items
  2. Fast mathematical operations over arrays
  3. Linear Algebra, Fourier Transforms, Random Number Generation

How to use the documentation
----------------------------
Documentation is available in two forms: docstrings provided
with the code, and a loose standing reference guide, available from
`the NumPy homepage <https://numpy.org>`_.

We recommend exploring the docstrings using
`IPython <https://ipython.org>`_, an advanced Python shell with
TAB-completion and introspection capabilities.  See below for further
instructions.

The docstring examples assume that `numpy` has been imported as ``np``::

  >>> import numpy as np

Code snippets are indicated by three greater-than signs::

  >>> x = 42
  >>> x = x + 1

Use the built-in ``help`` function to view a function's docstring::

  >>> help(np.sort)
  ... # doctest: +SKIP

For some objects, ``np.info(obj)`` may provide additional help.  This is
particularly 

In [3]:
print(dir(np))



The main feature of NumPy is its array class, `ndarray`. Arrays are similar to lists, except that every element in an array must be of the same type.

In [4]:
my_array = np.array([1, 5, 7, 9])

my_array

array([1, 5, 7, 9])

In [5]:
type(my_array)

numpy.ndarray

In [6]:
my_array.dtype  # type of elements in array

dtype('int32')

In [7]:
my_array = np.array([1, 5, 7, 9], float)

my_array

array([1., 5., 7., 9.])

In [8]:
my_array.dtype  # type of elements in array

dtype('float64')

In [9]:
len(my_array)

4

In [10]:
my_array.shape

(4,)

Multi-dimensional Arrays

In [11]:
myary = np.array([(1, 2, 3), (5, 6, 7)], float)

myary

array([[1., 2., 3.],
       [5., 6., 7.]])

In [12]:
myary.shape

(2, 3)

In [13]:
len(myary)

2

In [14]:
my_array = np.arange(15)

In [15]:
print(my_array)

[ 0  1  2  3  4  5  6  7  8  9 10 11 12 13 14]


In [16]:
my_array.shape

(15,)

In [17]:
my_array.ndim  # the number of axes (dimensions) of the array.

1

In [18]:
my_array1 = my_array.reshape(3, 5)

print(my_array1)

[[ 0  1  2  3  4]
 [ 5  6  7  8  9]
 [10 11 12 13 14]]


In [19]:
my_array1.shape

(3, 5)

In [20]:
my_array1.ndim

2

In [21]:
my_array1.dtype.name

'int32'

**ndarray.itemsize**
```
the size in bytes of each element of the array.
For example, an array of elements of type float64 has itemsize 8 (=64/8), 
while one of type int32 has itemsize 4 (=32/8). 
It is equivalent to ndarray.dtype.itemsize.
```
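
A minimal sketch of the relationship between dtype, `itemsize` and total memory use. The array names are made up for illustration.

```python
import numpy as np

# itemsize is the number of bytes per element; it follows from the dtype.
ints = np.zeros(5, dtype=np.int32)
floats = np.zeros(5, dtype=np.float64)

print(ints.itemsize)    # 4  (32 bits / 8)
print(floats.itemsize)  # 8  (64 bits / 8)

# The same value is available from the dtype object.
print(floats.dtype.itemsize == floats.itemsize)  # True

# Total buffer size in bytes is size * itemsize, exposed as nbytes.
print(floats.nbytes)    # 40
```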

In [22]:
my_array1.itemsize

4

In [23]:
my_array1.size

15

In [24]:
type(my_array1)

numpy.ndarray

### Array Creation

In [25]:
my_array2 = np.array([6, 7, 8])

In [26]:
my_array2

array([6, 7, 8])

In [27]:
print(my_array2)

[6 7 8]


In [28]:
my_array2.dtype

dtype('int32')

In [29]:
type(my_array2)

numpy.ndarray

In [30]:
my_array3 = np.array([1.2, 3.5, 5.1])
print("my_array3.dtype", my_array3.dtype)
print("type(my_array3)", type(my_array3))

my_array3.dtype float64
type(my_array3) <class 'numpy.ndarray'>


In [31]:
np.array([(1.5, 2, 3), (4, 5, 6)])

array([[1.5, 2. , 3. ],
       [4. , 5. , 6. ]])

In [32]:
my_array3 = np.array([[1, 2], [3, 4]], dtype=complex)

In [33]:
my_array3

array([[1.+0.j, 2.+0.j],
       [3.+0.j, 4.+0.j]])

The _zeros_like_ and _ones_like_ functions create a new array with the same shape and dtype as an existing one:

In [34]:
np.zeros_like(my_array3)

array([[0.+0.j, 0.+0.j],
       [0.+0.j, 0.+0.j]])

In [35]:
np.ones_like(my_array3)

array([[1.+0.j, 1.+0.j],
       [1.+0.j, 1.+0.j]])

In [36]:
np.zeros((3, 4))

array([[0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.]])

In [37]:
np.ones((2, 3, 4), dtype=np.int16)  # dtype can also be specified

array([[[1, 1, 1, 1],
        [1, 1, 1, 1],
        [1, 1, 1, 1]],

       [[1, 1, 1, 1],
        [1, 1, 1, 1],
        [1, 1, 1, 1]]], dtype=int16)

In [38]:
np.ones((3, 4, 2), dtype=np.int16)  # dtype can also be specified

array([[[1, 1],
        [1, 1],
        [1, 1],
        [1, 1]],

       [[1, 1],
        [1, 1],
        [1, 1],
        [1, 1]],

       [[1, 1],
        [1, 1],
        [1, 1],
        [1, 1]]], dtype=int16)

In [39]:
print(np.ones((2, 3, 4), dtype=np.int16))

[[[1 1 1 1]
  [1 1 1 1]
  [1 1 1 1]]

 [[1 1 1 1]
  [1 1 1 1]
  [1 1 1 1]]]


In [40]:
print(np.ones((3, 4, 2), dtype=np.int16))

[[[1 1]
  [1 1]
  [1 1]
  [1 1]]

 [[1 1]
  [1 1]
  [1 1]
  [1 1]]

 [[1 1]
  [1 1]
  [1 1]
  [1 1]]]


In [41]:
np.ones((2, 2)) * 7

array([[7., 7.],
       [7., 7.]])

In [42]:
np.ones((2, 2), dtype=np.int16) * 7

array([[7, 7],
       [7, 7]], dtype=int16)

In [43]:
np.full((2, 2), 7)  # Create a constant array

array([[7, 7],
       [7, 7]])

In [44]:
np.eye(2)  # unit 2x2 matrix; "eye" represents "I"

array([[1., 0.],
       [0., 1.]])

The eye function returns matrices with ones along the kth diagonal:

In [45]:
np.eye(4, k=0, dtype=float)

array([[1., 0., 0., 0.],
       [0., 1., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 1.]])

In [46]:
np.eye(4, k=1, dtype=float)

array([[0., 1., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 1.],
       [0., 0., 0., 0.]])

In [47]:
np.eye(4, k=2, dtype=float)

array([[0., 0., 1., 0.],
       [0., 0., 0., 1.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.]])

In [48]:
np.eye(4, k=-1, dtype=float)

array([[0., 0., 0., 0.],
       [1., 0., 0., 0.],
       [0., 1., 0., 0.],
       [0., 0., 1., 0.]])

In [49]:
np.identity(4, dtype=float)

array([[1., 0., 0., 0.],
       [0., 1., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 1.]])

In [50]:
np.identity(6, dtype=float)

array([[1., 0., 0., 0., 0., 0.],
       [0., 1., 0., 0., 0., 0.],
       [0., 0., 1., 0., 0., 0.],
       [0., 0., 0., 1., 0., 0.],
       [0., 0., 0., 0., 1., 0.],
       [0., 0., 0., 0., 0., 1.]])

In [51]:
np.random.random((2, 3))  # Create an array filled with random values

array([[0.379798  , 0.21160595, 0.81160829],
       [0.58974561, 0.65116357, 0.07036802]])

In [52]:
np.empty((2, 3))  # uninitialized, output may vary

array([[0.379798  , 0.21160595, 0.81160829],
       [0.58974561, 0.65116357, 0.07036802]])

In [53]:
np.empty((2, 3))

array([[0.379798  , 0.21160595, 0.81160829],
       [0.58974561, 0.65116357, 0.07036802]])

In [54]:
np.random.random((2, 3))

array([[0.64368923, 0.4343231 , 0.49121185],
       [0.5183927 , 0.77152926, 0.1332249 ]])

In [55]:
np.empty((2, 3))

array([[0.64368923, 0.4343231 , 0.49121185],
       [0.5183927 , 0.77152926, 0.1332249 ]])

In [56]:
np.empty((2, 4))

array([[0.00000000e+000, 0.00000000e+000, 0.00000000e+000,
        0.00000000e+000],
       [0.00000000e+000, 4.36754031e-321, 1.37959740e-306,
        2.29175545e-312]])

To create sequences of numbers, NumPy provides a function analogous to range that returns arrays instead of lists.



In [57]:
np.arange(10, 30, 5)

array([10, 15, 20, 25])

In [58]:
np.arange(0, 2, 0.3)  # it accepts float arguments

array([0. , 0.3, 0.6, 0.9, 1.2, 1.5, 1.8])

In [59]:
np.arange(0.4, 2.2, 0.3)  # float start/stop also work, but rounding can let the stop value (2.2) slip into the result

array([0.4, 0.7, 1. , 1.3, 1.6, 1.9, 2.2])

In [60]:
np.linspace(0, 2, 9)  # 9 numbers from 0 to 2

array([0.  , 0.25, 0.5 , 0.75, 1.  , 1.25, 1.5 , 1.75, 2.  ])

In [61]:
np.linspace(0, 2, 7)

array([0.        , 0.33333333, 0.66666667, 1.        , 1.33333333,
       1.66666667, 2.        ])

In [62]:
np.linspace(0, 2, 5)

array([0. , 0.5, 1. , 1.5, 2. ])

In [63]:
x = np.linspace(0, 2 * np.pi, 100)  # useful to evaluate function at lots of points
x

array([0.        , 0.06346652, 0.12693304, 0.19039955, 0.25386607,
       0.31733259, 0.38079911, 0.44426563, 0.50773215, 0.57119866,
       0.63466518, 0.6981317 , 0.76159822, 0.82506474, 0.88853126,
       0.95199777, 1.01546429, 1.07893081, 1.14239733, 1.20586385,
       1.26933037, 1.33279688, 1.3962634 , 1.45972992, 1.52319644,
       1.58666296, 1.65012947, 1.71359599, 1.77706251, 1.84052903,
       1.90399555, 1.96746207, 2.03092858, 2.0943951 , 2.15786162,
       2.22132814, 2.28479466, 2.34826118, 2.41172769, 2.47519421,
       2.53866073, 2.60212725, 2.66559377, 2.72906028, 2.7925268 ,
       2.85599332, 2.91945984, 2.98292636, 3.04639288, 3.10985939,
       3.17332591, 3.23679243, 3.30025895, 3.36372547, 3.42719199,
       3.4906585 , 3.55412502, 3.61759154, 3.68105806, 3.74452458,
       3.8079911 , 3.87145761, 3.93492413, 3.99839065, 4.06185717,
       4.12532369, 4.1887902 , 4.25225672, 4.31572324, 4.37918976,
       4.44265628, 4.5061228 , 4.56958931, 4.63305583, 4.69652

In [64]:
f = np.sin(x)
f

array([ 0.00000000e+00,  6.34239197e-02,  1.26592454e-01,  1.89251244e-01,
        2.51147987e-01,  3.12033446e-01,  3.71662456e-01,  4.29794912e-01,
        4.86196736e-01,  5.40640817e-01,  5.92907929e-01,  6.42787610e-01,
        6.90079011e-01,  7.34591709e-01,  7.76146464e-01,  8.14575952e-01,
        8.49725430e-01,  8.81453363e-01,  9.09631995e-01,  9.34147860e-01,
        9.54902241e-01,  9.71811568e-01,  9.84807753e-01,  9.93838464e-01,
        9.98867339e-01,  9.99874128e-01,  9.96854776e-01,  9.89821442e-01,
        9.78802446e-01,  9.63842159e-01,  9.45000819e-01,  9.22354294e-01,
        8.95993774e-01,  8.66025404e-01,  8.32569855e-01,  7.95761841e-01,
        7.55749574e-01,  7.12694171e-01,  6.66769001e-01,  6.18158986e-01,
        5.67059864e-01,  5.13677392e-01,  4.58226522e-01,  4.00930535e-01,
        3.42020143e-01,  2.81732557e-01,  2.20310533e-01,  1.58001396e-01,
        9.50560433e-02,  3.17279335e-02, -3.17279335e-02, -9.50560433e-02,
       -1.58001396e-01, -

In [65]:
print(np.arange(10000))  # for large data, numpy skips displaying intermediate values

[   0    1    2 ... 9997 9998 9999]


In [66]:
print(np.arange(10000).reshape(100, 100))

[[   0    1    2 ...   97   98   99]
 [ 100  101  102 ...  197  198  199]
 [ 200  201  202 ...  297  298  299]
 ...
 [9700 9701 9702 ... 9797 9798 9799]
 [9800 9801 9802 ... 9897 9898 9899]
 [9900 9901 9902 ... 9997 9998 9999]]


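To disable this summarising behaviour and print the full array, raise the print threshold (this needs `import sys`; `threshold=np.nan` is rejected by current NumPy releases):
> __np.set_printoptions(threshold=sys.maxsize)__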
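
One way to lift the summarising threshold only temporarily is the `np.printoptions` context manager. This is a sketch that assumes NumPy 1.15 or later (where that context manager is available) and the default threshold of 1000 elements; `big` is an illustrative name.

```python
import sys

import numpy as np

big = np.arange(2000)

# Default behaviour: arrays above the threshold are summarised with "...".
summary = np.array2string(big)
print("..." in summary)  # True

# Temporarily lift the threshold; the previous options are restored on exit.
with np.printoptions(threshold=sys.maxsize):
    full = np.array2string(big)
print("..." in full)  # False
```

Unlike `np.set_printoptions`, the change does not leak into later cells.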

##### Automatic Re-shaping

In [67]:
a = np.arange(30)

In [68]:
a.shape

(30,)

In [69]:
a.shape = 2, -1, 3  # -1 means "whatever is needed"

In [70]:
a.shape

(2, 5, 3)

In [71]:
a

array([[[ 0,  1,  2],
        [ 3,  4,  5],
        [ 6,  7,  8],
        [ 9, 10, 11],
        [12, 13, 14]],

       [[15, 16, 17],
        [18, 19, 20],
        [21, 22, 23],
        [24, 25, 26],
        [27, 28, 29]]])

### Basic Operations

In [72]:
a = np.array([20, 30, 40, 50])
a

array([20, 30, 40, 50])

In [73]:
b = np.arange(4)
b

array([0, 1, 2, 3])

In [74]:
a + b

array([20, 31, 42, 53])

In [75]:
a - b

array([20, 29, 38, 47])

In [76]:
a * b

array([  0,  30,  80, 150])

In [77]:
b / a

array([0.        , 0.03333333, 0.05      , 0.06      ])

In [78]:
a % b

array([0, 0, 0, 2])

In [79]:
b % a

array([0, 1, 2, 3])

In [80]:
b**a

array([         0,          1,          0, -794958903])

In [81]:
b**2

array([0, 1, 4, 9])

In [82]:
d = np.arange(3)
d

array([0, 1, 2])

In [83]:
try:
    a - d
except ValueError as ex:
    print(ex)
    print("shapes must be equal or broadcast-compatible")

operands could not be broadcast together with shapes (4,) (3,) 
shapes must be equal or broadcast-compatible


In [84]:
10 * np.sin(a)  # sin is applied element-wise, then each result is multiplied by 10

array([ 9.12945251, -9.88031624,  7.4511316 , -2.62374854])

In [85]:
a, a < 35

(array([20, 30, 40, 50]), array([ True,  True, False, False]))

In [86]:
A = np.array([[1, 1], [0, 1]])

B = np.array([[2, 0], [3, 4]])

In [87]:
A * B  # elementwise product

array([[2, 0],
       [0, 4]])

In [88]:
A @ B  # matrix product  - works only in Python >= 3.5

array([[5, 4],
       [3, 4]])

In [89]:
A.dot(B)  # another matrix product

array([[5, 4],
       [3, 4]])

In [90]:
a = np.ones((2, 3), dtype=int)
a

array([[1, 1, 1],
       [1, 1, 1]])

In [91]:
a *= 3

In [92]:
a

array([[3, 3, 3],
       [3, 3, 3]])

In [93]:
b = np.random.random((2, 3))
b += a

In [94]:
b

array([[3.20858879, 3.98856823, 3.1855899 ],
       [3.63431051, 3.61171107, 3.59624504]])

In [95]:
try:
    a += b  # b is not automatically converted to integer type
except Exception as ex:
    print(repr(ex))

UFuncTypeError(<ufunc 'add'>, 'same_kind', dtype('float64'), dtype('int32'), 2)


When operating with arrays of different types, the type of the resulting array corresponds to the more general or precise one (a behavior known as upcasting).

In [96]:
a = np.ones(3, dtype=np.int32)
a

array([1, 1, 1])

In [97]:
b = np.linspace(0, np.pi, 3)
print(b, b.dtype.name)

[0.         1.57079633 3.14159265] float64


In [98]:
c = a + b

c

array([1.        , 2.57079633, 4.14159265])

In [99]:
c.dtype.name

'float64'

In [100]:
c * 1j

array([0.+1.j        , 0.+2.57079633j, 0.+4.14159265j])

In [101]:
d = np.exp(c * 1j)

d

array([ 0.54030231+0.84147098j, -0.84147098+0.54030231j,
       -0.54030231-0.84147098j])

In [102]:
d.dtype.name

'complex128'
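
The upcasting shown in the cells above can also be queried without doing any arithmetic. A small sketch using `np.result_type`; the array names are illustrative.

```python
import numpy as np

# np.result_type reports the dtype NumPy would pick for a mixed operation.
int_float = np.result_type(np.int32, np.float64)
float_complex = np.result_type(np.float64, np.complex128)
print(int_float)      # float64
print(float_complex)  # complex128

# The same rule applies when the arrays are actually combined.
ints = np.ones(3, dtype=np.int32)
floats = np.ones(3, dtype=np.float64)
combined = ints + floats
print(combined.dtype)  # float64
```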

In [103]:
a = np.array([2, 4, 3, 34, 324, 213, 12, 23, 34, 45, 67, -234], float)

In [104]:
a.min()  # smallest element of the array

-234.0

In [105]:
a.max()

324.0

In [106]:
a.sum()  # sum of all elements

527.0

In [107]:
a.prod()  # product of all elements

-3.728257888085453e+17

Alternatively, 

In [108]:
np.sum(a)

527.0

In [109]:
np.prod(a)

-3.728257888085453e+17

Statistical quantities

In [110]:
a.mean()

43.916666666666664

In [111]:
a.var()

15957.07638888889

In [112]:
a.std()

126.32132198836779

In [113]:
np.mean(a)

43.916666666666664

In [114]:
np.var(a)

15957.07638888889

In [115]:
np.std(a)

126.32132198836779

The argmin and argmax functions return the array indices of the minimum and maximum
values

In [116]:
a = np.array([2, 4, 3, 34, 324, 213, 12, 23, 34, 45, 67, -234], float)
#             0  1  2   3   4    5    6   7   8   9  10    11
print(f"a.argmin(): {a.argmin()}")
print(f"a.argmax(): {a.argmax()}")

a.argmin(): 11
a.argmax(): 4


In [117]:
a = np.array([[0, 2], [3, -1], [3, 5]], float)  # shape (3, 2)

print(f"a.mean(axis=0):{a.mean(axis=0)}")  # column means: (0 + 3 + 3)/3, (2 - 1 + 5)/3
print(f"a.mean(axis=1):{a.mean(axis=1)}")  # row means: (0 + 2)/2, (3 - 1)/2, (3 + 5)/2

a.mean(axis=0):[2. 2.]
a.mean(axis=1):[1. 1. 4.]


In [118]:
a

array([[ 0.,  2.],
       [ 3., -1.],
       [ 3.,  5.]])

In [119]:
print(f"a.min(axis=0):{a.min(axis=0)}")
print(f"a.min(axis=1):{a.min(axis=1)}")

a.min(axis=0):[ 0. -1.]
a.min(axis=1):[ 0. -1.  3.]


In [120]:
print(f"a.max(axis=0):{a.max(axis=0)}")
print(f"a.max(axis=1):{a.max(axis=1)}")

a.max(axis=0):[3. 5.]
a.max(axis=1):[2. 3. 5.]


In [121]:
a = np.array([6, 2, 5, -1, 0], float)
print(f"sorted(a): {sorted(a)}")
print(f"a        : {a}")

sorted(a): [-1.0, 0.0, 2.0, 5.0, 6.0]
a        : [ 6.  2.  5. -1.  0.]


In [122]:
a.sort()
a

array([-1.,  0.,  2.,  5.,  6.])

Values in an array can be "clipped" to be within a prespecified range. This is the same as
applying min(max(x, minval), maxval) to each element x in an array.

In [123]:
a = np.array([6, 2, 5, -1, 0], float)
a.clip(0, 5)

array([5., 2., 5., 0., 0.])
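
The equivalence stated above can be checked directly. A minimal sketch, with `by_hand` and `clipped` as illustrative names:

```python
import numpy as np

values = np.array([6, 2, 5, -1, 0], float)
minval, maxval = 0, 5

# Apply min(max(x, minval), maxval) to each element with plain Python.
by_hand = np.array([min(max(x, minval), maxval) for x in values])

# Vectorised equivalent.
clipped = values.clip(minval, maxval)

print(np.array_equal(by_hand, clipped))  # True
```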

Unique elements can be extracted from an array:

In [124]:
a = np.array([6, 6, 1, 1, 4, 5, 5, 5, 7], float)
np.unique(a)  # unique sorted ascending

array([1., 4., 5., 6., 7.])

For two dimensional arrays, the diagonal can be extracted:

In [125]:
a = np.array([[1, 2], [3, 4]], float)
# [
#     [1, 2],
#     [3, 4]
# ]

a.diagonal()

array([1., 4.])

In [126]:
x = np.array([[1, 2], [3, 4]])

print(x, np.sum(x))  # Compute sum of all elements; prints "10"

[[1 2]
 [3 4]] 10


In [127]:
print(np.sum(x, axis=0))  # Compute sum of each column; prints "[4 6]"

[4 6]


In [128]:
print(np.sum(x, axis=1))  # Compute sum of each row; prints "[3 7]"

[3 7]


In [129]:
b = np.arange(12).reshape(3, 4)

In [130]:
b

array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11]])

In [131]:
b.cumsum(axis=1)  # cumulative sum along each row

array([[ 0,  1,  3,  6],
       [ 4,  9, 15, 22],
       [ 8, 17, 27, 38]])

In [132]:
b.cumsum(axis=0)  # cumulative sum along each column

array([[ 0,  1,  2,  3],
       [ 4,  6,  8, 10],
       [12, 15, 18, 21]])

In [133]:
b.cumsum()

array([ 0,  1,  3,  6, 10, 15, 21, 28, 36, 45, 55, 66])

In [134]:
b.sum()

66

In [135]:
B = np.arange(3)
B

array([0, 1, 2])

In [136]:
np.exp(B)

array([1.        , 2.71828183, 7.3890561 ])

In [137]:
np.sqrt(B)

array([0.        , 1.        , 1.41421356])

In [138]:
C = np.array([2.0, -1.0, 4.0])
C

array([ 2., -1.,  4.])

In [139]:
np.add(B, C)

array([2., 0., 6.])

In [140]:
B + C

array([2., 0., 6.])

### Array concatenation

In [141]:
a = np.array([1, 2], float)
b = np.array([3, 4, 5, 6], float)
c = np.array([7, 8, 9], float)

np.concatenate((a, b, c))

array([1., 2., 3., 4., 5., 6., 7., 8., 9.])

In [142]:
try:
    a + b + c
except ValueError as ex:
    print(ex)

operands could not be broadcast together with shapes (2,) (4,) 


If an array has more than one dimension, it is possible to specify the axis along which multiple
arrays are concatenated. By default (without specifying the axis), NumPy concatenates along
the first dimension:

In [143]:
a = np.array([[1, 2], [3, 4]], float)
b = np.array([[5, 6], [7, 8]], float)

In [144]:
try:
    np.concatenate(a, b)
except TypeError as ex:
    print(ex)

only integer scalar arrays can be converted to a scalar index


In [145]:
np.concatenate((a, b))

array([[1., 2.],
       [3., 4.],
       [5., 6.],
       [7., 8.]])

In [146]:
np.concatenate((a, b), axis=0)

array([[1., 2.],
       [3., 4.],
       [5., 6.],
       [7., 8.]])

In [147]:
np.concatenate((a, b), axis=1)

array([[1., 2., 5., 6.],
       [3., 4., 7., 8.]])

### Indexing, Slicing and Iterating

One-dimensional arrays can be indexed, sliced and iterated over, much like lists and other Python sequences.



In [148]:
a = np.arange(10) ** 3

In [149]:
a

array([  0,   1,   8,  27,  64, 125, 216, 343, 512, 729], dtype=int32)

In [150]:
a[2]

8

In [151]:
a[2:5]

array([ 8, 27, 64], dtype=int32)

In [152]:
a[:6:2]

array([ 0,  8, 64], dtype=int32)

In [153]:
a[:6:2] = -1000  # equivalent to a[0:6:2] = -1000

In [154]:
print(a)

[-1000     1 -1000    27 -1000   125   216   343   512   729]


In [155]:
a[::-1]  # reversed a

array([  729,   512,   343,   216,   125, -1000,    27, -1000,     1,
       -1000], dtype=int32)

In [156]:
for i in a:
    print(i, i ** (1 / 3.0))

-1000 nan
1 1.0
-1000 nan
27 3.0
-1000 nan
125 5.0
216 5.999999999999999
343 6.999999999999999
512 7.999999999999999
729 8.999999999999998



In [157]:
m = np.array([[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]])

In [158]:
m

array([[ 1,  2,  3,  4],
       [ 5,  6,  7,  8],
       [ 9, 10, 11, 12]])

In [159]:
n = m[:2, 1:3]

In [160]:
n

array([[2, 3],
       [6, 7]])

In [161]:
m[0, 1]

2

In [162]:
n[0, 0] = 77  # n[0, 0] is the same piece of data as m[0, 1]
n

array([[77,  3],
       [ 6,  7]])

In [163]:
m[0, 1]

77

In [164]:
m

array([[ 1, 77,  3,  4],
       [ 5,  6,  7,  8],
       [ 9, 10, 11, 12]])

Multidimensional arrays can have one index per axis. These indices are given in a tuple separated by commas:

In [165]:
def f(x, y):
    return 10 * x + y


b = np.fromfunction(f, (5, 4), dtype=int)

In [166]:
b

array([[ 0,  1,  2,  3],
       [10, 11, 12, 13],
       [20, 21, 22, 23],
       [30, 31, 32, 33],
       [40, 41, 42, 43]])

In [167]:
b[2, 3]

23

In [168]:
b[0:5, 1]  # each row in the second column of b

array([ 1, 11, 21, 31, 41])

In [169]:
b[:, 1]  # equivalent to the previous example

array([ 1, 11, 21, 31, 41])

In [170]:
b[1:3, :]  # each column in the second and third row of b

array([[10, 11, 12, 13],
       [20, 21, 22, 23]])

When fewer indices are provided than the number of axes, the missing indices are considered complete slices:

In [171]:
b[-1]  # the last row. Equivalent to b[-1,:]

array([40, 41, 42, 43])

In [172]:
c = np.array(
    [
        [[0, 1, 2], [10, 12, 13]],  # a 3D array (two stacked 2D arrays)
        [[100, 101, 102], [110, 112, 113]],
    ]
)

In [173]:
c.shape

(2, 2, 3)

In [174]:
c[1, ...]  # same as c[1,:,:] or c[1]

array([[100, 101, 102],
       [110, 112, 113]])

In [175]:
c[..., 2]  # same as c[:,:,2]

array([[  2,  13],
       [102, 113]])

Iterating over multidimensional arrays is done with respect to the first axis:

In [176]:
for row in b:
    print(row)

[0 1 2 3]
[10 11 12 13]
[20 21 22 23]
[30 31 32 33]
[40 41 42 43]


In [177]:
b.flat, type(b.flat)

(<numpy.flatiter at 0x1f21ca889f0>, numpy.flatiter)
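
The `flat` attribute shown above is an iterator over every element, which contrasts with plain iteration over rows. A short sketch on a small made-up array:

```python
import numpy as np

grid = np.array([[0, 1], [10, 11]])

# Iterating the array itself yields rows (first axis).
rows = [row.tolist() for row in grid]
print(rows)  # [[0, 1], [10, 11]]

# Iterating grid.flat yields every element in row-major order.
elements = [int(value) for value in grid.flat]
print(elements)  # [0, 1, 10, 11]
```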

In [178]:
b.flatten()

array([ 0,  1,  2,  3, 10, 11, 12, 13, 20, 21, 22, 23, 30, 31, 32, 33, 40,
       41, 42, 43])

In [179]:
b

array([[ 0,  1,  2,  3],
       [10, 11, 12, 13],
       [20, 21, 22, 23],
       [30, 31, 32, 33],
       [40, 41, 42, 43]])

In [180]:
b.shape

(5, 4)

In [181]:
b = b.reshape(1, 20)
b

array([[ 0,  1,  2,  3, 10, 11, 12, 13, 20, 21, 22, 23, 30, 31, 32, 33,
        40, 41, 42, 43]])

### Shape Manipulation

##### Changing the shape of an array

In [182]:
a = np.floor(10 * np.random.random((3, 4)))

In [183]:
a

array([[5., 4., 7., 6.],
       [2., 1., 7., 1.],
       [8., 0., 2., 7.]])

In [184]:
a.shape

(3, 4)

In [185]:
a.flatten()

array([5., 4., 7., 6., 2., 1., 7., 1., 8., 0., 2., 7.])

In [186]:
a

array([[5., 4., 7., 6.],
       [2., 1., 7., 1.],
       [8., 0., 2., 7.]])

In [187]:
a.ravel()  # returns the array, flattened

array([5., 4., 7., 6., 2., 1., 7., 1., 8., 0., 2., 7.])

In [188]:
a

array([[5., 4., 7., 6.],
       [2., 1., 7., 1.],
       [8., 0., 2., 7.]])

In [189]:
a.reshape(6, 2)  # returns the array with a modified shape

array([[5., 4.],
       [7., 6.],
       [2., 1.],
       [7., 1.],
       [8., 0.],
       [2., 7.]])

In [190]:
a.T  # returns the array, transposed

array([[5., 2., 8.],
       [4., 1., 0.],
       [7., 7., 2.],
       [6., 1., 7.]])

In [191]:
a.T.shape

(4, 3)

In [192]:
a.shape

(3, 4)

In [193]:
a

array([[5., 4., 7., 6.],
       [2., 1., 7., 1.],
       [8., 0., 2., 7.]])

In [194]:
a.resize((2, 6))
a

array([[5., 4., 7., 6., 2., 1.],
       [7., 1., 8., 0., 2., 7.]])

If a dimension is given as -1 in a reshaping operation, the other dimensions are automatically calculated:

In [195]:
a.reshape(3, -1)

array([[5., 4., 7., 6.],
       [2., 1., 7., 1.],
       [8., 0., 2., 7.]])

##### Stacking together different arrays
Several arrays can be stacked together along different axes:


In [196]:
x = np.arange(0, 10, 2)  # x=([0,2,4,6,8])
y = np.arange(5)  # y=([0,1,2,3,4])

In [197]:
x

array([0, 2, 4, 6, 8])

In [198]:
y

array([0, 1, 2, 3, 4])

In [199]:
np.vstack([x, y])  # stack the 1D arrays as rows of a 2D array

array([[0, 2, 4, 6, 8],
       [0, 1, 2, 3, 4]])

In [200]:
np.hstack([x, y])  # join the 1D arrays end to end

array([0, 2, 4, 6, 8, 0, 1, 2, 3, 4])

In [201]:
a = np.floor(10 * np.random.random((2, 2)))
a

array([[9., 1.],
       [2., 9.]])

In [202]:
b = np.floor(10 * np.random.random((2, 2)))
b

array([[8., 8.],
       [6., 8.]])

In [203]:
np.vstack((a, b))  # vertical stack (rows appended), same as np.concatenate((a, b), axis=0)

array([[9., 1.],
       [2., 9.],
       [8., 8.],
       [6., 8.]])

In [204]:
np.hstack((a, b))  # horizontal stack (columns appended), same as np.concatenate((a, b), axis=1)

array([[9., 1., 8., 8.],
       [2., 9., 6., 8.]])

The function column_stack stacks 1D arrays as columns into a 2D array. It is equivalent to hstack only for 2D arrays:

In [205]:
np.column_stack((a, b))  # with 2D arrays

array([[9., 1., 8., 8.],
       [2., 9., 6., 8.]])

In [206]:
a = np.array([4.0, 2.0])
b = np.array([3.0, 8.0])

In [207]:
np.column_stack((a, b))  # returns a 2D array

array([[4., 3.],
       [2., 8.]])

In [208]:
np.hstack((a, b))  # the result is different

array([4., 2., 3., 8.])

In [209]:
np.vstack((a, b))

array([[4., 2.],
       [3., 8.]])

In [210]:
a

array([4., 2.])

In [211]:
from numpy import newaxis

a[:, newaxis]  # adds a second axis, turning a into a 2D column vector

array([[4.],
       [2.]])

In [212]:
np.column_stack((a[:, newaxis], b[:, newaxis]))

array([[4., 3.],
       [2., 8.]])

In [213]:
np.hstack((a[:, newaxis], b[:, newaxis]))  # the result is the same

array([[4., 3.],
       [2., 8.]])

##### Splitting one array into several smaller ones

In [214]:
a = np.floor(10 * np.random.random((2, 12)))

In [215]:
a

array([[2., 6., 7., 4., 2., 4., 3., 6., 1., 8., 5., 7.],
       [3., 2., 4., 1., 2., 7., 5., 0., 1., 6., 2., 9.]])

In [216]:
print(np.hsplit(a, 3))  # Split a into 3

[array([[2., 6., 7., 4.],
       [3., 2., 4., 1.]]), array([[2., 4., 3., 6.],
       [2., 7., 5., 0.]]), array([[1., 8., 5., 7.],
       [1., 6., 2., 9.]])]


In [217]:
np.hsplit(a, (3, 4))  # Split a after the third and the fourth column

[array([[2., 6., 7.],
        [3., 2., 4.]]),
 array([[4.],
        [1.]]),
 array([[2., 4., 3., 6., 1., 8., 5., 7.],
        [2., 7., 5., 0., 1., 6., 2., 9.]])]

vsplit splits along the vertical axis, and array_split allows one to specify along which axis to split.
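
A brief sketch of both functions on a small illustrative array (`grid` is not part of the notebook above):

```python
import numpy as np

grid = np.arange(24).reshape(4, 6)

# vsplit cuts along axis 0 (rows): two blocks of shape (2, 6).
top, bottom = np.vsplit(grid, 2)
print(top.shape, bottom.shape)  # (2, 6) (2, 6)

# array_split takes an axis argument and tolerates uneven splits:
# 6 columns into 4 parts gives widths 2, 2, 1, 1.
parts = np.array_split(grid, 4, axis=1)
print([p.shape for p in parts])  # [(4, 2), (4, 2), (4, 1), (4, 1)]
```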



### Copies and Views
##### No Copy at All
Simple assignments make no copy of array objects or of their data.



In [218]:
a = np.arange(12)

In [219]:
b = a  # no new object is created

In [220]:
b is a  # a and b are two names for the same ndarray object

True

In [221]:
id(a), id(b)

(2139635266832, 2139635266832)

In [222]:
b.shape = 3, 4  # changes the shape of a
b

array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11]])

In [223]:
a.shape

(3, 4)

Python passes arguments by object reference, so a function call does not copy the array.



In [224]:
def f(x):
    print(id(x))

In [225]:
id(a)  # id is a unique identifier of an object

2139635266832

In [226]:
f(a)

2139635266832
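
A practical consequence is that in-place changes made inside a function are visible to the caller. A minimal sketch, where `zero_first` is a hypothetical helper:

```python
import numpy as np


def zero_first(arr):
    """Hypothetical helper: overwrite the first element in place."""
    arr[0] = 0


data = np.array([5, 6, 7])
zero_first(data)
print(data.tolist())  # [0, 6, 7]
```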


##### View or Shallow Copy
Different array objects can share the same data. The view method creates a new array object that looks at the same data.



In [227]:
c = a.view()

In [228]:
c

array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11]])

In [229]:
c is a

False

In [230]:
id(a), id(b), id(c)

(2139635266832, 2139635266832, 2139635135280)

In [231]:
c.base is a  # c is a view of the data owned by a

True

In [232]:
c.flags.owndata

False

In [233]:
c.shape = 2, 6  # a's shape doesn't change

In [234]:
a.shape

(3, 4)

In [235]:
c[0, 4] = 1234  # a's data changes

In [236]:
print(c)

[[   0    1    2    3 1234    5]
 [   6    7    8    9   10   11]]


In [237]:
print(a)

[[   0    1    2    3]
 [1234    5    6    7]
 [   8    9   10   11]]


Slicing an array returns a view of it:



In [238]:
s = a[:, 1:3]  # spaces added for clarity; could also be written "s = a[:,1:3]"

In [239]:
s[:] = 10  # s[:] is a view of s. Note the difference between s=10 and s[:]=10

In [240]:
a

array([[   0,   10,   10,    3],
       [1234,   10,   10,    7],
       [   8,   10,   10,   11]])
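
Whether two arrays share data can be checked with `np.shares_memory`. The sketch below contrasts a basic slice (a view) with indexing by a list of indices, which returns a copy; the names are illustrative.

```python
import numpy as np

base = np.arange(12).reshape(3, 4)

# A basic slice is a view: it shares memory with the original.
view = base[:, 1:3]
print(np.shares_memory(base, view))  # True

# Assigning through the slice changes base as well.
view[:] = -1
print(base[0].tolist())  # [0, -1, -1, 3]

# Indexing with a list of indices returns a copy instead.
picked = base[:, [1, 2]]
print(np.shares_memory(base, picked))  # False
```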

##### Deep Copy
The copy method makes a complete copy of the array and its data.



In [241]:
d = a.copy()  # a new array object with new data is created

In [242]:
d is a

False

In [243]:
d.base is a  # d doesn't share anything with a

False

In [244]:
d.flags.owndata

True

In [245]:
d[0, 0] = 9999

In [246]:
a

array([[   0,   10,   10,    3],
       [1234,   10,   10,    7],
       [   8,   10,   10,   11]])

### Broadcasting rules

Broadcasting allows universal functions to deal in a meaningful way with inputs that do not have exactly the same shape.

The first rule of broadcasting is that if the input arrays do not all have the same number of dimensions, a “1” will be repeatedly prepended to the shapes of the smaller arrays until all the arrays have the same number of dimensions.

The second rule of broadcasting ensures that arrays with a size of 1 along a particular dimension act as if they had the size of the array with the largest shape along that dimension. The value of the array element is assumed to be the same along that dimension for the “broadcast” array.

After application of the broadcasting rules, the sizes of all arrays must match.
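
The two rules can be traced on a small example. This is a sketch with made-up arrays; `np.broadcast_shapes` is assumed to be available, which requires NumPy 1.20 or later.

```python
import numpy as np

column = np.arange(3).reshape(3, 1)   # shape (3, 1)
row = np.array([10, 20, 30, 40])      # shape (4,)

# Rule 1: row is treated as shape (1, 4) so both operands are 2D.
# Rule 2: the size-1 axes are stretched, giving a (3, 4) result.
total = column + row
print(total.shape)  # (3, 4)
print(total)

# np.broadcast_shapes applies the same rules to shapes alone.
result_shape = np.broadcast_shapes((3, 1), (4,))
print(result_shape)  # (3, 4)

# Sizes that are neither equal nor 1 cannot be broadcast.
try:
    np.arange(4) + np.arange(3)
    failed = False
except ValueError as ex:
    failed = True
    print(ex)
```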

#### Indexing with Arrays of Indices

In [247]:
a = np.arange(12) ** 2  # the first 12 square numbers
a

array([  0,   1,   4,   9,  16,  25,  36,  49,  64,  81, 100, 121])

In [248]:
i = np.array([1, 1, 3, 8, 5])  # an array of indices

In [249]:
a[i]

array([ 1,  1,  9, 64, 25])

In [250]:
j = np.array([[3, 4], [9, 7]])  # a bidimensional array of indices

In [251]:
a[j]  # the same shape as j

array([[ 9, 16],
       [81, 49]])

In [252]:
palette = np.array(
    [
        [0, 0, 0],  # black
        [255, 0, 0],  # red
        [0, 255, 0],  # green
        [0, 0, 255],  # blue
        [255, 255, 255],  # white
    ]
)

image = np.array(
    [[0, 1, 2, 0], [0, 3, 4, 0]]  # each value corresponds to a color in the palette
)

print(palette[image])  # the (2,4,3) color image

[[[  0   0   0]
  [255   0   0]
  [  0 255   0]
  [  0   0   0]]

 [[  0   0   0]
  [  0   0 255]
  [255 255 255]
  [  0   0   0]]]


#### Simple Array Operations

In [253]:
a = np.array([[1.0, 2.0], [3.0, 4.0]])

In [254]:
a

array([[1., 2.],
       [3., 4.]])

In [255]:
a.transpose()

array([[1., 3.],
       [2., 4.]])

In [256]:
print(dir(np.linalg))

['LinAlgError', '__all__', '__builtins__', '__cached__', '__doc__', '__file__', '__loader__', '__name__', '__package__', '__path__', '__spec__', '_umath_linalg', 'cholesky', 'cond', 'det', 'eig', 'eigh', 'eigvals', 'eigvalsh', 'inv', 'linalg', 'lstsq', 'matrix_power', 'matrix_rank', 'multi_dot', 'norm', 'pinv', 'qr', 'slogdet', 'solve', 'svd', 'tensorinv', 'tensorsolve', 'test']


In [257]:
np.linalg.inv(a)

array([[-2. ,  1. ],
       [ 1.5, -0.5]])

In [258]:
u = np.eye(2)  # unit 2x2 matrix; "eye" represents "I"

In [259]:
u

array([[1., 0.],
       [0., 1.]])

In [260]:
j = np.array([[0.0, -1.0], [1.0, 0.0]])

In [261]:
j @ j  # matrix product; the @ operator requires Python >= 3.5

array([[-1.,  0.],
       [ 0., -1.]])

In [262]:
np.trace(u)  # trace: the sum of the elements on the main diagonal

2.0

In [263]:
y = np.array([[5.0], [7.0]])

In [264]:
np.linalg.solve(a, y)

array([[-3.],
       [ 4.]])

In [265]:
np.linalg.eig(j)

EigResult(eigenvalues=array([0.+1.j, 0.-1.j]), eigenvectors=array([[0.70710678+0.j        , 0.70710678-0.j        ],
       [0.        -0.70710678j, 0.        +0.70710678j]]))