# NumPy Indexing and Selection

#### https://github.com/SelcukDE



In this lecture we will discuss how to select elements or groups of elements from an array.

In [21]:
import numpy as np

In [22]:
arr = np.arange(0, 11)
arr

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

## Bracket Indexing and Selection
The simplest way to pick one or several elements of an array looks very similar to indexing Python lists:

In [23]:
arr[8]

8

In [24]:
arr[-1]

10

In [25]:
arr[1:5]

array([1, 2, 3, 4])

In [26]:
arr[0:5]

array([0, 1, 2, 3, 4])

In [7]:
# arr[start:stop:step]

In [8]:
arr[1::2]

array([1, 3, 5, 7, 9])

In [9]:
arr[0::2]

array([ 0,  2,  4,  6,  8, 10])
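The `arr[start:stop:step]` pattern above can be pushed a little further. The following is a small sketch (the variable names are only illustrative) showing that omitted parts default to the ends of the array and that a negative step walks backwards:

```python
import numpy as np

arr = np.arange(0, 11)

# Omitted start and stop default to the ends of the array.
every_third = arr[::3]     # indices 0, 3, 6, 9
reversed_arr = arr[::-1]   # a negative step walks from the end to the start
middle = arr[2:9:3]        # indices 2, 5, 8 (stop is exclusive)

print(every_third)   # [0 3 6 9]
print(reversed_arr)  # [10  9  8  7  6  5  4  3  2  1  0]
print(middle)        # [2 5 8]
```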

## Broadcasting

NumPy arrays differ from normal Python lists in their ability to broadcast: a single value assigned to a slice is spread across every element of that slice.

In [14]:
arr

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [27]:
arr[0:5] = 100

arr

array([100, 100, 100, 100, 100,   5,   6,   7,   8,   9,  10])

In [28]:
a = [0, 2, 4, 6, 8]

In [33]:
a[0:3] = 100

TypeError: can only assign an iterable
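To put the two behaviours side by side, here is a minimal sketch. The names `a`, `b`, and `raised` are only for illustration; the point is that a list slice needs one value per position, while NumPy broadcasts the scalar:

```python
import numpy as np

a = [0, 2, 4, 6, 8]
raised = False
try:
    a[0:3] = 100            # a list slice needs an iterable on the right-hand side
except TypeError:
    raised = True           # the list is left unchanged

a[0:3] = [100] * 3          # list equivalent: spell out one value per position

b = np.array([0, 2, 4, 6, 8])
b[0:3] = 100                # NumPy broadcasts the scalar across the slice

print(raised)  # True
print(a)       # [100, 100, 100, 6, 8]
print(b)       # [100 100 100   6   8]
```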

In [31]:
arr = np.arange(0,11)

In [32]:
arr

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [34]:
slice_of_arr = arr[0:6]

In [35]:
slice_of_arr

array([0, 1, 2, 3, 4, 5])

In [36]:
slice_of_arr[:] = 99

In [37]:
slice_of_arr

array([99, 99, 99, 99, 99, 99])

Now note the changes also occur in our original array!

In [38]:
arr

array([99, 99, 99, 99, 99, 99,  6,  7,  8,  9, 10])

The data is not copied: a slice is a view of the original array. This avoids duplicating large arrays in memory. When you need an independent array, ask for a copy explicitly:

In [39]:
arr_copy = arr.copy()

arr_copy

array([99, 99, 99, 99, 99, 99,  6,  7,  8,  9, 10])

In [40]:
arr_copy[0:6] = [0,1,2,3,4,5]

In [41]:
arr_copy

array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [42]:
arr

array([99, 99, 99, 99, 99, 99,  6,  7,  8,  9, 10])
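If you are ever unsure whether two arrays share data, `np.shares_memory` and the `base` attribute can help. This is a short sketch with illustrative variable names, not the only way to check:

```python
import numpy as np

arr = np.arange(0, 11)
view = arr[0:6]             # basic slicing returns a view
copied = arr[0:6].copy()    # an explicit copy owns its own data

print(np.shares_memory(arr, view))    # True
print(np.shares_memory(arr, copied))  # False
print(view.base is arr)               # True: the view points back at arr

view[0] = -1                # writes through to arr
copied[1] = -2              # leaves arr untouched
print(arr[:3])              # [-1  1  2]
```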

## Indexing a 2D array (matrices)

<p>The general format is <b>arr_2d[row][col]</b> or <b>arr_2d[row,col]</b>. I usually recommend the comma notation for clarity.</p>

In [43]:
arr_2d = np.array([[5,10,15], [20,25,30], [35, 40,45]])

arr_2d

array([[ 5, 10, 15],
       [20, 25, 30],
       [35, 40, 45]])

In [44]:
arr_2d[1]

array([20, 25, 30])

In [45]:
arr_2d[1][0]

20

In [46]:
arr_2d[1, 0]

20

In [47]:
arr_2d[2]

array([35, 40, 45])

In [48]:
arr_2d[2, :]

array([35, 40, 45])

In [49]:
arr_2d

array([[ 5, 10, 15],
       [20, 25, 30],
       [35, 40, 45]])

In [50]:
arr_2d[:,1]

array([10, 25, 40])

In [51]:
arr_2d[:,2]

array([15, 30, 45])

In [52]:
arr_2d[1,1] = 55

In [53]:
arr_2d

array([[ 5, 10, 15],
       [20, 55, 30],
       [35, 40, 45]])

In [54]:
arr_2d.dtype

dtype('int32')

In [55]:
arr_2d[0,0] = 3.3

In [56]:
arr_2d

array([[ 3, 10, 15],
       [20, 55, 30],
       [35, 40, 45]])
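Notice that assigning `3.3` stored `3`: an array keeps a single dtype, so a float written into an integer array loses its fractional part without any warning. A small sketch of this, and of converting with `astype` first when the fraction matters (`float_2d` is just an illustrative name):

```python
import numpy as np

arr_2d = np.array([[5, 10, 15], [20, 25, 30], [35, 40, 45]])

arr_2d[0, 0] = 3.9                 # integer array: the fraction is dropped, not rounded
print(arr_2d[0, 0])                # 3

float_2d = arr_2d.astype(float)    # astype returns a new array with a float dtype
float_2d[0, 0] = 3.9
print(float_2d[0, 0])              # 3.9
```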

### Fancy Indexing

Fancy indexing allows you to select entire rows or columns in any order. To show this, let's quickly build a NumPy array:

In [57]:
v = np.arange(0, 30, 3)

In [58]:
v

array([ 0,  3,  6,  9, 12, 15, 18, 21, 24, 27])

In [59]:
v[1]

3

In [60]:
v[5]

15

In [61]:
idx_list = [1,3,5]

In [None]:
v[idx_list]

In [None]:
v[[1,3,5]]
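Two details are worth knowing here. The index list may be in any order and may repeat entries, and, unlike a slice, the result is a copy rather than a view. A minimal sketch (`picked` is an illustrative name):

```python
import numpy as np

v = np.arange(0, 30, 3)

picked = v[[5, 1, 1, 3]]    # any order, repeats allowed
print(picked)               # [15  3  3  9]

picked[0] = -1              # fancy indexing made a copy, so v is unchanged
print(v[5])                 # 15
print(np.shares_memory(v, picked))  # False
```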

Fancy indexing also allows the following on a 2D array:

In [None]:
arr2d = np.zeros((10,10), dtype = int)

In [None]:
arr2d

In [None]:
arr2d.shape

In [None]:
arr_length = arr2d.shape[0]  # number of rows

In [None]:
arr_length

In [None]:
arr2d[0]

In [None]:
arr2d[3]

In [None]:
for i in range(arr_length):
    arr2d[i] = i

arr2d

In [None]:
arr2d[[2,4,6,8]]

In [None]:
arr2d[[6,4,2,7]]

<h3>any_array[[row indices], [column indices]]</h3>

In [None]:
jj = np.arange(1,17).reshape((4,4))
jj

In [62]:
jj[[1,2], [0,3]]

array([ 5, 12])

In [63]:
jj[[0,2,3], [0,1,3]]

array([ 1, 10, 16])
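When two index lists are given, NumPy pairs them up element by element, so `jj[[1,2], [0,3]]` picks the elements at (1, 0) and (2, 3). If you want every combination of the listed rows and columns instead, one option is `np.ix_`. A brief sketch, with `pairs` and `block` as illustrative names:

```python
import numpy as np

jj = np.arange(1, 17).reshape((4, 4))

pairs = jj[[1, 2], [0, 3]]            # elements (1, 0) and (2, 3)
block = jj[np.ix_([1, 2], [0, 3])]    # rows 1, 2 crossed with columns 0, 3

print(pairs)   # [ 5 12]
print(block)
# [[ 5  8]
#  [ 9 12]]
```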

### Using ***basic index*** and ***fancy index*** together

In [64]:
jj

array([[ 1,  2,  3,  4],
       [ 5,  6,  7,  8],
       [ 9, 10, 11, 12],
       [13, 14, 15, 16]])

In [65]:
jj[1, [1,3]]

array([6, 8])

In [66]:
jj[[0, 3], 1]

array([ 2, 14])

### Using ***basic slicing*** and ***fancy index*** together

In [None]:
jj

In [None]:
jj[0:, [1,2]]

In [None]:
jj[1:3, [1,2]]

In [None]:
jj[1:3, 1:3]
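The last two cells return the same values, but they are not the same kind of result: the pure slice is a view, while the version that mixes in a fancy index is a copy. A quick sketch to check this (`mixed` and `sliced` are illustrative names):

```python
import numpy as np

jj = np.arange(1, 17).reshape((4, 4))

mixed = jj[1:3, [1, 2]]    # slice + fancy index: a copy
sliced = jj[1:3, 1:3]      # slices only: a view

print(np.array_equal(mixed, sliced))   # True, both are [[6, 7], [10, 11]]
print(np.shares_memory(jj, mixed))     # False
print(np.shares_memory(jj, sliced))    # True
```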

## More Indexing Help
Indexing a 2D matrix can be a bit confusing at first, especially when you start to add a step size. Try a Google image search for "NumPy indexing" to find useful diagrams.

## Selection

Let's briefly go over how to use brackets for selection based on comparison operators.

In [None]:
arr = np.arange(1,11)

In [None]:
arr

In [None]:
arr > 4

In [None]:
bool_arr = arr > 4

In [None]:
arr[bool_arr]

In [None]:
arr[arr > 4]

In [None]:
arr[(arr != 3) & (arr != 4)]

* ``& ==> and``

* ``| ==> or``
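Each comparison needs its own parentheses, because `&` and `|` bind more tightly than `>` or `!=`. Python's own `and` / `or` do not work element-wise on arrays. A small sketch combining conditions, with `~` as the element-wise "not" (the variable names are illustrative):

```python
import numpy as np

arr = np.arange(1, 11)

both = arr[(arr > 3) & (arr % 2 == 0)]     # greater than 3 AND even
either = arr[(arr < 3) | (arr > 8)]        # less than 3 OR greater than 8
neither = arr[~((arr < 3) | (arr > 8))]    # ~ inverts the boolean mask

print(both)     # [ 4  6  8 10]
print(either)   # [ 1  2  9 10]
print(neither)  # [3 4 5 6 7 8]
```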

# NumPy Operations

## Arithmetic Operations

#### You can easily perform array with array arithmetic, or scalar with array arithmetic.

In [None]:
arr = np.arange(0,10)
arr

In [None]:
arr + arr

In [None]:
arr * arr

In [None]:
arr - arr

In [None]:
arr / arr

In [None]:
1 / arr

In [None]:
arr ** 2

In [None]:
v = np.array([1,2,3,4,5])

In [None]:
v

In [None]:
v - 1

In [None]:
v * 5

In [None]:
v * 5 / 10 - 1
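One thing to watch in the cells above: `arr` starts at 0, so `arr / arr` and `1 / arr` divide by zero. NumPy does not raise an error; it emits a warning and stores `nan` or `inf`. The sketch below uses `np.errstate` to silence the warnings for the demonstration, which is one possible choice rather than a recommendation:

```python
import numpy as np

arr = np.arange(0, 10)

# Silence the divide-by-zero and invalid-value warnings inside this block only.
with np.errstate(divide="ignore", invalid="ignore"):
    ratio = arr / arr      # 0 / 0 gives nan
    inverse = 1 / arr      # 1 / 0 gives inf

print(ratio[:3])    # [nan  1.  1.]
print(inverse[:3])  # [inf 1.  0.5]
```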

## Universal Array Functions

#### NumPy comes with many [universal array functions](http://docs.scipy.org/doc/numpy/reference/ufuncs.html), which are mathematical functions applied element by element across an array. Let's show some common ones:

In [None]:
arr

In [None]:
np.log10(10000)

In [None]:
np.sqrt(arr)

In [None]:
np.exp(arr)

In [None]:
np.sin(arr)

In [None]:
np.sin(np.pi/2)

In [None]:
np.tan(np.pi/4)
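To make "element by element" concrete, the sketch below compares `np.sqrt` with an explicit Python loop over the same values, and also shows a ufunc that takes two inputs (`np.maximum`). The variable names are illustrative:

```python
import math
import numpy as np

arr = np.arange(0, 10)

roots = np.sqrt(arr)                        # one call, applied to every element
loop_roots = [math.sqrt(x) for x in arr]    # the same work written as a loop

print(np.allclose(roots, loop_roots))       # True

clipped = np.maximum(arr, 5)                # binary ufunc: element-wise maximum with 5
print(clipped)                              # [5 5 5 5 5 5 6 7 8 9]
```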

## Statistical Calculations

***

* ``np.mean(arr,axis=0)`` | Returns mean along specific axis

* ``arr.sum()`` | Returns sum of arr

* ``arr.min()`` | Returns minimum value of arr

* ``arr.max(axis=0)`` | Returns maximum value of specific axis

* ``np.var(arr)`` | Returns the variance of array

* ``np.std(arr,axis=1)`` | Returns the standard deviation of specific axis

* ``np.corrcoef(arr)`` | Returns the correlation coefficient matrix of arr
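The functions listed above can be tried on a small 2D array. In this sketch `m` is a made-up example array; `axis=0` works down the columns and `axis=1` works across the rows. Note that the correlation coefficient is called as the function `np.corrcoef`:

```python
import numpy as np

m = np.array([[1, 2, 3],
              [4, 5, 6]])

print(m.sum())               # 21
print(m.min())               # 1
print(np.mean(m, axis=0))    # [2.5 3.5 4.5]  (one mean per column)
print(m.max(axis=0))         # [4 5 6]        (one maximum per column)

row_std = np.std(m, axis=1)  # one standard deviation per row
variance = np.var(m)         # variance of all six values
corr = np.corrcoef(m)        # 2x2 matrix: each row of m is treated as a variable
```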

## A sample solve in Linear Algebra using NumPy

In [None]:
# x = 2, y = 3
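The original system of equations is not shown in this cell, so the one below is an assumption chosen only because its solution is x = 2, y = 3: the hypothetical system 2x + y = 7 and x + 3y = 11. A minimal sketch of solving it with `np.linalg.solve`:

```python
import numpy as np

# Hypothetical system (an assumption, not taken from the lecture):
#   2x +  y = 7
#    x + 3y = 11
A = np.array([[2, 1],
              [1, 3]])      # coefficient matrix
b = np.array([7, 11])       # right-hand side

solution = np.linalg.solve(A, b)
x, y = solution
print(np.allclose(solution, [2, 3]))   # True
print(np.allclose(A @ solution, b))    # True: the solution satisfies both equations
```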

## [official tutorial](https://numpy.org/devdocs/user/absolute_beginners.html) 