# Getting Started with NumPy - Lab

## Introduction

Now that we have introduced NumPy, let's put it into practice. In this lab, we are going to create arrays, perform operations on them, and return new arrays, all using the NumPy library. Let's get started!

## Objectives

You will be able to: 

* Understand how to initialize NumPy arrays from nested Python lists, and access elements using square brackets
* Understand the shape attribute on NumPy arrays
* Understand how to create arrays from scratch including np.zeros, np.ones, np.full
* Learn to perform scalar and vector math  
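
As a quick orientation for the objectives above, the sketch below builds an array from a nested list, indexes it with square brackets, reads its `shape`, and creates arrays from scratch. The variable names and values are illustrative assumptions and are not used elsewhere in the lab.

```python
import numpy as np

# Illustrative values only; none of these names are used later in the lab.
nested = np.array([[1, 2, 3], [4, 5, 6]])  # 2 rows, 3 columns

first_row = nested[0]        # square brackets select a row
single_value = nested[1, 2]  # row index 1, column index 2
print(nested.shape)          # the shape attribute is a tuple of dimension sizes

zeros = np.zeros(3)          # three 0.0 values
ones = np.ones((2, 2))       # 2x2 array of 1.0 values
sevens = np.full((2, 3), 7)  # 2x3 array filled with 7
print(zeros, ones, sevens, sep="\n")
```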

## Import NumPy under the standard alias

In [1]:
#Your code here
import numpy as np

## Generating Some Mock Data

Create a NumPy array from each of the following:

1. A range
2. A Python list
    
Below, create a list in Python that has 5 elements (e.g. [0,1,2,3,4]) and assign it to the variable `py_list`.

Next, do the same, but instead of a list, create a range with 5 elements and assign it to the variable `py_range`.

Finally, use the list and range to create NumPy arrays and assign the array from list to the variable `array_from_list`, and the array from the range to the variable `array_from_range`.

In [11]:
#Your code here
py_list = list(range(5))
py_range = range(5)
array_from_list = np.array(py_list)
array_from_range = np.array(py_range)
print(py_list)
print(py_range)
print(array_from_list)
print(array_from_range)

[0, 1, 2, 3, 4]
range(0, 5)
[0 1 2 3 4]
[0 1 2 3 4]


Next, we have a list of heights and a list of weights, and we'd like to use them to create a collection of BMIs. However, the heights are in inches and the weights are in pounds (imperial system).

Let's use what we know to create NumPy arrays with the metric equivalent values (height in meters and weight in kilograms).

> **Remember:** *NumPy can make these calculations a lot easier and with less code than a list!*

> 1.0 inch = 0.0254 meters

> 2.2046 lbs = 1 kilogram
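
To illustrate why an array needs less code than a list here, the sketch below converts the same heights both ways. The sample values and variable names are hypothetical and are not the lab data.

```python
import numpy as np

INCHES_TO_METERS = 0.0254  # conversion factor quoted above

sample_inches = [60, 70]  # hypothetical sample values, not the lab data

# Plain Python: a list cannot be multiplied by a float, so we loop explicitly.
meters_from_list = [inches * INCHES_TO_METERS for inches in sample_inches]

# NumPy: the scalar is applied to every element in a single expression.
meters_from_array = np.array(sample_inches) * INCHES_TO_METERS

print(meters_from_list)
print(meters_from_array)
```

Both approaches give the same numbers; the array version simply avoids the explicit loop.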

In [14]:
# use the conversion rate for turning height in inches to meters
list_height_inches = [65, 68, 73, 75, 78]
meter_height = np.array(list_height_inches)*.0254

#Your code here

In [15]:
# use the conversion rate for turning weight in pounds to kilograms
list_weight_pounds = [150, 140, 220, 205, 265]
kilogram_weight = np.array(list_weight_pounds)/2.2046

#your code here

The metric formula for calculating BMI is as follows:

> BMI = weight (kg) ÷ height^2 (m^2)

So, to get BMI, we divide weight by the squared value of height. For example, if I weighed 130 kg and was 1.9 meters tall, the calculation would look like:

> BMI = 130 / (1.9*1.9)
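
The worked example above can be checked in code, and the same expression carries over unchanged to whole arrays. This is a minimal sketch; the two-element arrays are hypothetical sample values, not the lab data.

```python
import numpy as np

# Single-person example from the text: 130 kg and 1.9 m.
single_bmi = 130 / (1.9 * 1.9)
print(round(single_bmi, 2))  # about 36.01

# The same formula applied element-wise to arrays (hypothetical values).
weights_kg = np.array([130.0, 65.0])
heights_m = np.array([1.9, 1.9])
bmis = weights_kg / heights_m ** 2  # ** squares each height before dividing
print(bmis)
```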

Use the BMI calculation to create a NumPy array of BMIs.

In [17]:
#Your code here

BMI = kilogram_weight / meter_height**2 #element-wise: each weight divided by its squared height
BMI

array([24.9613063 , 21.28692715, 29.02550097, 25.62324316, 30.62382485])

## Create an identity vector using `np.ones()`

In [20]:
#Your code here
identity_vector = np.ones(len(BMI)) #a vector of ones with one entry per BMI value

## Multiply the `BMI` array by your identity vector

In [22]:
#Your code here
BMI*np.ones([len(BMI)])

array([24.9613063 , 21.28692715, 29.02550097, 25.62324316, 30.62382485])
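
The result above is unchanged because `*` multiplies arrays element by element, and multiplying by 1 leaves each value as it is. As an aside, the sketch below contrasts this with matrix multiplication, where the identity is `np.eye` rather than `np.ones`. The sample values and names are hypothetical.

```python
import numpy as np

values = np.array([2.0, 4.0, 6.0])  # hypothetical sample values

# Element-wise product with a vector of ones leaves every value unchanged.
ones_vector = np.ones(len(values))
elementwise = values * ones_vector

# For matrix multiplication, the identity is np.eye:
# ones on the diagonal and zeros everywhere else.
identity_matrix = np.eye(len(values))
matrix_product = identity_matrix @ values

print(elementwise)
print(matrix_product)
```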

## Level Up: Using NumPy to Parse a File
The pandas library that we've been using is built on top of NumPy; all columns/series in a pandas DataFrame are built using NumPy arrays. To get a better idea of how a built-in method like `pd.read_csv()` works, we'll try to recreate it here!

In [26]:
#Open a text file (csv files are just plaintext separated by commas)
with open('bp.txt') as f: #The with block closes the file for us
    lines = f.readlines()
n_rows = len(lines)
print('The file has {} lines.'.format(n_rows)) #Print number of lines in the file
n_cols = len(lines[0].split('\t')) #The file has values separated by tabs; we split the first line and check its length.

#Your code here
#1) Create a matrix of zeros sized to hold the data (one matrix row per file column, one matrix column per data line)
data_matrix = np.zeros([n_cols, n_rows-1])
#2) Iterate through the lines with enumerate, skipping the first line (it's just column names, not the data)
for row_index, line in enumerate(lines[1:]):
    row = line.split('\t')
    #3) Update the matrix with the new stream of data
    for col_index in range(n_cols):
        data_matrix[col_index, row_index] = float(row[col_index])
#4) Preview your results; you should now have a NumPy matrix with the data from the file
print(data_matrix)

The file has 21 lines.
[[  1.     2.     3.     4.     5.     6.     7.     8.     9.    10.
   11.    12.    13.    14.    15.    16.    17.    18.    19.    20.  ]
 [105.   115.   116.   117.   112.   121.   121.   110.   110.   114.
  114.   115.   114.   106.   125.   114.   106.   113.   110.   122.  ]
 [ 47.    49.    49.    50.    51.    48.    49.    47.    49.    48.
   47.    49.    50.    45.    52.    46.    46.    46.    48.    56.  ]
 [ 85.4   94.2   95.3   94.7   89.4   99.5   99.8   90.9   89.2   92.7
   94.4   94.1   91.6   87.1  101.3   94.5   87.    94.5   90.5   95.7 ]
 [  1.75   2.1    1.98   2.01   1.89   2.25   2.25   1.9    1.83   2.07
    2.07   1.98   2.05   1.92   2.19   1.98   1.87   1.9    1.88   2.09]
 [  5.1    3.8    8.2    5.8    7.     9.3    2.5    6.2    7.1    5.6
    5.3    5.6   10.2    5.6   10.     7.4    3.6    4.3    9.     7.  ]
 [ 63.    70.    72.    73.    72.    71.    69.    66.    69.    64.
   74.    71.    68.    67.    76.    69.    
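
For comparison with the manual loop, NumPy also provides `np.loadtxt`, which handles the header skipping and splitting in one call. The sketch below is a minimal illustration that reads from an in-memory string instead of `bp.txt`; the column names and values in `sample_text` are made up for the example.

```python
import io
import numpy as np

# Hypothetical tab-separated text standing in for a file like bp.txt;
# the column names and values are invented for illustration.
sample_text = "id\tbp\tage\n1\t105\t47\n2\t115\t49\n3\t116\t49\n"

# skiprows=1 drops the header line; delimiter="\t" splits each line on tabs.
parsed = np.loadtxt(io.StringIO(sample_text), delimiter="\t", skiprows=1)

print(parsed.shape)
print(parsed.T)  # transpose so each file column becomes one row of the result
```

Passing a filename in place of the `io.StringIO` object should work the same way for a real tab-separated file.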

## Summary

In this lab, we practiced creating NumPy arrays from both lists and ranges. We then practiced performing math operations like converting imperial measurements to metric measurements on each element of a NumPy array to create new arrays with new values. Finally, we used both of our new NumPy arrays to operate on each other and create new arrays containing the BMIs from our arrays containing heights and weights.