<h1>Basic while loop</h1>
<div class=""><p>Below you can find the example from the video where the <code>error</code> variable, initially equal to <code>50.0</code>, is divided by 4 and printed out on every run:</p>
<pre><code>error = 50.0
while error &gt; 1 :
    error = error / 4
    print(error)
</code></pre>
<p>This example will come in handy, because it's time to build a <code>while</code> loop yourself! We're going to code a <code>while</code> loop that implements a very basic control system for an <a href="https://en.wikipedia.org/wiki/Inverted_pendulum" target="_blank" rel="noopener noreferrer">inverted pendulum</a>. If there's an offset from standing perfectly straight, the <code>while</code> loop will incrementally fix this offset.</p>
<p>Note that if your <code>while</code> loop takes too long to run, you might have made a mistake. In particular, remember to <strong>indent</strong> the contents of the loop!</p></div>

In [1]:
# Initialize offset
offset = 8

# Code the while loop
while offset != 0:
    print("correcting...")
    offset = offset - 1
    print(offset)

correcting...
7
correcting...
6
correcting...
5
correcting...
4
correcting...
3
correcting...
2
correcting...
1
correcting...
0


<h1>Add conditionals</h1>
<div class=""><p>The <code>while</code> loop that corrects the <code>offset</code> is a good start, but what if <code>offset</code> is negative? You can try to run the following code where <code>offset</code> is initialized to <code>-6</code>: </p>
<pre><code># Initialize offset
offset = -6

#Code the while loop
while offset != 0 :
    print("correcting...")
    offset = offset - 1
    print(offset)
</code></pre>
<p>However, doing so will disconnect your session. The <code>while</code> loop never stops running, because <code>offset</code> is decreased further on every run and so moves away from 0. The condition <code>offset != 0</code> never becomes <code>False</code>, and the loop continues forever.</p>
<p>Fix things by putting an <code>if</code>-<code>else</code> statement inside the <code>while</code> loop. If your code is still taking too long to run, you probably made a mistake!</p></div>

In [2]:
# Initialize offset
offset = -6

# Code the while loop
while offset != 0:
    print("correcting...")
    if offset > 0:
        offset = offset - 1
    else:
        offset = offset + 1
    print(offset)

correcting...
-5
correcting...
-4
correcting...
-3
correcting...
-2
correcting...
-1
correcting...
0
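
<p>The correction loop above can also be wrapped in a function with an upper bound on the number of runs, so that a mistake cannot hang the session. This is only a sketch: the function name <code>correct_offset</code> and the <code>max_steps</code> cap are assumptions added for illustration and are not part of the exercise.</p>

```python
def correct_offset(offset, max_steps=100):
    """Move an offset toward 0 in steps of 1.

    max_steps is a hypothetical safety cap (not in the exercise) that
    stops the loop even if offset never becomes exactly 0.
    Returns the list of offsets observed after each correction.
    """
    history = []
    steps = 0
    while offset != 0 and steps < max_steps:
        if offset > 0:
            offset = offset - 1
        else:
            offset = offset + 1
        history.append(offset)
        steps += 1
    return history


print(correct_offset(8))
print(correct_offset(-6))
```

<p>With a non-integer offset such as <code>0.5</code>, the value would alternate between <code>-0.5</code> and <code>0.5</code> and never reach 0; in that case the cap is what ends the loop.</p>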


<h1>Loop over a list</h1>
<div class=""><p>Have another look at the <code>for</code> loop that Filip showed in the video:</p>
<pre><code>fam = [1.73, 1.68, 1.71, 1.89]
for height in fam : 
    print(height)
</code></pre>
<p>As usual, you simply have to indent the code with 4 spaces to tell Python which code should be executed in the <code>for</code> loop.</p>
<p>The <code>areas</code> variable, containing the area of different rooms in your house, is already defined.</p></div>

In [3]:
# areas list
areas = [11.25, 18.0, 20.0, 10.75, 9.50]

# Code the for loop
for each in areas:
    print(each)

11.25
18.0
20.0
10.75
9.5


<h1>Indexes and values (1)</h1>
<div class=""><p>Using a <code>for</code> loop to iterate over a list gives you access to each list element in turn, but not to its position. If you also want the index, that is, where in the list the current element is located, you can use <a href="https://docs.python.org/3/library/functions.html#enumerate" target="_blank" rel="noopener noreferrer"><code>enumerate()</code></a>.</p>
<p>As an example, have a look at how the <code>for</code> loop from the video was converted:</p>
<pre><code>fam = [1.73, 1.68, 1.71, 1.89]
for index, height in enumerate(fam) :
    print("person " + str(index) + ": " + str(height))
</code></pre></div>

In [4]:
# areas list
areas = [11.25, 18.0, 20.0, 10.75, 9.50]

# Change for loop to use enumerate() and update print()
for index, a in enumerate(areas):
    print("room " + str(index) + " : " + str(a))

room 0 : 11.25
room 1 : 18.0
room 2 : 20.0
room 3 : 10.75
room 4 : 9.5


<h1>Indexes and values (2)</h1>
<div class=""><p>For non-programmer folks, <code>room 0: 11.25</code> is strange. Wouldn't it be better if the count started at 1?</p></div>

In [5]:
# areas list
areas = [11.25, 18.0, 20.0, 10.75, 9.50]

# Code the for loop
for index, area in enumerate(areas) :
    print("room " + str(index+1) + ": " + str(area))

room 1: 11.25
room 2: 18.0
room 3: 20.0
room 4: 10.75
room 5: 9.5
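
<p>Adding 1 to the index works. As an alternative sketch, <code>enumerate()</code> also accepts a <code>start</code> argument that sets the first count directly. The variable names <code>number</code> and <code>lines</code> are chosen here for illustration only.</p>

```python
areas = [11.25, 18.0, 20.0, 10.75, 9.50]

# start=1 makes enumerate() count from 1 instead of 0,
# so no "+ 1" is needed inside the loop body.
lines = []
for number, area in enumerate(areas, start=1):
    lines.append("room " + str(number) + ": " + str(area))

print("\n".join(lines))
```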


<h1>Loop over list of lists</h1>
<div class=""><p>Remember the <code>house</code> variable from the Intro to Python course? Have a look at its definition in the code below. It's basically a list of lists, where each sublist contains the name and area of a room in your house.</p>
<p>It's up to you to build a <code>for</code> loop from scratch this time!</p></div>

In [6]:
# house list of lists
house = [["hallway", 11.25], 
         ["kitchen", 18.0], 
         ["living room", 20.0], 
         ["bedroom", 10.75], 
         ["bathroom", 9.50]]
         
# Build a for loop from scratch
for each in house:
    print("the "+each[0]+" is "+str(each[1])+" sqm")

the hallway is 11.25 sqm
the kitchen is 18.0 sqm
the living room is 20.0 sqm
the bedroom is 10.75 sqm
the bathroom is 9.5 sqm
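
<p>Because every sublist has exactly two items, the loop can also unpack them into two names instead of indexing with <code>[0]</code> and <code>[1]</code>. The following is a sketch of that variant; the names <code>room</code>, <code>area</code>, and <code>messages</code> are illustrative.</p>

```python
house = [["hallway", 11.25],
         ["kitchen", 18.0],
         ["living room", 20.0],
         ["bedroom", 10.75],
         ["bathroom", 9.50]]

# Each sublist is unpacked into room (name) and area (size in sqm).
messages = []
for room, area in house:
    messages.append("the " + room + " is " + str(area) + " sqm")

for message in messages:
    print(message)
```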


<h1>Loop over dictionary</h1>
<div class=""><p>In Python 3, you need the <a href="https://docs.python.org/3/library/stdtypes.html#dict.items" target="_blank" rel="noopener noreferrer"><code>items()</code></a> method to loop over the key-value pairs of a dictionary:</p>
<pre><code>world = { "afghanistan":30.55, 
          "albania":2.77,
          "algeria":39.21 }

for key, value in world.items() :
    print(key + " -- " + str(value))
</code></pre>
<p>Remember the <code>europe</code> dictionary that contained the names of some European countries as key and their capitals as corresponding value? Go ahead and write a loop to iterate over it!</p></div>

In [7]:
# Definition of dictionary
europe = {'spain':'madrid', 'france':'paris', 'germany':'berlin',
          'norway':'oslo', 'italy':'rome', 'poland':'warsaw', 'austria':'vienna' }
          
# Iterate over europe
for k, v in europe.items():
    print("the capital of " + k + " is " + v)

the capital of spain is madrid
the capital of france is paris
the capital of germany is berlin
the capital of norway is oslo
the capital of italy is rome
the capital of poland is warsaw
the capital of austria is vienna
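
<p>For comparison, here is a short sketch of what the other ways of iterating over a dictionary yield. It uses a shortened copy of <code>europe</code>; the list names are illustrative.</p>

```python
europe = {'spain': 'madrid', 'france': 'paris', 'germany': 'berlin'}

# Looping over the dictionary itself yields only the keys.
keys_only = [country for country in europe]

# values() yields only the values.
values_only = list(europe.values())

# items() yields (key, value) tuples, which a for loop can unpack.
pairs = list(europe.items())

print(keys_only)
print(values_only)
print(pairs)
```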


<h1>Loop over NumPy array</h1>
<div class=""><p>If you're dealing with a 1D NumPy array, looping over all elements can be as simple as:</p>
<pre><code>for x in my_array :
    ...
</code></pre>
<p>If you're dealing with a 2D NumPy array, it's more complicated. A 2D array is built up of multiple 1D arrays, so a plain <code>for</code> loop yields one 1D array (one row) per run. To explicitly iterate over all separate elements of a multi-dimensional array, you'll need this syntax:</p>
<pre><code>for x in np.nditer(my_array) :
    ...
</code></pre>
<p>Two NumPy arrays that you might recognize from the intro course are available in your Python session: <code>np_height</code>, a NumPy array containing the heights of Major League Baseball players, and <code>np_baseball</code>, a 2D NumPy array that contains both the heights (first column) and weights (second column) of those players.</p></div>
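
<p>The baseball data itself is not included here, so the sketch below uses tiny made-up stand-ins for <code>np_height</code> and <code>np_baseball</code> (the numbers are placeholders, not real player data). It contrasts a plain loop over a 2D array, which yields rows, with <code>np.nditer()</code>, which yields single elements.</p>

```python
import numpy as np

# Hypothetical stand-ins for the course data; the values are made up.
np_height = np.array([74, 74, 72])
np_baseball = np.array([[74, 180],
                        [74, 215],
                        [72, 210]])

# 1D array: a plain for loop visits each element.
heights = [int(x) for x in np_height]

# 2D array: a plain for loop visits each row (a 1D array).
rows = [row.tolist() for row in np_baseball]

# 2D array with np.nditer(): every single element is visited.
elements = [int(x) for x in np.nditer(np_baseball)]

print(heights)
print(rows)
print(elements)
```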

<h1>Loop over DataFrame (1)</h1>
<div class=""><p>Iterating over a Pandas DataFrame is typically done with the <a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.iterrows.html" target="_blank" rel="noopener noreferrer"><code>iterrows()</code></a> method. Used in a <code>for</code> loop, every observation is iterated over and on every iteration the row label and actual row contents are available:</p>
<pre><code>for lab, row in brics.iterrows() :
    ...
</code></pre>
<p>In this and the following exercises you will be working on the <code>cars</code> DataFrame. It contains information on the cars per capita and whether people drive right or left for seven countries in the world.</p></div>

In [10]:
# Import cars data
import pandas as pd
cars = pd.read_csv('cars.csv', index_col = 0)

# Iterate over rows of cars
for lab, row in cars.iterrows():
    print(lab)
    print(row)

US
cars_per_cap              809
country         United States
drives_right             True
Name: US, dtype: object
AUS
cars_per_cap          731
country         Australia
drives_right        False
Name: AUS, dtype: object
JAP
cars_per_cap      588
country         Japan
drives_right    False
Name: JAP, dtype: object
IN
cars_per_cap       18
country         India
drives_right    False
Name: IN, dtype: object
RU
cars_per_cap       200
country         Russia
drives_right      True
Name: RU, dtype: object
MOR
cars_per_cap         70
country         Morocco
drives_right       True
Name: MOR, dtype: object
EG
cars_per_cap       45
country         Egypt
drives_right     True
Name: EG, dtype: object
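
<p>The file <code>cars.csv</code> is not reproduced here. As a sketch, a DataFrame with the same values can be built by hand from the output printed above, which makes the loops in this and the following sections runnable without the file. Column dtypes may differ from what <code>read_csv()</code> would infer.</p>

```python
import pandas as pd

# Values copied from the printed output above; the row labels become the index.
cars = pd.DataFrame(
    {
        "cars_per_cap": [809, 731, 588, 18, 200, 70, 45],
        "country": ["United States", "Australia", "Japan", "India",
                    "Russia", "Morocco", "Egypt"],
        "drives_right": [True, False, False, False, True, True, True],
    },
    index=["US", "AUS", "JAP", "IN", "RU", "MOR", "EG"],
)

# iterrows() yields (row label, row as a Series) on every run.
labels = []
for lab, row in cars.iterrows():
    labels.append(lab)
    print(lab + ": " + row["country"])
```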


<h1>Loop over DataFrame (2)</h1>
<div class=""><p>The row data that's generated by <a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.iterrows.html" target="_blank" rel="noopener noreferrer"><code>iterrows()</code></a> on every run is a Pandas Series. This format is not very convenient to print out. Luckily, you can easily select variables from the Pandas Series using square brackets:</p>
<pre><code>for lab, row in brics.iterrows() :
    print(row['country'])
</code></pre></div>

In [11]:
# Import cars data
import pandas as pd
cars = pd.read_csv('cars.csv', index_col = 0)

# Adapt for loop
for lab, row in cars.iterrows() :
    print(lab+": "+str(row['cars_per_cap']))

US: 809
AUS: 731
JAP: 588
IN: 18
RU: 200
MOR: 70
EG: 45


<h1>Add column (1)</h1>
<div class=""><p>In the video, Filip showed you how to add the length of the country names of the <code>brics</code> DataFrame in a new column:</p>
<pre><code>for lab, row in brics.iterrows() :
    brics.loc[lab, "name_length"] = len(row["country"])
</code></pre>
<p>You can do similar things on the <code>cars</code> DataFrame.</p></div>

In [12]:
# Import cars data
import pandas as pd
cars = pd.read_csv('cars.csv', index_col = 0)

# Code for loop that adds COUNTRY column
for lab, row in cars.iterrows():
    cars.loc[lab, "COUNTRY"] = row['country'].upper()


# Print cars
print(cars)

     cars_per_cap        country  drives_right        COUNTRY
US            809  United States          True  UNITED STATES
AUS           731      Australia         False      AUSTRALIA
JAP           588          Japan         False          JAPAN
IN             18          India         False          INDIA
RU            200         Russia          True         RUSSIA
MOR            70        Morocco          True        MOROCCO
EG             45          Egypt          True          EGYPT


<h1>Add column (2)</h1>
<div class=""><p>Using <a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.iterrows.html" target="_blank" rel="noopener noreferrer"><code>iterrows()</code></a> to iterate over every observation of a Pandas DataFrame is easy to understand, but not very efficient. On every iteration, you're creating a new Pandas Series.</p>
<p>If you want to add a column to a DataFrame by calling a function on another column, the <a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.iterrows.html" target="_blank" rel="noopener noreferrer"><code>iterrows()</code></a> method in combination with a <code>for</code> loop is not the preferred way to go. Instead, you'll want to use <a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.Series.apply.html" target="_blank" rel="noopener noreferrer"><code>apply()</code></a>.</p>
<p>Compare the <a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.iterrows.html" target="_blank" rel="noopener noreferrer"><code>iterrows()</code></a> version with the <a href="http://pandas.pydata.org/pandas-docs/stable/generated/pandas.Series.apply.html" target="_blank" rel="noopener noreferrer"><code>apply()</code></a> version to get the same result in the <code>brics</code> DataFrame:</p>
<pre><code>for lab, row in brics.iterrows() :
    brics.loc[lab, "name_length"] = len(row["country"])

brics["name_length"] = brics["country"].apply(len)
</code></pre>
<p>We can do a similar thing to call the <a href="https://docs.python.org/2/library/stdtypes.html#str.upper" target="_blank" rel="noopener noreferrer"><code>upper()</code></a> method on every name in the <code>country</code> column. However, <a href="https://docs.python.org/2/library/stdtypes.html#str.upper" target="_blank" rel="noopener noreferrer"><code>upper()</code></a> is a <strong>method</strong>, so we'll need a slightly different approach:</p></div>

In [14]:
# Import cars data
import pandas as pd
cars = pd.read_csv('cars.csv', index_col = 0)

# Use .apply(str.upper)
cars["COUNTRY"] = cars["country"].apply(str.upper)

cars["country_length"] = cars["country"].apply(len)

print(cars)

     cars_per_cap        country  drives_right        COUNTRY  country_length
US            809  United States          True  UNITED STATES              13
AUS           731      Australia         False      AUSTRALIA               9
JAP           588          Japan         False          JAPAN               5
IN             18          India         False          INDIA               5
RU            200         Russia          True         RUSSIA               6
MOR            70        Morocco          True        MOROCCO               7
EG             45          Egypt          True          EGYPT               5
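
<p>As an alternative to <code>apply()</code> that the exercise does not cover, pandas also offers string methods on a whole column through the <code>.str</code> accessor. The sketch below uses a shortened, hand-built version of <code>cars</code> rather than the CSV file.</p>

```python
import pandas as pd

# Shortened stand-in for the cars DataFrame (first three rows only).
cars = pd.DataFrame(
    {"country": ["United States", "Australia", "Japan"]},
    index=["US", "AUS", "JAP"],
)

# .str applies a string operation to every value in the column.
cars["COUNTRY"] = cars["country"].str.upper()
cars["country_length"] = cars["country"].str.len()

print(cars)
```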
