# Making Choices #

In our last lesson, we discovered something suspicious was going on in our inflammation data by drawing some plots. How can we use Python to automatically recognize the different features we saw, and take a different action for each? In this lesson, we’ll learn how to write code that runs only when certain conditions are true.

## Conditionals ##

Just as in real life, we sometimes want a command carried out only under certain circumstances: _if_ it's raining, bring an umbrella.  The same is true of our commands in Python.  When we want a piece of code to run only under certain conditions, we use what are appropriately called _conditionals_.  For example, consider the following code and its output.

In [1]:
num = 37
if num > 100:
    print('greater')
else:
    print('not greater')
print('done')

not greater
done


In the first line of the code, we set the variable `num` equal to the number `37`.  In the next line, the `if` statement says that _if_ `num` is greater than `100`, Python should execute the indented lines beneath it.  The `else` statement handles every case where that condition is not true.  In our example, any time `num` is less than or equal to `100`, the indented lines after the `else` statement are executed.  Because the `print('done')` statement sits in neither of the indented blocks of our `if/else` statement, it is executed every time, whichever branch runs.  The above algorithm can be visualized in the following graphic:

![Executing a Conditional](fig/python-flowchart-conditional.png)
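
To watch both branches run, one option is to put the same conditional inside a `for` loop and feed it values on either side of the threshold. This is only a sketch: the list of test values and the names `label` and `results` are introduced here for illustration and are not part of the lesson's data.

```python
results = []
for num in [37, 100, 250]:
    if num > 100:
        label = 'greater'
    else:
        # 100 itself lands here, because 100 > 100 is false
        label = 'not greater'
    results.append(label)
    print(num, 'is', label)
```

Only `250` takes the `if` branch; `37` and `100` both fall through to `else`.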


In the previous example we used an `else` statement to take care of everything that did not meet the condition in our `if` statement.  However, an `else` is optional: sometimes there is simply nothing to do when the condition fails.  If there is no `else` clause, Python does nothing when the condition evaluates to false.

In [2]:
num = 53
print('before conditional...')
if num > 100:
    print('53 is greater than 100')
print('...after conditional')

before conditional...
...after conditional


What if we have more than two cases to distinguish?  We can chain several tests together using `elif`, which is short for "else if".  Consider the following code that uses several conditionals to print the sign of a number.

In [3]:
num = -3

if num > 0:
    print(num, "is positive")
elif num == 0:
    print(num, "is zero")
else:
    print(num, "is negative")

-3 is negative
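
Python tries the tests in an `if`/`elif` chain from top to bottom and runs only the first branch whose test is true, even if later tests would also pass. The sketch below illustrates this with a made-up `temperature` value and made-up thresholds; none of these come from the inflammation data.

```python
temperature = 35  # hypothetical reading, chosen so that two tests are true

if temperature > 30:
    label = 'hot'
elif temperature > 20:
    # 35 > 20 is also true, but this branch is skipped
    # because an earlier test already matched
    label = 'warm'
else:
    label = 'cool'

print(label)
```

This prints `hot`, so the order of the tests matters whenever more than one of them can be true.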


An important syntax note: to test whether two values are equal in Python, we use `==` rather than `=`, which, as we know, is reserved for variable assignment.
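
A minimal sketch of the difference, using a hypothetical variable `answer`: a single `=` stores a value under a name, while `==` compares two values and produces `True` or `False`.

```python
answer = 42                 # single '=' assigns 42 to the name answer
is_match = (answer == 42)   # double '==' compares and yields a boolean
is_other = (answer == 7)

print(is_match)
print(is_other)

# Writing `if answer = 42:` is rejected by Python with a SyntaxError,
# so this particular mix-up is caught before the code runs.
```

The two `print` calls show `True` and then `False`.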

Sometimes it is useful to evaluate two tests in a single statement.  Using `and` and `or` we can do just that.  A conditional that combines two tests with `and` is true only if both parts are true.  A conditional that combines two tests with `or` is true if at least one part is true.

In [4]:
if (1 > 0) and (-1 > 0):
    print('both parts are true')
else:
    print('at least one part is false')

at least one part is false


In [5]:
if (1 < 0) or (-1 < 0):
    print('at least one test is true')

at least one test is true
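
To check the rule for every combination at once, one approach is to loop over both possible values on each side and record what `and` and `or` return. The names `left`, `right`, and `truth_table` are illustrative only.

```python
truth_table = []
for left in [True, False]:
    for right in [True, False]:
        # Store the inputs together with the result of each operator
        truth_table.append((left, right, left and right, left or right))
        print(left, right, '| and:', left and right, '| or:', left or right)
```

Of the four rows, `and` is true only when both inputs are true, and `or` is false only when both inputs are false.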


## Checking Our Data ##

Now that we have seen how conditionals are used, we can apply them to check for the suspicious features in our inflammation data.  In the first few plots, the maximum inflammation per day seemed to rise like a straight line, one unit per day.  We can check for this inside a `for` loop with the following conditional:
```python
if numpy.max(data, axis=0)[0] == 0 and numpy.max(data, axis=0)[20] == 20:
    print('Suspicious looking maxima!')
```
Similarly, we also saw a problem in the third data set: the minima per day were all zero (perhaps a healthy person snuck into our study?).  We can check for this with an `elif` condition.
```python
elif numpy.sum(numpy.min(data, axis=0)) == 0:
    print('Minima add up to zero!')
```
Of course, we can't forget about the case where the data looks okay.  We can use a final `else` statement to give the all-clear if neither of the above conditions was met.
```python
else:
    print('Seems OK!')
```

Let's bring it all together by using it on our real data.

In [6]:
import numpy

data = numpy.loadtxt(fname='../data/inflammation-01.csv', delimiter=',')
if numpy.max(data, axis=0)[0] == 0 and numpy.max(data, axis=0)[20] == 20:
    print('Suspicious looking maxima!')
elif numpy.sum(numpy.min(data, axis=0)) == 0:
    print('Minima add up to zero!')
else:
    print('Seems OK!')

Suspicious looking maxima!


In [7]:
data = numpy.loadtxt(fname='../data/inflammation-03.csv', delimiter=',')
if numpy.max(data, axis=0)[0] == 0 and numpy.max(data, axis=0)[20] == 20:
    print('Suspicious looking maxima!')
elif numpy.sum(numpy.min(data, axis=0)) == 0:
    print('Minima add up to zero!')
else:
    print('Seems OK!')

Minima add up to zero!


In this way, we have asked Python to do something different depending on the condition of our data. Here we printed messages in all cases, but we could also imagine not using the else catch-all so that messages are only printed when something is wrong, freeing us from having to manually examine every plot for features we’ve seen before.
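
As a rough sketch of that idea, the loop below applies the same two checks to several data sets and omits the `else`, so only problems are reported. The three arrays are made up for illustration (they are not the lesson's files), and the names `ramp`, `zero_row`, `steady`, and `messages` are assumptions introduced here.

```python
import numpy

# Made-up arrays standing in for inflammation files
# (rows are patients, columns are days). They are not the lesson's data.
ramp = numpy.tile(numpy.arange(40), (3, 1))                      # maxima rise 0, 1, 2, ...
zero_row = numpy.vstack([numpy.zeros(40), numpy.full(40, 5.0)])  # one patient is all zeros
steady = numpy.full((3, 40), 2.0)                                # nothing unusual

messages = []
for name, data in [('ramp', ramp), ('zero_row', zero_row), ('steady', steady)]:
    if numpy.max(data, axis=0)[0] == 0 and numpy.max(data, axis=0)[20] == 20:
        messages.append(name + ': Suspicious looking maxima!')
    elif numpy.sum(numpy.min(data, axis=0)) == 0:
        messages.append(name + ': Minima add up to zero!')
    # No else branch: data that passes both checks adds no message.

for message in messages:
    print(message)
```

Two messages are printed, one for `ramp` and one for `zero_row`; `steady` passes both checks and produces no output.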

## Ex. 1: How Many Paths? ##

Which of the following would be printed if you were to run this code? Why did you pick this answer?

```python
if 4 > 5:
    print('A')
elif 4 == 5:
    print('B')
elif 4 < 5:
    print('C')
```

1. A
2. B
3. C
4. B and C

In [None]:
### answer here ###

## Ex. 2: What Is Truth? ##

`True` and `False` are special words in Python called booleans, which represent true and false statements. However, they aren’t the only values in Python that count as true or false. In fact, any value can be used in an `if` or `elif`. After reading and running the code below, explain what the rule is for which values are considered true and which are considered false.

In [9]:
if '':
    print('empty string is true')
if 'word':
    print('word is true')
if []:
    print('empty list is true')
if [1, 2, 3]:
    print('non-empty list is true')
if 0:
    print('zero is true')
if 1:
    print('one is true')

word is true
non-empty list is true
one is true


In [11]:
### answer here ###

## Ex. 3: That’s Not Not What I Meant. ##

Sometimes it is useful to check whether some condition is not true. The Boolean operator `not` can do this explicitly. After reading and running the code below, write some `if` statements that use `not` to test the rule that you formulated in the previous exercise.

In [12]:
if not '':
    print('empty string is not true')
if not 'word':
    print('word is not true')
if not not True:
    print('not not True is true')

empty string is not true
not not True is true


In [None]:
### answer here ###

## Ex. 4: Close Enough. ##

Write some conditions that print `True` if the variable `a` is within 10% of the variable `b` and `False` otherwise. Compare your implementation with your partner’s: do you get the same answer for all possible pairs of numbers?

In [None]:
### answer here ###

## Ex. 5: In-Place Operators ##

Python, like most languages in the C family, provides in-place operators that work like this:

In [13]:
x = 1  # original value
x += 1 # add one to x, assigning result back to x
x *= 3 # multiply x by 3
print(x)

6


Write some code that sums the positive and negative numbers in a list separately, using in-place operators. Do you think the result is more or less readable than writing the same without in-place operators?

In [None]:
### answer here ###

## Ex. 6: Sorting a List Into Buckets ##

The folder containing our data files has large data sets whose names start with “inflammation-”, small ones whose names start with “small-”, and possibly other files whose sizes we don’t know. Our goal is to sort those files into three lists called `large_files`, `small_files`, and `other_files` respectively. Add code to the template below to do this. Note that the string method `startswith` returns `True` if and only if the string it is called on starts with the string passed as an argument.
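
For reference, here is a small sketch of how `startswith` behaves on its own. The filename is hypothetical and deliberately not one of the lesson's files, so the exercise itself is left for you.

```python
filename = 'notes-draft.txt'  # hypothetical name, used only for illustration

at_start = filename.startswith('notes-')   # prefix matches the beginning
in_middle = filename.startswith('draft')   # 'draft' occurs, but not at the start

print(at_start)
print(in_middle)
```

This prints `True` and then `False`: the argument has to match the very beginning of the string, not just appear somewhere inside it.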

Your solution should:

1. loop over the names of the files
2. figure out which group each filename belongs to
3. append the filename to that list

In the end the three lists should be:
```python
large_files = ['inflammation-01.csv', 'inflammation-02.csv']
small_files = ['small-01.csv', 'small-02.csv']
other_files = ['myscript.py']
```

In [14]:
### answer here ###

files = ['inflammation-01.csv', 'myscript.py', 'inflammation-02.csv', 'small-01.csv', 'small-02.csv']
large_files = []
small_files = []
other_files = []

### Key Points ###

* Use `if condition` to start a conditional statement, `elif condition` to provide additional tests, and `else` to provide a default.
* The bodies of the branches of conditional statements must be indented.
* Use `==` to test for equality.
* `X and Y` is only true if both `X` and `Y` are true.
* `X or Y` is true if either `X` or `Y`, or both, are true.
* Zero, the empty string, and the empty list are considered false; all other numbers, strings, and lists are considered true.
* Nest loops to operate on multi-dimensional data.
* Put code whose parameters change frequently in a function, then call it with different parameter values to customize its behavior.