# Introduction to Python
This notebook will orient us to some of the key ideas from Python. Follow along with the lecture. There will be some places to write code on your own and those will be marked by comments. Here we go!


## IPython notebooks
This file - an IPython notebook - does not follow the standard pattern of Python code in a plain text file. Instead, an IPython notebook is stored as a file in the JSON format. The advantage is that we can mix formatted text, Python code, and code output. It requires the IPython notebook server to run, though, so it isn't a stand-alone Python program. Other than that, there is no difference between the Python code that goes into a program file and the code that goes into an IPython notebook.

## Modules
Most of the functionality in Python is provided by modules. The Python Standard Library is a large collection of modules that provides cross-platform implementations of common facilities such as access to the operating system, file I/O, string management, network communication, and much more.

## References
* The Python Language Reference: https://docs.python.org/3/reference/index.html
* The Python Standard Library: https://docs.python.org/3/library/index.html

To use a module in a Python program, it first has to be imported with the `import` statement. For example, `import random` imports the module `random`, which contains many of the functions we'd use for sampling.
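
As a rough sketch of what importing looks like in practice, here are three common forms of the `import` statement. The modules `math` and `statistics` are chosen only for illustration (they are part of the standard library), and the variable names are made up for this example.

```python
# Three common import forms, all using standard-library modules.
import math                  # import the whole module; use names as math.<name>
from math import sqrt        # import a single name directly
import statistics as stats   # import a module under a shorter alias

whole_module = math.sqrt(25)           # call through the module name
single_name = sqrt(25)                 # call the imported name directly
aliased = stats.mean([1, 3, 5, 7, 9])  # call through the alias

print(whole_module, single_name, aliased)
```

Importing the whole module keeps it obvious where each function comes from, which helps when two modules define functions with the same name.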

### Best practices
1. Regularly restart your notebook kernels.
1. Run systematically from the top down. 
1. It's okay to clean up cells that didn't work or that you built upon.
1. Use comments so you know what you've done.
1. Be _very_ careful reusing names. Work hard to come up with names that make sense.
1. Do module imports at the top.

In [1]:
# Let's get started with the basics
2 + 2

4

In [2]:
3 * 7

21

In [3]:
9**2

81

In [4]:
14 % 3

2

Sometimes we need functions that live in other modules. This piece of code brings in the square root function, `sqrt`, from NumPy.

In [5]:
from numpy import sqrt
sqrt(25)

5.0

The quadratic formula finds the zeros of a polynomial of the form $a \cdot x^2 + b\cdot x + c$. (Finding the zeros means setting the polynomial equal to zero and figuring out which values of `x` make the equation true.) The quadratic formula is:
$$
x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}.
$$
Find the roots of $2 \cdot x^2 + 3\cdot x + 1$:



In [7]:
a = 2
b = 3
c = 1

In [None]:
(-1*b - sqrt(b**2 - 4 * a * c))/(2*a)
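
The cell above gives only the root that uses the minus sign. A minimal sketch that computes both roots is below; it uses `math.sqrt` from the standard library instead of NumPy's `sqrt` so that it runs on its own, and it assumes the discriminant is not negative. The names `discriminant`, `root_plus`, and `root_minus` are made up for this example.

```python
import math

# Coefficients of 2*x**2 + 3*x + 1
a, b, c = 2, 3, 1

# The part under the square root; assumed to be >= 0 here
discriminant = b**2 - 4 * a * c

root_plus = (-b + math.sqrt(discriminant)) / (2 * a)
root_minus = (-b - math.sqrt(discriminant)) / (2 * a)

print(root_plus, root_minus)  # -0.5 -1.0
```

Plugging either value back into `a*x**2 + b*x + c` gives 0, which is a quick way to check the answer.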

Python has a bunch of basic mathematical functions built in: [Python 3 Built-in Functions](https://docs.python.org/3/library/functions.html).

In [11]:
abs(-393838)

393838

In [12]:
min([1,3,5,7,9])

1

In [13]:
max([1,3,5,7,9])

9

In [14]:
sum([1,3,5,7,9])

25

Logical operators are simpler than in many other languages: Python spells them out as the words `and`, `or`, and `not`. To remind yourself how they work, evaluate the following logical expressions and pay attention to what you get:
* True and True
* True and False
* True or True
* True or False
* True or not False
* not True and not False

In [15]:
True and True

True

In [16]:
True and False

False

In [17]:
True or True

True

In [18]:
True or False

True

In [19]:
True or not False

True

In [20]:
not True and not False

False
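
The last expression depends on operator precedence: `not` binds tightest, then `and`, then `or`. The sketch below makes the grouping explicit with parentheses; the variable names are made up for this example.

```python
# `not` is applied first, then `and`, then `or`.
implicit = not True and not False
explicit = (not True) and (not False)
print(implicit, explicit)  # False False

# `and` binds tighter than `or`, so parentheses can change the answer.
mixed = True or False and False        # parsed as True or (False and False)
regrouped = (True or False) and False
print(mixed, regrouped)  # True False
```

When in doubt, adding parentheses costs nothing and makes the intent clear.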

In [23]:
# Comparisons
4 < 5

True

In [22]:
4 <= 4

True

In [21]:
(3*1/3)==1

True

In [24]:
1 != 2

True
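
The check `(3*1/3)==1` happens to come out `True`, but equality tests on decimal numbers can be surprising because floats are stored approximately. As a small sketch, the standard library's `math.isclose` is one way to compare floats with a tolerance; the variable names are made up for this example.

```python
import math

total = 0.1 + 0.2

exact_match = total == 0.3              # exact comparison on floats
close_match = math.isclose(total, 0.3)  # comparison with a small tolerance

print(exact_match)  # False
print(close_match)  # True
```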

### Strings
Strings can get complicated and we'll spend a decent amount of time working with them. They are enclosed in quotes (single or double) and are case sensitive.

In [25]:
s = "Hello!"
s=='Hello!'

True

In [26]:
s=="hello!"

False

In [29]:
s + ", and how are you?"

'Hello!, and how are you?'

Triple quotes can make strings that break across multiple lines:

In [30]:
s = """Here's a string
that goes across multiple
lines like it's no big deal."""

In [31]:
print(s)

Here's a string
that goes across multiple
lines like it's no big deal.


In [44]:
# what does this code do? 
x = s.split()
x

["Here's",
 'a',
 'string',
 'that',
 'goes',
 'across',
 'multiple',
 'lines',
 'like',
 "it's",
 'no',
 'big',
 'deal.']

There are a bunch of convenience functions that allow us to work with strings:

In [32]:
x = 'bonjour'

In [33]:
len(x)

7

In [36]:
'hello' in x

False

In [37]:
x[0]

'b'

In [38]:
x*2

'bonjourbonjour'

In [None]:
x[-1]

In [None]:
x[-2]
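
Negative indexes count from the end of the string, so `-1` is the last character. A short sketch using the same string, with made-up variable names:

```python
x = 'bonjour'

last = x[-1]          # last character
second_to_last = x[-2]
last_three = x[-3:]   # slice from the third-to-last character to the end

print(last, second_to_last, last_three)  # r u our

# Indexing from the end is equivalent to counting back from len(x).
same_as_last = x[len(x) - 1]
```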

### Data Types
The `type` function will give us the type of an object. This can be useful for debugging.

In [43]:
type(1)

int

In [40]:
type(1.0)

float

In [41]:
type("l'ours")

str
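
As a sketch of how `type` can help with debugging, suppose a number arrives as text (a made-up scenario; `value` and `number` are hypothetical names). Checking the type shows why arithmetic on it would fail, and `int` converts it.

```python
value = "5"          # looks like a number, but is it?

print(type(value))   # <class 'str'>

# value + 1 would raise a TypeError, so convert to an integer first.
number = int(value)
print(type(number))  # <class 'int'>
print(number + 1)    # 6
```

In a notebook cell, `type(value)` on the last line displays just `str`; `print` shows the longer `<class 'str'>` form.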

What data type are the `True` and `False` values?

In [None]:
# figure out the data type for True here
type(True) # or False

### More on Strings
Strings also come with a bunch of useful methods, which are called with dot notation:

In [45]:
x = "here's an example of a string."

In [46]:
x.count('a')

3

In [47]:
x.upper()

"HERE'S AN EXAMPLE OF A STRING."

In [48]:
"A cRaZy EXAMple".lower()

'a crazy example'

In [49]:
c = 'cat'
h = 'hat'
c + h

'cathat'

In [50]:
ch = c + h
print(ch)

cathat


In [51]:
print(c + ' in the ' + h)

cat in the hat


In [55]:
A = 5
print(str(A) + c)

5cat


In [56]:
print('We have ' + str(A) + ' ' + c + 's')

We have 5 cats
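
Building a sentence with `+` and `str` works, but it gets hard to read. One alternative, sketched here, is an f-string (available in Python 3.6 and later), which substitutes values inside curly braces. The names `concatenated` and `formatted` are made up for this example.

```python
A = 5
c = 'cat'

# Concatenation, as in the cell above
concatenated = 'We have ' + str(A) + ' ' + c + 's'

# The same sentence as an f-string; no str() call is needed
formatted = f"We have {A} {c}s"

print(formatted)  # We have 5 cats
```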


In [58]:
x = 'GO GRIZ'
print(x.lower())
x = x.lower()
print(x)

go griz
go griz


In [59]:
len(x)

7

In [68]:
x = "The best of times, the worst of times."
x.find("griz")

-1

In [73]:
x = x.replace("griz","burritos") 

x

'The best of times, the worst of times.'
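
In the two cells above, "griz" is not in the string, so `find` returns -1 and `replace` changes nothing. For comparison, here is a sketch with a substring that does occur; the variable names are made up for this example.

```python
x = "The best of times, the worst of times."

first_position = x.find("times")        # index of the first match
occurrences = x.count("times")          # number of matches
replaced = x.replace("times", "burritos")

print(first_position)  # 12
print(occurrences)     # 2
print(replaced)        # The best of burritos, the worst of burritos.

# Strings are immutable: x itself is unchanged unless we reassign it.
print(x)               # The best of times, the worst of times.
```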

## Data Structures
Python has a bunch of data structures that are designed for different uses. The main ones we'll use are lists, tuples, sets, and dictionaries.

### Lists
A Python list stores comma-separated values. In our case, these values will be strings and numbers.

In [74]:
mylist = ['a','b','c']

In [75]:
mylist

['a', 'b', 'c']

In [76]:
mylist2 = [1,2,3,4,5]
mylist2

[1, 2, 3, 4, 5]

Each item in the list has a position, or index. By using a list index you can get back an individual list item.
Remember that in most programming languages (but not R!), counting starts at 0, so to get the first item, we use index 0.

In [77]:
mylist[0]

'a'

In [78]:
mylist2[0]

1

We can also use a range of indexes to get back a range of items from our list.

In [79]:
mylist[0:2] # Notice what gets returned

['a', 'b']

In [80]:
mylist[:3]

['a', 'b', 'c']

In [81]:
mylist2[2:]

[3, 4, 5]

You can also get the ends of lists:

In [None]:
mylist2[-2:]
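
Negative indexes work in slices the same way they do for single items. A short sketch with the same list and made-up variable names:

```python
mylist2 = [1, 2, 3, 4, 5]

last_two = mylist2[-2:]        # from the second-to-last item to the end
all_but_last_two = mylist2[:-2]
last_item = mylist2[-1]

print(last_two)          # [4, 5]
print(all_but_last_two)  # [1, 2, 3]
print(last_item)         # 5
```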

The classic way to add something to a list is `append`:

In [None]:
mylist.append('g')
mylist
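
One detail worth noticing is that `append` changes the list in place and returns `None`, so its result should not be assigned back to the list. A sketch with made-up variable names:

```python
mylist = ['a', 'b', 'c']

result = mylist.append('g')   # modifies mylist in place

print(result)   # None
print(mylist)   # ['a', 'b', 'c', 'g']

# By contrast, + builds a new list and leaves the original alone.
longer = mylist + ['h']
print(longer)   # ['a', 'b', 'c', 'g', 'h']
```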

### Tuples
We won't talk about tuples a bunch right now, but think of them as lists' immutable cousins. That means you can't change the elements.

In [None]:
mytuple = ('a','b','c')
mytuple

In [None]:
mytuple[0] = 'g'  # raises a TypeError because tuples are immutable
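
Assigning to a tuple element raises a `TypeError`. The sketch below catches that error so the code keeps running, and shows one way to get a "changed" tuple by building a new one. The names `message` and `new_tuple` are made up for this example.

```python
mytuple = ('a', 'b', 'c')

try:
    mytuple[0] = 'g'          # not allowed: tuples are immutable
except TypeError as err:
    message = str(err)
    print("TypeError:", message)

# Build a new tuple instead of modifying the old one.
new_tuple = ('g',) + mytuple[1:]
print(new_tuple)   # ('g', 'b', 'c')
print(mytuple)     # ('a', 'b', 'c')
```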

### Sets
Sets are like lists where you can only have unique elements. They are _incredibly_ useful.

In [None]:
myset = set(['a','b','c','a','d','b'])
myset # notice the change from square brackets to braces
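
Part of what makes sets useful is fast membership testing and the built-in set operations. Here is a small sketch; the second set, `other`, is made up for this example, and `sorted` is used only so the printed order is predictable.

```python
myset = set(['a', 'b', 'c', 'a', 'd', 'b'])
other = {'c', 'd', 'e'}

print(len(myset))             # 4, because duplicates are dropped
print('a' in myset)           # True

in_both = myset & other       # intersection
in_either = myset | other     # union
only_in_myset = myset - other # difference

print(sorted(in_both))        # ['c', 'd']
print(sorted(in_either))      # ['a', 'b', 'c', 'd', 'e']
print(sorted(only_in_myset))  # ['a', 'b']
```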

### Dictionaries
These need an entire lecture of their own, but here's an opportunity for self-study.

What does `zip` do? What does `dict(zip(mylist,[1,2,3,4]))` do?
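
As a starting point for that self-study, here is a sketch on made-up data (`names`, `values`, and `lookup` are hypothetical names, not part of the lecture). It pairs up two lists with `zip` and turns the pairs into a dictionary.

```python
names = ['x', 'y', 'z']
values = [10, 20, 30]

# zip pairs items by position; list() lets us look at the pairs
pairs = list(zip(names, values))
print(pairs)         # [('x', 10), ('y', 20), ('z', 30)]

# dict() turns (key, value) pairs into a dictionary
lookup = dict(zip(names, values))
print(lookup['y'])   # 20

# zip stops at the end of the shorter input
short_pairs = list(zip(names, [1, 2]))
print(short_pairs)   # [('x', 1), ('y', 2)]
```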

## Modules

In [82]:
import random

This includes the whole module and makes it available for use later in the program. For example, we can do:

In [83]:
random.seed(42)

x = [1,2,3,4,5,6,7,8]

print(random.choice(x))

print(random.sample(x,3))

2
[1, 6, 3]


Some notes on what happened in that previous cell:
* setting a seed allows our results to be reproducible;
* `random.choice` picks an element out of `x`;
* `random.sample` picks a specified number of elements out of `x`;
* we use `print` or, often, `pprint` to print out intermediate results. 
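
The notes above can be sketched as follows: resetting the seed to the same value makes the same "random" draws come out again. The names `first_run` and `second_run` are made up for this example, and `pprint` is imported from the standard library's `pprint` module.

```python
import random
from pprint import pprint

x = [1, 2, 3, 4, 5, 6, 7, 8]

random.seed(42)
first_run = (random.choice(x), random.sample(x, 3))

random.seed(42)   # same seed, so the same draws repeat
second_run = (random.choice(x), random.sample(x, 3))

print(first_run == second_run)  # True

# pprint formats nested structures so they are easier to read
pprint({"choice": first_run[0], "sample": first_run[1]})
```

Without the second call to `random.seed`, the second pair of draws would generally differ from the first.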

One thing that takes some getting used to: everything that has run before is available in memory. So typing `x` in the cell below shows us the list defined above.

In [None]:
x

Now, in command mode (not "edit" mode), press `0` twice to restart the kernel. After the restart, this cell raises a `NameError` because `x` is no longer defined:

In [None]:
x

Similarly, the libraries that had been imported are no longer available:

In [None]:
random.random()

In [84]:
x

[1, 2, 3, 4, 5, 6, 7, 8]