# Reminder about Python

The following is not an introduction to Python, but a reminder about syntax and a couple of key points.

This is a Jupyter notebook.
To evaluate a cell, press Shift-Enter or Ctrl-Enter. Shift-Enter moves you onto the next cell. 
The cells in this tutorial depend on each other, so evaluate them in order.
Take a minute to poke around the menus, looking at the keyboard shortcuts if that's your kind of thing.

## Prelaunch

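Python has many packages.
To make them available, use the `import` statement, which comes in a few forms: pull a single name out of a module, import a module under a shorter alias, or import a module as-is.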

In [37]:
from os import getcwd
import numpy as np
import re

print getcwd()
print np.log(10)
print re.sub('big', 'medium', 'big data')

/home/matsen/repos/i-heart-pandas
2.30258509299
medium data


## Syntax

### Functions, control flow, and loops

In [38]:
range(4)

[0, 1, 2, 3]

In [39]:
def square(x):
    if isinstance(x, int):
        return x*x
    elif isinstance(x, str):
        return x + " is so square."
    else:
        return "■"

for x in [3, 'Owning a car', 3.]:
    print square(x)

9
Owning a car is so square.
■


### Lists

A list can hold objects of any type, even functions.
Note that lists are zero-indexed, meaning that `l[0]` gives you the first element in a list.

In [40]:
l = [square, 5]
l[1]

5

In [41]:
l[0](l[1])

25

### Dictionaries

Dictionaries map "keys" to "values".

In [42]:
d = {'tuba': 'Party'}
d['plastic'] = ' skunk'
d[-24] = -5
d

{-24: -5, 'plastic': ' skunk', 'tuba': 'Party'}
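
Reading values back out is the other half of using a dictionary. The following is a minimal sketch; the dictionary `inventory` and its contents are made up for illustration. Indexing with a missing key raises `KeyError`, while `get` returns a fallback instead.

```python
# Hypothetical dictionary, used only for illustration.
inventory = {'tuba': 2, 'skunk': 0}

# Membership tests look at the keys, not the values.
has_tuba = 'tuba' in inventory        # True
has_oboe = 'oboe' in inventory        # False

# get() returns a fallback instead of raising KeyError for a missing key.
oboe_count = inventory.get('oboe', 0)

# Plain indexing on a missing key raises KeyError.
try:
    inventory['oboe']
    missing_raises = False
except KeyError:
    missing_raises = True
```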

### Comprehensions

List and dictionary comprehensions are handy ways to build a new list or dictionary from something you can loop over.

In [43]:
[type(x) for x in l]

[function, int]

In [8]:
[k + d[k] for k in d]

[-29, 'tubaParty', 'plastic skunk']

In [9]:
{'fun '+str(k): d[k] for k in d}

{'fun -24': -5, 'fun plastic': ' skunk', 'fun tuba': 'Party'}
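
Comprehensions can also filter as they go, by adding an `if` clause at the end. A small sketch, with `numbers` as made-up example data:

```python
# Hypothetical data, used only for illustration.
numbers = [1, 2, 3, 4, 5, 6]

# Keep only the even numbers, squaring each one.
even_squares = [n * n for n in numbers if n % 2 == 0]

# The same filter in a dictionary comprehension: even number -> its square.
square_of = {n: n * n for n in numbers if n % 2 == 0}
```

Here `even_squares` ends up as `[4, 16, 36]` and `square_of` maps `2`, `4`, and `6` to their squares.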

### Objects

Everything in Python is an object.
Given an object `x`, you can call its methods using the period syntax, as in `x.method()`.

In [10]:
'skunk'.upper()

'SKUNK'

In [11]:
d.items()

[(-24, -5), ('tuba', 'Party'), ('plastic', ' skunk')]

In Jupyter, you can see what methods an object has by typing the object name, period, and then pressing tab.
Try it!

In [12]:
d.

SyntaxError: invalid syntax (<ipython-input-12-6b23fc89e26c>, line 1)

You can also see documentation about the method by typing the method name, an open parenthesis, and then pressing Shift-Tab.

In [None]:
d.pop(
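
Outside Jupyter, or in a script, the built-ins `dir` and `help` give roughly the same information as the Tab and Shift-Tab tricks. A minimal sketch, using a throwaway dictionary called `example` so that `d` above is left alone:

```python
# Throwaway dictionary, named differently from d on purpose.
example = {'tuba': 'Party'}

# dir() lists attribute names as strings; drop the ones starting with an underscore.
public_methods = [name for name in dir(example) if not name.startswith('_')]

# The docstring of a method is the text that help(example.pop) builds on.
pop_doc = example.pop.__doc__
```

Printing `public_methods` shows names such as `'items'`, `'keys'`, and `'pop'`; calling `help(example.pop)` interactively shows the documentation.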

### Python "variables" are better thought of as nametags

Python is a little different from other languages in how variables work.
These differences are wonderfully explained by [this article by David Goodger](http://python.net/~goodger/projects/pycon/2007/idiomatic/handout.html).
I've extracted the following snippet because of its importance.


In Python, a "name" or "identifier" is like a parcel tag (or nametag) attached to an object.
```
a = 1
```
![](http://python.net/~goodger/projects/pycon/2007/idiomatic/a1tag.png)

Here, an integer 1 object has a tag labelled `a`.

If we reassign to "a", we just move the tag to another object:

```
a = 2
```
![](http://python.net/~goodger/projects/pycon/2007/idiomatic/a2tag.png) 

Now the name `a` is attached to an integer 2 object.
The original integer 1 object no longer has a tag `a`. 

![](http://python.net/~goodger/projects/pycon/2007/idiomatic/1.png)


It may live on, but we can't get to it through the name `a`. (When an object has no more references or tags, it is removed from memory.)

If we assign one name to another, we're just attaching another nametag to an existing object:

```
b = a
```
![](http://python.net/~goodger/projects/pycon/2007/idiomatic/ab2tag.png)
The name `b` is just a second tag bound to the same object as `a`.
Although we commonly refer to "variables" even in Python (because it's common terminology), we really mean "names" or "identifiers". In Python, "variables" are nametags for values, not labelled boxes.

Let's see how this works.

In [13]:
a = 2
b = a
b

2

Quiz: what happens now?

In [14]:
a=3
b

2

We reassigned the `a` tag, which didn't impact the `b` tag. 
However, the situation is quite different when the tag is attached to an object that can be changed in place, such as a list.

In [15]:
a = [7,8,9]
b = a
b

[7, 8, 9]

Now when we modify the entries of `b`, we also modify `a`:

In [16]:
b[1] = 'chicken'
a

[7, 'chicken', 9]
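
The `is` operator tells you whether two names are tags on the same object, and `list(...)` makes a separate copy when sharing is not what you want. A minimal sketch, using the made-up names `first`, `second`, and `separate` so as not to disturb `a` and `b`:

```python
first = [7, 8, 9]
second = first            # a second nametag on the same list
separate = list(first)    # a new list with the same entries (a shallow copy)

same_object = first is second         # True: one object, two names
copy_is_same = first is separate      # False: separate is a different object

second[1] = 'chicken'     # changes the shared list, visible through first
# first is now [7, 'chicken', 9]; separate is still [7, 8, 9]
```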

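### Python is pass by reference

More precisely, a function argument is a new nametag on the same object the caller holds, so changes made in place inside the function are visible outside it.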

In [17]:
def noodle(l):
    l[2] = 'different!'

print 'before: ', b
noodle(b)
b

before:  [7, 'chicken', 9]


[7, 'chicken', 'different!']
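
The nametag picture also explains what a function cannot do: reassigning the argument name inside the function only moves the local tag and leaves the caller's object alone. A small sketch, with the hypothetical functions `mutate` and `rebind`:

```python
def mutate(l):
    # Changes the list object that the caller passed in.
    l[0] = 'changed'

def rebind(l):
    # Moves the local nametag l to a brand-new list;
    # the caller's list is untouched.
    l = ['brand', 'new']
    return l

original = [1, 2, 3]

rebind(original)
after_rebind = list(original)   # [1, 2, 3]

mutate(original)
after_mutate = list(original)   # ['changed', 2, 3]
```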

### Python indexing
Let's play with a list of one-letter strings to get used to Python's indexing rules.

In [18]:
ab = list('abcdefgh')
ab

['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h']

Below: look carefully! The entry at index 3 is not included.

In [19]:
ab[1:3]

['b', 'c']

In [20]:
ab[:3]

['a', 'b', 'c']

In [21]:
ab[-1]

'h'

In [22]:
ab[-2:]

['g', 'h']

In [23]:
ab[:-2]

['a', 'b', 'c', 'd', 'e', 'f']

In [24]:
ab[-4:-2]

['e', 'f']
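
The rule behind all of these slices is that the start index is included and the stop index is excluded. A short sketch that checks a few consequences of that rule, using a fresh list called `letters` (a made-up name) rather than `ab`:

```python
letters = list('abcdefgh')

# A slice letters[i:j] starts at index i and stops just before index j,
# so it has j - i entries.
middle = letters[1:3]                       # ['b', 'c']
length_matches = len(middle) == 3 - 1

# Splitting at any position k and gluing the halves back gives the original.
k = 3
rejoined = letters[:k] + letters[k:]

# Negative indices count from the end:
# letters[-2:] is the same as letters[len(letters) - 2:].
tail_same = letters[-2:] == letters[len(letters) - 2:]
```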

Indexing works for assignment too.

In [25]:
ab[-4:-2] = ['BIG', 'ONE']
ab

['a', 'b', 'c', 'd', 'BIG', 'ONE', 'g', 'h']

### Generators, iterators, enumerators

Generators are a way of expressing "implicit" lists: they produce their items one at a time, on demand, so the whole list never has to be stored in memory.
We can make them with syntax similar to list comprehensions.

In [26]:
it = (square(x) for x in range(500))
it

<generator object <genexpr> at 0x7fa0af74e230>

In [27]:
sum(it)

41541750

*Note:* generators get "used up": once you have iterated over one, it is empty, so a second pass produces nothing.

In [28]:
sum(it)

0

For this reason it's sometimes useful to turn them into lists.

In [29]:
it = (square(x) for x in range(4))
list(it)

[0, 1, 4, 9]
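
Generators can also be written as functions that use `yield`, which makes it easy to get a fresh one whenever the old one is used up. A minimal sketch; the function name `squares` is made up for illustration:

```python
def squares(n):
    # Hypothetical generator function: yields 0, 1, 4, ... one value at a time.
    for x in range(n):
        yield x * x

gen = squares(4)
first_value = next(gen)   # 0, computed only when asked for
rest = list(gen)          # [1, 4, 9], whatever has not been consumed yet
leftover = list(gen)      # [] because the generator is now exhausted

# Calling the function again gives a fresh generator.
total = sum(squares(4))   # 0 + 1 + 4 + 9
```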

Enumerators are similar: `enumerate` hands out `(index, item)` pairs one at a time, and it too gets used up as you go.

In [30]:
e = enumerate(ab)
e

<enumerate at 0x7fa0af74ebe0>

In [31]:
next(e)

(0, 'a')

In [32]:
list(e)

[(1, 'b'), (2, 'c'), (3, 'd'), (4, 'BIG'), (5, 'ONE'), (6, 'g'), (7, 'h')]
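
In practice `enumerate` is mostly used directly in a `for` loop, and its optional `start` argument sets where the counter begins. A small sketch, with `words` as made-up example data:

```python
words = ['a', 'b', 'c']   # hypothetical data

# start=1 makes the counter begin at 1 instead of 0.
numbered = list(enumerate(words, start=1))

# The usual pattern: unpack index and item in a for loop.
labels = []
for i, w in enumerate(words, start=1):
    labels.append('{}: {}'.format(i, w))
```

Here `numbered` is `[(1, 'a'), (2, 'b'), (3, 'c')]` and `labels` is `['1: a', '2: b', '3: c']`.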

### Strings

You may have noticed that there are two kinds of quotes in Python, single and double quotes.
They are equivalent, and a string delimited by one kind can contain the other kind without escaping.

In [33]:
print 'He said "wow".', "That's kind of strange."

He said "wow". That's kind of strange.


Python has nice [string formatting](https://docs.python.org/2/library/string.html#format-string-syntax).

In [34]:
for i, c in enumerate(a):
    print 'Entry {} is "{}".'.format(i+1, c)

Entry 1 is "7".
Entry 2 is "chicken".
Entry 3 is "different!".
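
The format string syntax can do more than fill in `{}` in order. A brief sketch of three common variations (the values are made up for illustration):

```python
# Positional fields can be numbered, which lets you reuse or reorder them.
reordered = '{1} before {0}, then {1} again'.format('a', 'b')

# Named fields read well when there are several values.
named = '{name} has {n} entries'.format(name='ab', n=8)

# A format spec after the colon controls presentation, here 3 decimal places.
rounded = '{:.3f}'.format(2.30258509299)
```

These give `'b before a, then b again'`, `'ab has 8 entries'`, and `'2.303'` respectively.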


In [35]:
sl = 'This is handy.'.split()
sl

['This', 'is', 'handy.']

In [36]:
' '.join(sl)

'This is handy.'
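
Both `split` and `join` work with any separator string, not just spaces. A minimal sketch, with `line` as a made-up input:

```python
line = 'tuba,plastic,skunk'   # hypothetical input

fields = line.split(',')      # ['tuba', 'plastic', 'skunk']
dashed = '-'.join(fields)     # 'tuba-plastic-skunk'

# split() with no argument splits on any run of whitespace
# and drops the empty pieces at the ends.
words = '  This   is handy.  '.split()

# join() needs strings, so convert other types first.
numbers = ', '.join(str(n) for n in [1, 2, 3])
```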