### Introduction to Software Engineering, Data Science, and Deep Learning #
Shenzhen University  
Computer Vision and Machine Learning Research Group  
Instructor: Yan Yan  
Email: yyan@szu.edu.cn  
Lec 2:
- Introduction to Python 3  
- Data Structures in Python

# Core Python Language #

Mostly copied from the [official python tutorial](https://docs.python.org/3/tutorial/)  
and from the [Python for Scientific Computing](https://github.com/rabernat/python_teaching)

There are three main ways to use python.

1. By running a python file, e.g. `python myscript.py`.
2. Through an interactive console (python interpreter or ipython shell).
3. In an interactive jupyter notebook.

We will be using the jupyter notebook.

## Basic Variables: Numbers and Strings ##

In [2]:
# comments are anything that comes after the "#" symbol
a = 1       # assign 1 to variable a
b = "hello" # assign "hello" to variable b

In [3]:
# how do we see our variables?
print(a)
print(b)
print(a,b)

1
hello
1 hello


All variables are objects. Every object has a type (class). To find out what type your variables are, use `type()`:

In [4]:
print(type(a))
print(type(b))

<class 'int'>
<class 'str'>


In [5]:
# as a shortcut, IPython and Jupyter notebooks automatically display the value of the last expression in a cell
type(b)

str

In [6]:
# we can check for the type of an object
print(type(a) is int)
print(type(a) is str)

True
False


Different objects have different attributes and methods, which can be accessed via the syntax `variable.method`.

In [7]:
# this returns the method itself
b.capitalize

<function str.capitalize>

In [8]:
# this calls the method
b.capitalize()
# there are lots of other methods

'Hello'
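As a minimal sketch of a few of those other string methods (the variable `greeting` is an illustrative name, not part of the original notebook):

```python
# A few commonly used str methods. Each one returns a new object,
# because strings themselves are immutable.
greeting = "hello world"

upper = greeting.upper()                          # 'HELLO WORLD'
replaced = greeting.replace("world", "python")    # 'hello python'
words = greeting.split()                          # ['hello', 'world']
starts = greeting.startswith("hello")             # True

print(upper, replaced, words, starts)
print(greeting)  # the original string is unchanged: 'hello world'
```

In a notebook, typing `greeting.` and pressing Tab lists the available methods.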

In [9]:
# binary operations act differently on different types of objects
c = 'World'
print(b + c)
print(a + 2)
print(a + b)

helloWorld
3


TypeError: unsupported operand type(s) for +: 'int' and 'str'
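Python does not guess how to combine an `int` and a `str`, so the conversion has to be explicit. A small sketch, reusing the values of `a` and `b` from above (the names `as_text` and `as_number` are illustrative):

```python
a = 1
b = "hello"

# Convert the int to a str to concatenate text...
as_text = b + str(a)        # 'hello1'

# ...or convert a numeric string to an int to do arithmetic.
as_number = a + int("2")    # 3

print(as_text)
print(as_number)
```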

There are many different ways to interact with lists. Exploring them is part of the fun of python.

__list.append(x)__ Add an item to the end of the list. Equivalent to a[len(a):] = [x].

__list.extend(L)__ Extend the list by appending all the items in the given list. Equivalent to a[len(a):] = L.

__list.insert(i, x)__ Insert an item at a given position. The first argument is the index of the element before which to insert, so a.insert(0, x) inserts at the front of the list, and a.insert(len(a), x) is equivalent to a.append(x).

__list.remove(x)__ Remove the first item from the list whose value is x. It is an error if there is no such item.

__list.pop([i])__ Remove the item at the given position in the list, and return it. If no index is specified, a.pop() removes and returns the last item in the list. (The square brackets around the i in the method signature denote that the parameter is optional, not that you should type square brackets at that position. You will see this notation frequently in the Python Library Reference.)

__list.clear()__ Remove all items from the list. Equivalent to del a[:].

__list.index(x)__ Return the index in the list of the first item whose value is x. It is an error if there is no such item.

__list.count(x)__ Return the number of times x appears in the list.

__list.sort()__ Sort the items of the list in place.

__list.reverse()__ Reverse the elements of the list in place.

__list.copy()__ Return a shallow copy of the list. Equivalent to a[:].
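The methods above can be tried out in sequence. This is only an illustrative sketch; the list `fruits` and its contents are made up for the example, and each comment shows the state of the list after that line:

```python
fruits = ['pear', 'apple']

fruits.append('kiwi')             # ['pear', 'apple', 'kiwi']
fruits.extend(['apple', 'fig'])   # ['pear', 'apple', 'kiwi', 'apple', 'fig']
fruits.insert(0, 'plum')          # ['plum', 'pear', 'apple', 'kiwi', 'apple', 'fig']
fruits.remove('apple')            # first 'apple' removed: ['plum', 'pear', 'kiwi', 'apple', 'fig']
last = fruits.pop()               # returns 'fig'; list is ['plum', 'pear', 'kiwi', 'apple']
where = fruits.index('kiwi')      # 2
n_apples = fruits.count('apple')  # 1
fruits.sort()                     # ['apple', 'kiwi', 'pear', 'plum']
fruits.reverse()                  # ['plum', 'pear', 'kiwi', 'apple']
backup = fruits.copy()            # independent shallow copy
fruits.clear()                    # []

print(last, where, n_apples)
print(backup)
print(fruits)
```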


Don't assume you know how list operations work!
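Two behaviours that often surprise newcomers are sketched below: assignment does not copy a list, and in-place methods return `None`. The variable names are illustrative only.

```python
nums = [3, 1, 2]

# Assignment does not copy: both names refer to the same list object.
alias = nums
alias.append(0)
print(nums)          # [3, 1, 2, 0]

# .copy() makes an independent (shallow) copy.
independent = nums.copy()
independent.append(99)
print(nums)          # still [3, 1, 2, 0]

# In-place methods such as sort() return None, not the sorted list.
result = nums.sort()
print(result)        # None
print(nums)          # [0, 1, 2, 3]

# sorted() returns a new list and leaves its argument unchanged.
descending = sorted(nums, reverse=True)
print(descending)    # [3, 2, 1, 0]
```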

In [12]:
# "add" two lists
x = list(range(5))
y = list(range(10,15))
z = x + y
z

[0, 1, 2, 3, 4, 10, 11, 12, 13, 14]

In [21]:
# access items from a list
print('first', z[0])
print('last', z[-1])
print('first 3', z[:3])
print('last 3', z[-3:])
print('middle, skipping every other item', z[5:10:2])

first 0
last 14
first 3 [0, 1, 2]
last 3 [12, 13, 14]
middle, skipping every other item [10, 12, 14]


__MEMORIZE THIS SYNTAX!__ It is central to so much of python and often proves confusing for users coming from other languages.

In interval notation, python slicing is _left inclusive_, _right exclusive_: `z[a:b]` covers indices `a` through `b-1`. If you remember this, you will never go wrong.
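A short sketch of what the half-open convention implies in practice, using the same values as `z` above (the names `middle` and `k` are illustrative):

```python
z = [0, 1, 2, 3, 4, 10, 11, 12, 13, 14]

# z[start:stop] includes z[start] but excludes z[stop],
# so the slice has stop - start elements.
middle = z[2:5]
print(middle, len(middle))    # [2, 3, 4] 3

# Adjacent slices that share an endpoint neither overlap nor leave a gap.
k = 4
print(z[:k] + z[k:] == z)     # True

# Valid indices run from 0 to len(z) - 1, so z[len(z)] is out of range.
try:
    z[len(z)]
except IndexError as err:
    print('IndexError:', err)
```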

In [24]:
# that means the last valid index is N-1; z[N] would raise an IndexError
N = len(z)
z[N-1]

14

In [25]:
# this index notation also applies to strings
name = 'Yan Yan'
print(name[:4])

Yan 


In [26]:
# you can also test for the presence of items in a list
5 in z

False

Lists are not meant for math! Their items are not restricted to a single datatype.

In [27]:
z[4] = 'fish'
z

[0, 1, 2, 3, 'fish', 10, 11, 12, 13, 14]
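To make the "not meant for math" point concrete, here is a small sketch of how arithmetic operators behave on lists (the names `nums` and `doubled` are illustrative):

```python
nums = [1, 2, 3]

# "*" repeats the list rather than multiplying each item.
print(nums * 2)               # [1, 2, 3, 1, 2, 3]

# "+" concatenates rather than adding item by item.
print(nums + [10, 20, 30])    # [1, 2, 3, 10, 20, 30]

# Item-by-item arithmetic needs an explicit loop.
doubled = []
for item in nums:
    doubled.append(2 * item)
print(doubled)                # [2, 4, 6]
```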

Python is full of tricks for iterating over and working with lists.

In [28]:
# a cool python trick: list comprehension
squares = [n**2 for n in range(5)]
squares

[0, 1, 4, 9, 16]
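A comprehension can also filter items with a trailing `if`. The sketch below shows one alongside the equivalent explicit loop; the names `even_squares` and `even_squares_loop` are illustrative.

```python
# Comprehension with a condition: squares of the even numbers below 10.
even_squares = [n**2 for n in range(10) if n % 2 == 0]
print(even_squares)           # [0, 4, 16, 36, 64]

# The same result written as an ordinary loop.
even_squares_loop = []
for n in range(10):
    if n % 2 == 0:
        even_squares_loop.append(n**2)
print(even_squares_loop == even_squares)   # True
```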

In [29]:
# iterate over two lists together using zip
for item1, item2 in zip(x,y):
    print('first:', item1, 'second:', item2)

first: 0 second: 10
first: 1 second: 11
first: 2 second: 12
first: 3 second: 13
first: 4 second: 14


## Other Data Structures ##

We are almost there. We have the building blocks we need to do basic programming. But python has some other data structures we need to learn about.

## Tuples ##

Tuples are similar to lists, but they are _immutable_: they can't be extended or modified. What is the point of this? Generally speaking: to pack together heterogeneous data. Tuples can then be unpacked and distributed by other parts of your code.

Tuples may seem confusing at first, but with time you will come to appreciate them.

In [32]:
# tuples are created with parentheses, or just commas
a = ('orange', 'sweet' )
b = 'biscuit', 'crunch'
type(b)

tuple

In [33]:
# can be indexed like lists
print(a[1]) # index 1 is the second element, not the first!

sweet


In [43]:
# and they can be unpacked
name, feature = a
print(name)
print(feature)

orange
sweet
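Two consequences of the above, sketched under the assumption that `a` is the tuple defined earlier (the names `left` and `right` are illustrative): item assignment on a tuple fails, and packing plus unpacking gives a compact way to swap values.

```python
a = ('orange', 'sweet')

# Tuples are immutable: item assignment raises a TypeError.
try:
    a[0] = 'lemon'
except TypeError as err:
    print('TypeError:', err)

# Packing and unpacking together allow swapping without a temporary variable.
left, right = 'first', 'second'
left, right = right, left
print(left, right)    # second first
```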


## Dictionaries ##

This is an extremely useful data structure. It maps __keys__ to __values__.

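Before Python 3.7, dictionaries were unordered, which is why the outputs below do not follow insertion order. As of Python 3.7, dictionaries preserve insertion order.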

In [44]:
# different ways to create dictionaries
d = {'name': 'orange', 'feature': 'sweet'}
e = dict(name='biscuit', feature='crunch')
e

{'feature': 'crunch', 'name': 'biscuit'}

In [46]:
# access a value
d['name']

'orange'

Square brackets ``[...]`` are python for "get item" in many different contexts.

In [47]:
# test for the presence of a key
print('name' in d)
print('height' in e)

True
False


In [48]:
# try to access a nonexistent key
d['height']

KeyError: 'height'
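When a missing key is an expected situation rather than an error, `dict.get` returns a default instead of raising. A minimal sketch using the same contents as `d` above (the default `'unknown'` is an arbitrary choice for illustration):

```python
d = {'name': 'orange', 'feature': 'sweet'}

# get() returns None when the key is missing...
height = d.get('height')
print(height)                          # None

# ...or a default of your choosing.
height_or_default = d.get('height', 'unknown')
print(height_or_default)               # unknown

# Existing keys behave as with square brackets.
name = d.get('name', 'unknown')
print(name)                            # orange
```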

In [49]:
# add a new key
d['size'] = (5,6) # a tuple
d

{'feature': 'sweet', 'name': 'orange', 'size': (5, 6)}

In [50]:
# keys don't have to be strings
d[99] = 'ninety nine'
d

{99: 'ninety nine', 'name': 'orange', 'feature': 'sweet', 'size': (5, 6)}

In [51]:
# iterate over keys
for k in d:
    print(k, d[k])

99 ninety nine
name orange
feature sweet
size (5, 6)


In [53]:
# better way: iterate over key-value pairs
# (python 2 used d.iteritems() instead of d.items())
for key, val in d.items():
    print(key, val)

99 ninety nine
name orange
feature sweet
size (5, 6)


## Functions ##

Functions are a central part of advanced python programming. You should try to write and use your own functions as often as possible.

In [54]:
# define a function
def say_hello():
    """Return the word hello."""
    return 'Hello'

In [55]:
# functions are also objects
type(say_hello)

function

In [56]:
# this doesn't call the function; in IPython, a trailing "?" displays its help
say_hello?

In [57]:
# this does
say_hello()

'Hello'

In [58]:
# assign the result to something
res = say_hello()
res

'Hello'

In [59]:
# take some arguments
def say_hello_to(name):
    """Return a greeting to `name`"""
    return 'Hello ' + name

In [60]:
# intended usage
say_hello_to('World')

'Hello World'

In [61]:
say_hello_to(10)

TypeError: Can't convert 'int' object to str implicitly

In [62]:
# redefine the function
def say_hello_to(name):
    """Return a greeting to `name`"""
    return 'Hello ' + str(name)

In [63]:
say_hello_to(10)

'Hello 10'

In [64]:
# take an optional keyword argument
def say_hello_or_hola(name, spanish=False):
    """Say hello in multiple languages."""
    if spanish:
        greeting = 'Hola '
    else:
        greeting = 'Hello '
    return greeting + name

In [66]:
print(say_hello_or_hola('Yan'))
print(say_hello_or_hola('Yan', spanish=True))


Hello Yan
Hola Yan


In [67]:
# flexible number of arguments
def say_hello_to_everyone(*args):
    return ['hello ' + str(a) for a in args]

In [72]:
say_hello_to_everyone('Yan', 'Hu', 'Jin')

['hello Yan', 'hello Hu', 'hello Jin']
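If the names are already stored in a list, the `*` operator at the call site unpacks them into separate arguments. The sketch below repeats the function definition so that it runs on its own; the list `names` is illustrative.

```python
def say_hello_to_everyone(*args):
    return ['hello ' + str(a) for a in args]

names = ['Yan', 'Hu', 'Jin']

# Passing the list itself: args is a 1-tuple containing the whole list.
whole = say_hello_to_everyone(names)
print(whole)       # ["hello ['Yan', 'Hu', 'Jin']"]

# Unpacking with *: each item becomes its own argument.
unpacked = say_hello_to_everyone(*names)
print(unpacked)    # ['hello Yan', 'hello Hu', 'hello Jin']
```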

We could spend the rest of the day talking about functions, but we have to move on.

# Individual Exercises #

## Fibonacci Sequence ##

The Fibonacci sequence is 1, 1, 2, 3, 5, 8, ..., in which each number is the sum of the two preceding ones. Write a function to compute the first n terms of the Fibonacci sequence. (Hint: use some list methods.)

In [73]:
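def fib(n):
    """Return the first n Fibonacci numbers as a list."""
    seq = [1, 1]
    for i in range(n - 2):
        seq.append(seq[-1] + seq[-2])
    return seq[:n]  # slicing gives the right length when n < 2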

In [74]:
fib(10)

[1, 1, 2, 3, 5, 8, 13, 21, 34, 55]

## Add and Multiply Lists Item by Item ##

Write functions to add and multiply two lists item by item.

In [75]:
def add(x,y):
    return [a + b for a,b in zip(x,y)]

def multiply(x,y):
    return [a * b for a,b in zip(x,y)]

In [76]:
add(range(10), range(10))

[0, 2, 4, 6, 8, 10, 12, 14, 16, 18]

In [77]:
multiply(range(5), [9,3,2,5,3])

[0, 3, 4, 15, 12]

In [150]:
N = 100000
%timeit multiply(range(N), range(N))

100 loops, best of 3: 12.7 ms per loop


In [151]:
import numpy as np
%timeit np.arange(N) * np.arange(N)

The slowest run took 7.60 times longer than the fastest. This could mean that an intermediate result is being cached 
10000 loops, best of 3: 173 µs per loop
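`%timeit` is an IPython magic and only works in a notebook or IPython shell. Outside of those, a rough equivalent can be sketched with the standard-library `timeit` module. The repeat count of 10 is an arbitrary choice for illustration, and the absolute timings will differ from machine to machine.

```python
import timeit

def multiply(x, y):
    return [a * b for a, b in zip(x, y)]

N = 100000
runs = 10  # arbitrary number of repetitions for this sketch

# timeit.timeit calls the function `runs` times and returns the total seconds.
total = timeit.timeit(lambda: multiply(range(N), range(N)), number=runs)
per_call = total / runs
print(f'{per_call * 1e3:.2f} ms per call')
```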


On to NumPy.