# Modules/packages/libraries

Definitions:

  * Modules:
  A module is a file that contains Python functions, global variables, etc. It is simply a `.py` file holding executable Python code and statements.

  * Packages:
  A package is a namespace that contains multiple packages/modules. It is a directory that contains a special file, `__init__.py`.
  
  * Libraries:
  A library is a collection of packages. Conceptually, there is no difference between a package and a Python library.
  
Modules/packages/libraries can be easily "imported" and used in your Python code. A set of libraries (the standard library) comes with every Python installation. Others can be installed locally and then imported. Your own code, located elsewhere on your computer, can be imported too.

Further details (very important!) on packages and how to create them can be found online. We may need to create our own during the course.
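
As a minimal sketch of what "creating a package" and "importing your own code" can look like, the cell below builds a throwaway package in a temporary directory and imports it. The names `demo_pkg`, `shapes` and `square_area` are made up for this illustration; a real package would normally live in your project folder rather than in a temporary one.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Build a throwaway package on disk; "demo_pkg" and "shapes" are made-up names.
root = Path(tempfile.mkdtemp())
pkg_dir = root / "demo_pkg"
pkg_dir.mkdir()
(pkg_dir / "__init__.py").write_text("")  # marks the directory as a package
(pkg_dir / "shapes.py").write_text(
    "def square_area(side):\n"
    "    return side * side\n"
)

# Make the parent directory visible to the import system, then import as usual.
sys.path.insert(0, str(root))
importlib.invalidate_caches()

from demo_pkg import shapes

print(shapes.square_area(3))  # 9
```

Adding a directory to `sys.path` is only one of several ways to make local code importable; it is used here because it keeps the example self-contained.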

In [1]:
# all the "stuff" that is in the math library can be used
import math
print(math.pi)

# you can give math a label for convenience
import math as m
print (m.pi)

# alternatively you can import only a given "thing" from the library
from math import pi    # you can import several names at once, just list them separated by ", "
print (pi)

# or just get everything (very dangerous!!!), this can cause name conflicts
from math import *
print (sqrt(7))

3.141592653589793
3.141592653589793
3.141592653589793
2.6457513110645907
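
The name conflict mentioned in the cell above can be sketched as follows. Explicit imports are used instead of `*` to keep the example short, but the effect is the same: a later import silently rebinds a name that an earlier import introduced. The variable names are chosen only for this illustration.

```python
import math
import cmath

from math import sqrt
real_root = sqrt(4)        # math.sqrt returns a float

from cmath import sqrt     # silently rebinds the name "sqrt"
complex_root = sqrt(4)     # cmath.sqrt returns a complex number

print(real_root)           # 2.0
print(complex_root)        # (2+0j)

# Qualified names keep the two functions apart
print(math.sqrt(4), cmath.sqrt(4))
```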


To see which modules are available to you, just type:

In [2]:
help('modules')  # help() prints the listing itself and returns None, so no print() is needed



Please wait a moment while I gather a list of all available modules...



The matplotlib.compat module was deprecated in Matplotlib 3.3 and will be removed two minor releases later.
  __import__(info.name)
    Install tornado itself to use zmq with the tornado IOLoop.
    
  yield from walk_packages(path, info.name+'.', onerror)


DEBUG:pip.vcs:Registered VCS backend: git
DEBUG:pip.vcs:Registered VCS backend: hg
DEBUG:pip.vcs:Registered VCS backend: svn
DEBUG:pip.vcs:Registered VCS backend: bzr
AptUrl              chunk               libImt              ptyprocess
CommandNotFound     cmath               libJupyROOT         pvectorc
Crypto              cmd                 libKrb5Auth         pwd
DistUpgrade         cmdLineUtils        libMLP              py_compile
HweSupportStatus    code                libMathCore         pyatspi
IPython             codecs              libMathMore         pyclbr
JsMVA               codeop              libMatrix           pycparser
JupyROOT            collections         libMemStat          pydoc
LanguageSelector    colorama            libMinuit           pydoc_data
NvidiaDetector      colorsys            libMinuit2          pyexpat
PIL                 compileall          libMultiProc        pygments
Quirks              concurrent          libNet              pygtkcompat
ROOT   

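`pip` is a special package. It is used from the command line to install new packages properly (e.g. choosing versions compatible with the packages already installed). The set of installed packages and their versions can also be checked from within Python. N.B.: only the packages installed on top of the default ones will be listed.

In [5]:
# pip is meant to be run as an executable; its Python internals are not a stable API
# (pip.get_installed_distributions() was removed in pip 10),
# so the standard library is used here instead (Python 3.8+)
from importlib.metadata import distributions
sorted("%s==%s" % (d.metadata["Name"].lower(), d.version) for d in distributions())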

['argon2-cffi==20.1.0',
 'async-generator==1.10',
 'attrs==20.2.0',
 'backcall==0.2.0',
 'bleach==3.2.1',
 'certifi==2020.6.20',
 'cffi==1.14.3',
 'chardet==3.0.4',
 'cycler==0.10.0',
 'decorator==4.4.2',
 'defusedxml==0.6.0',
 'entrypoints==0.3',
 'idna==2.10',
 'importlib-metadata==2.0.0',
 'ipykernel==5.3.4',
 'ipython-genutils==0.2.0',
 'ipython==7.16.1',
 'jedi==0.17.2',
 'jinja2==2.11.2',
 'joblib==0.16.0',
 'json5==0.9.5',
 'jsonschema==3.2.0',
 'jupyter-client==6.1.7',
 'jupyter-core==4.6.3',
 'jupyterlab-pygments==0.1.2',
 'jupyterlab-server==1.2.0',
 'jupyterlab==2.2.8',
 'kiwisolver==1.2.0',
 'markupsafe==1.1.1',
 'matplotlib==3.3.2',
 'mistune==0.8.4',
 'nbclient==0.5.0',
 'nbconvert==6.0.6',
 'nbformat==5.0.7',
 'nest-asyncio==1.4.1',
 'notebook==6.1.4',
 'numpy==1.19.2',
 'packaging==20.4',
 'pandas==1.1.2',
 'pandocfilters==1.4.2',
 'parso==0.7.1',
 'pexpect==4.8.0',
 'pickleshare==0.7.5',
 'pillow==7.2.0',
 'prometheus-client==0.8.0',
 'prompt-toolkit==3.0.7',
 'ptyproc

# Functions

In [7]:
def square(x):
    """Square of x."""
    return x*x

def cube(x):
    """Cube of x."""
    return x*x*x

# create a dictionary of functions
funcs = {
    'square': square,
    'cube': cube,
}

x = 3
print(square(x))
print(cube(x))

for func in sorted(funcs):
    print (func, funcs[func](x))

9
27
cube 27
square 9


## Function arguments

What is passed to a function is a copy of the reference to the input object, not a copy of the object itself. Imagine we have a list *x = [1, 2, 3]*, i.e. a mutable object. If the content of *x* is changed directly within the function (e.g. *x[0] = 999*), then *x* changes outside the function as well.

In [7]:
def modify(x):
    x[0] = 999
    return x

x = [1,2,3]
print (x)
print (modify(x))
print (x)

[1, 2, 3]
[999, 2, 3]
[999, 2, 3]


However, if *x* is reassigned within the function to a new object (e.g. another list), then the local name *x* now points to the new object, but *x* outside the function is unchanged.

In [18]:
def no_modify(x):
    x = [4,5,6]
    return x

x = [1,2,3]
print (x)
print (no_modify(x))
print (x)


[1, 2, 3]
[4, 5, 6]
[1, 2, 3]


What if the function tries to modify the value of an immutable object?
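
One way to explore this question is the short sketch below (the function names are hypothetical). With an immutable object there is nothing to change in place: an operation like `n += 1` rebinds the local name to a new object, and an attempt at item assignment on a tuple raises a `TypeError`. In both cases the caller's object is left untouched.

```python
def try_modify(n):
    n += 1            # ints are immutable: this rebinds the local name to a new object
    return n

def try_modify_tuple(t):
    try:
        t[0] = 999    # tuples do not support item assignment
    except TypeError as err:
        return "TypeError: %s" % err
    return t

a = 10
print(try_modify(a))   # 11
print(a)               # 10

t = (1, 2, 3)
print(try_modify_tuple(t))
print(t)               # (1, 2, 3)
```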

Binding of default arguments occurs at function definition:

In [19]:
def f(x = []):
    x.append(1)
    return x

print (f())
print (f())
print (f(x = [9,9,9]))
print (f())
print (f())

[1]
[1, 1]
[9, 9, 9, 1]
[1, 1, 1]
[1, 1, 1, 1]
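
The binding at definition time can be observed directly: default values are stored on the function object, in its `__defaults__` attribute. The sketch below uses a hypothetical function `append_one` with the same pattern as `f` above.

```python
def append_one(items=[]):   # hypothetical name; same pattern as f above
    items.append(1)
    return items

print(append_one.__defaults__)  # ([],)
append_one()
append_one()
print(append_one.__defaults__)  # ([1, 1],)
```

The single list stored in `__defaults__` is reused by every call that does not pass `items`, which is why the results accumulate.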


Try to avoid that! Use `None` as the default and create the list inside the function:

In [4]:
def f(x = None):
    if x is None:
        x = []
    x.append(1)
    return x

print (f())
print (f())
print (f(x = [9,9,9]))
print (f())
print (f())

[1]
[1]
[9, 9, 9, 1]
[1]
[1]


## Higher order functions

A function that uses another function as an input argument or returns a function is known as a higher-order function (HOF). The most familiar examples are `map` and `filter`.

### map

The map function applies a function to each member of a collection.

In [9]:
x = list(map(square, range(5))) 

# map returns an iterator, so list() is used to collect the results; each element of x is an int
#print (type(map(square, range(5))))

# Note the difference w.r.t. Python 2: in Python 3 map returns an iterator, so you can do things like:
for i in map(square,range(5)): print(i)
    

0
1
4
9
16


### filter

The filter function applies a predicate to each member of a collection, retaining only those members where the predicate is True.

In [24]:
def is_even(x):
    return x%2 == 0

print (list(filter(is_even, range(5))))

[0, 2, 4]


In [25]:
list(map(square, filter(is_even, range(5))))


[0, 4, 16]

### reduce

The reduce function reduces a collection using a binary operator to combine items two at a time. More often than not, reduce can be substituted with a more efficient for loop. It is worth mentioning for its key role in big-data applications together with map (the map-reduce paradigm).
N.B.: it no longer exists as a built-in function in Python 3; it is now part of the `functools` library.

In [26]:
from functools import reduce
# it is not used that often anymore
def my_add(x, y):
    return x + y

# another implementation of the sum function
reduce(my_add, [1,2,3,4,5])

15
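
As a further sketch, `reduce` also accepts ready-made binary operators from the `operator` module and an optional starting value as third argument. The explicit loop at the end is the kind of for loop that can replace a `reduce` call; the variable names are chosen only for this illustration.

```python
from functools import reduce
import operator

# product of 1..5 using a ready-made binary operator
product = reduce(operator.mul, range(1, 6))
print(product)    # 120

# the optional third argument is the starting value,
# which is also what is returned for an empty input
empty_sum = reduce(operator.add, [], 0)
print(empty_sum)  # 0

# the explicit loop that reduce replaces
total = 0
for item in [1, 2, 3, 4, 5]:
    total = total + item
print(total)      # 15
```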

### zip

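zip is useful when you need to iterate over matched elements of multiple lists. It stops as soon as the shortest input is exhausted, so the extra element of `zs` below is silently dropped.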

In [29]:
xs = [1, 2, 3, 4]
ys = [10, 20, 30, 40]
zs = ['a', 'b', 'c', 'd', 'e']

for x, y, z in zip(xs, ys, zs):
    print (x, y, z)

1 10 a
2 20 b
3 30 c
4 40 d
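
In the output above the fifth element of `zs` never appears, because `zip` stops at the shortest input. If the unmatched elements should be kept instead, one option (a sketch, not the only approach) is `itertools.zip_longest`, which pads the shorter inputs with a fill value.

```python
from itertools import zip_longest

xs = [1, 2, 3, 4]
ys = [10, 20, 30, 40]
zs = ['a', 'b', 'c', 'd', 'e']

# shorter inputs are padded with fillvalue (None is also the default)
rows = list(zip_longest(xs, ys, zs, fillvalue=None))
for row in rows:
    print(row)
# the last row is (None, None, 'e')
```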


### Custom HOF

In [30]:
def custom_sum(xs, transform):
    """Returns the sum of xs after a user specified transform."""
    return sum(map(transform, xs))

xs = range(5)
print (custom_sum(xs, square))
print (custom_sum(xs, cube))



30
100


### Returning a function

In [38]:
def make_logger(target):
    def logger(data):
        with open(target, 'a') as f:
            f.write(data + '\n')
    return logger

foo_logger = make_logger('foo.txt') #foo.txt will be created if not there already
foo_logger('Hello')
foo_logger('World')

In [39]:

! cat 'foo.txt'

#in a code cell, whatever comes after ! is executed in the unix shell
#cat prints on the screen the content of a given file

Hello
World


## Anonymous functions (lambda)

When using functional style, there is often the need to create specific functions that perform a limited task as input to a HOF such as map or filter. In such cases, these functions are often written as anonymous or lambda functions. 
The syntax is as follows:

lambda *arguments* : *expression*


If you find it hard to understand what a lambda function is doing, it should probably be rewritten as a regular function.

In [40]:
add = lambda x, y: x + y  # not named "sum", which would shadow the built-in sum()
add(3, 4)

7

In [43]:
for i in map(lambda x: x*x, range(5)): print (i)

0
1
4
9
16


In [42]:
# what does this function do?
from functools import reduce
s1 = reduce(lambda x, y: x+y, map(lambda x: x**2, range(1,10)))
print(s1)


285


## Recursive functions 

In [44]:
def fib1(n):
    """Fib with recursion."""

    # base case
    if n==0 or n==1:
        return 1
    # recursive case
    else:
        return fib1(n-1) + fib1(n-2)

    
print ([fib1(i) for i in range(10)])

[1, 1, 2, 3, 5, 8, 13, 21, 34, 55]


In [45]:
# In Python, a more efficient version that does not use recursion is

def fib2(n):
    """Fib without recursion."""
    a, b = 0, 1
    for i in range(1, n+1):
        a, b = b, a+b
    return b

print ([fib2(i) for i in range(10)])

[1, 1, 2, 3, 5, 8, 13, 21, 34, 55]


In [47]:
# check indeed the timing:

%timeit fib1(20)
%timeit fib2(20)
# the second one is roughly 2500 times faster

4.23 ms ± 75.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
1.66 µs ± 17.3 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)
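
The recursive version is slow because it recomputes the same values many times. A possible middle ground, sketched below under the assumption that keeping results in memory is acceptable, is to cache them with `functools.lru_cache`. The name `fib3` is made up for this example, and the `@` line is a decorator, a topic covered later in these notes.

```python
from functools import lru_cache

@lru_cache(maxsize=None)   # remember every result already computed
def fib3(n):
    """Fib with recursion and memoization."""
    # base case
    if n == 0 or n == 1:
        return 1
    # recursive case: each value is computed only once thanks to the cache
    return fib3(n - 1) + fib3(n - 2)

print([fib3(i) for i in range(10)])   # [1, 1, 2, 3, 5, 8, 13, 21, 34, 55]
print(fib3.cache_info())              # hit/miss statistics of the cache
```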


## Iterators

Iterators represent streams of values. Because only one value is consumed at a time, they use very little memory. Use of iterators is very helpful for working with data sets too large to fit into RAM.

In [51]:
# Iterators can be created from sequences with the built-in function iter()

xs = [1,2,3]
x_iter = iter(xs)

print (next(x_iter))
print (next(x_iter))
print (next(x_iter))
#print (next(x_iter))  # raises StopIteration, since the iterator is exhausted

1
2
3


In [52]:
# Most commonly, iterators are used (automatically) within a for loop
# which terminates when it encounters a StopIteration exception

x_iter = iter(xs)
for x in x_iter:
    print (x)

1
2
3
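

To illustrate the "stream of values" idea, here is a small sketch of a generator function, which is one common way to write your own iterator. The function `squares` is hypothetical; it produces values one at a time, so even an endless stream takes almost no memory.

```python
import itertools

def squares():
    """Yield 0, 1, 4, 9, ... forever, one value at a time."""
    n = 0
    while True:
        yield n * n
        n += 1

# islice takes a finite piece of the (infinite) stream
first_five = list(itertools.islice(squares(), 5))
print(first_five)   # [0, 1, 4, 9, 16]

# a large range is consumed lazily: no list of a million items is ever built
even_total = sum(x for x in range(10**6) if x % 2 == 0)
print(even_total)
```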


## More on comprehensions

In [53]:
# A generator expression

print ((x for x in range(10)))

# A list comprehension

print ([x for x in range(10)])

# A set comprehension

print ({x for x in range(10)})

# A dictionary comprehension

print ({x: x for x in range(10)})

<generator object <genexpr> at 0x7f91e0451a98>
[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
{0, 1, 2, 3, 4, 5, 6, 7, 8, 9}
{0: 0, 1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9}
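
Comprehensions can also carry a condition, which makes them an alternative to combining `map` and `filter`. The sketch below redefines `square` and `is_even` so that it runs on its own, and also shows that a generator expression, unlike a list, can be consumed only once.

```python
def square(x):
    return x * x

def is_even(x):
    return x % 2 == 0

with_hofs = list(map(square, filter(is_even, range(5))))
with_comprehension = [x * x for x in range(5) if x % 2 == 0]

print(with_hofs)            # [0, 4, 16]
print(with_comprehension)   # [0, 4, 16]

# a generator expression can be consumed only once
gen = (x * x for x in range(5))
first_pass = sum(gen)
second_pass = sum(gen)
print(first_pass)    # 30
print(second_pass)   # 0, the generator is already exhausted
```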


## Useful Modules

You may want to have a look at the content of the following modules for further usage of (HO) functions:
  - [operator](https://docs.python.org/3/library/operator.html)
  - [functools](https://docs.python.org/3/library/functools.html)
  - [itertools](https://docs.python.org/3/library/itertools.html)
  - [toolz](https://pypi.org/project/toolz/)
  - [funcy](https://pypi.org/project/funcy/)

## Decorators

Decorators are a type of HOF that take a function and return a wrapped function that provides additional useful properties.

Examples:

  - logging
  - profiling
  - Just-In-Time (JIT) compilation

In [13]:
# applying a decorator manually, without the @ syntax
def my_decorator(func):
    def wrapper():
        print("Something is happening before the function is called.")
        func()
        print("Something is happening after the function is called.")
    return wrapper

def say_whee():
    print("Whee!")

say_whee = my_decorator(say_whee)


In [10]:
say_whee

<function __main__.my_decorator.<locals>.wrapper()>

Python allows you to use decorators in a simpler way with the @ symbol, sometimes called the “pie” syntax.

In [3]:
def my_decorator(func):
    def wrapper():
        print("Something is happening before the function is called.")
        func()
        print("Something is happening after the function is called.")
    return wrapper

@my_decorator
def say_whee():
    print ("Whee!")

In [4]:
say_whee()

Something is happening before the function is called.
Whee!
Something is happening after the function is called.
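
The `wrapper` above takes no arguments and discards the return value of `func`, so it only works for functions like `say_whee`. A more general pattern, sketched here with a hypothetical profiling decorator called `timed`, forwards `*args` and `**kwargs`, returns the result, and uses `functools.wraps` so that the decorated function keeps its own name and docstring.

```python
import functools
import time

def timed(func):
    """Report how long each call to func takes (illustrative profiling decorator)."""
    @functools.wraps(func)            # keeps func's name and docstring on the wrapper
    def wrapper(*args, **kwargs):     # accepts whatever arguments func accepts
        start = time.perf_counter()
        result = func(*args, **kwargs)
        elapsed = time.perf_counter() - start
        print("%s took %.6f s" % (func.__name__, elapsed))
        return result                 # pass the return value through
    return wrapper

@timed
def slow_sum(n):
    """Sum the integers below n."""
    return sum(range(n))

print(slow_sum(100000))     # 4999950000, preceded by the timing line
print(slow_sum.__name__)    # slow_sum
```

Without `functools.wraps`, `slow_sum.__name__` would be `wrapper`, as in the `say_whee` output shown earlier.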


# Classes and Objects

Old school object-oriented programming is possible and often used in Python. Classes are defined similarly to other object-oriented languages, with similar functionality.

The main Python doc [page](https://docs.python.org/3.6/tutorial/classes.html) is worth reading through.

In [58]:
class Pet:
    # the "constructor"
    def __init__(self, name, age):  # initialize the attributes of the instance
        self.name=name
        self.age=age
    # methods (functions defined in a class) take the "self" parameter !!!
    def set_name(self,name):
        self.name=name
    def convert_age(self,factor):
        self.age*=factor

buddy=Pet("buddy",12)
print (buddy.name, buddy.age)
buddy.age=3 # direct attribute assignment, equivalent to a setter function
print (buddy.age)



buddy 12
3


In [60]:
# inheritance is straightforward
class Dog(Pet):
    # the following variable is a class attribute, i.e. it is shared by all "Dog" objects
    species = "mammal"
    # methods can be overridden as usual
    def convert_age(self):
        self.age*=7
    def set_species(self, species):
        self.species = species
        
puppy=Dog("tobia",10)
print(puppy.name)
puppy.convert_age()
print(puppy.age)



tobia
70
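
Two details of the `Dog` class are worth a closer look; the sketch below is a self-contained variation on it. First, assigning `self.species` inside a method creates an instance attribute that hides the shared class attribute for that object only. Second, a subclass that needs its own constructor can reuse the parent one through `super()`. The extra `breed` attribute is made up for this illustration.

```python
class Pet:
    def __init__(self, name, age):
        self.name = name
        self.age = age

class Dog(Pet):
    species = "mammal"                  # class attribute, shared by all Dog objects

    def __init__(self, name, age, breed):
        super().__init__(name, age)     # reuse the parent constructor
        self.breed = breed              # "breed" is a made-up extra attribute

    def set_species(self, species):
        self.species = species          # creates an instance attribute on this object only

a = Dog("tobia", 10, "beagle")
b = Dog("rex", 3, "boxer")

a.set_species("canine")
print(a.species)            # canine
print(b.species)            # mammal
print(Dog.species)          # mammal
print(isinstance(a, Pet))   # True
```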
