# Laboration 2

---
**Student:**  jakfj222 (Jakob Fjellström)

**Student:** ghijk456

---

# Introduction 
In this first part of the lab, we will be exploring 
* Functions
    * How functions are called.
    * Argument passing
    * Return values.
* Function usage
    * Construction of simple multi-function programs.
    * Functions that work on several kinds of inputs (i.e. simple polymorphism via duck typing).

Additionally, we will touch upon
* Exceptions
* Simple assertion testing and debugging.

This lab might require you to search for information on your own to a larger extent than in lab 1. As in the last lab, Lutz' Learning Python and the [official documentation](https://docs.python.org) might be helpful. Also make sure to make use of the available lab assistance!

# A note on rules

Please make sure to conform to the (previously mentioned) [IDA lab rules](https://www.ida.liu.se/~732A74/labs/index.en.shtml).

## Functions in Python

a) Write a function that takes a radius and returns the area of a circle with that radius. What would be a good name for the function and the argument? Python has a value for $\pi$ in a certain standard library module. Which might that be? Don't type in the constant yourself.

In [1]:
import math
def Area(r):
    """Return the area of a circle with radius r."""
    return math.pi * r**2

[Hint: Google. Or consider modules we have `import`ed previously.]

b) How would you call the function, if you wanted to calculate the area of a circle with radius 10cm?

In [2]:
Area(10)

314.1592653589793

c) How would you call the function using named arguments/keyword arguments?

In [3]:
Area(r=10)

314.1592653589793

[Note: In this case, calling the function this way is somewhat artificial. When writing scripts or working with programs that take several parameters, this style can be quite useful. It sidesteps the question of whether a particular library takes the input or the output as the first argument, or the like. The code of course becomes more verbose.]
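
As a small illustration of the note above, the sketch below uses a hypothetical function `power` (not part of the lab) whose two parameters are easy to mix up. With keyword arguments the role of each value is visible at the call site, and the order of the arguments no longer matters.

```python
def power(base, exponent):
    """Hypothetical helper (not part of the lab): return base raised to exponent."""
    return base ** exponent

# Positional call: the reader must remember which argument comes first.
print(power(2, 10))

# Keyword call: the roles are explicit, and the order can be changed freely.
print(power(exponent=10, base=2))
```

Both calls compute the same value; only the readability at the call site differs.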

d) Write a function `circle_area_safe(radius)` which uses an if statement to check that the radius is positive and prints `The radius must be positive` to the screen if it is not, and otherwise calls the `circle_area` function. Also, if the radius is not positive the `circle_area_safe` function should signal to the code calling it that it has failed by returning `None`.

In [4]:
def circle_area_safe(radius):
    if radius < 0:
        print("The radius must be positive")
        return None
    return Area(r=radius)

e) Recreate the `circle_area_safe` function (call this version `circle_area_safer`) but instead of printing a message to the screen and returning `None` if the radius is negative, _raise_ a ValueError exception with suitable error message as argument.

In [5]:
def circle_area_safer(radius):
    if radius < 0:
        raise ValueError('Radius must be positive!')
    return Area(r=radius)
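
The two ways of signalling failure put different demands on the calling code. The sketch below is a minimal illustration; `area_or_none` and `area_or_raise` are hypothetical, simplified stand-ins for `circle_area_safe` and `circle_area_safer`, repeated here only so that the example runs on its own.

```python
import math

def area_or_none(radius):
    """Hypothetical stand-in for circle_area_safe: returns None on failure."""
    if radius < 0:
        return None
    return math.pi * radius**2

def area_or_raise(radius):
    """Hypothetical stand-in for circle_area_safer: raises ValueError on failure."""
    if radius < 0:
        raise ValueError("Radius must be positive!")
    return math.pi * radius**2

# With a None return value, the caller has to remember to check the result.
area = area_or_none(-1)
if area is None:
    print("No area could be computed")

# With an exception, a failure cannot be ignored by accident.
try:
    area = area_or_raise(-1)
except ValueError as error:
    print("Caught:", error)
```

If the `None` check is forgotten, the error typically shows up later and somewhere else (for instance as a `TypeError` in arithmetic), whereas the exception points directly at the failing call.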

f) To test out how functions are called in Python, create a function `print_num_args` that prints the number of arguments it has been called with. The count should not include keyword arguments.

In [6]:
def print_num_args(*args, **kwargs):
    # Keyword arguments are accepted (collected in kwargs) but deliberately not counted.
    print(len(args))

print_num_args("a", "b", "c")  # Should print the number 3.

3


g) Write a function `print_kwargs` that prints all the keyword arguments.

In [7]:
def print_kwargs(*args, **kwargs):
    print("The", len(args), "regular arguments are:")

    for position, value in enumerate(args):
        print(position, ":", value)

    # In Python 3.6 and later, kwargs keeps the order in which the arguments were passed.
    print("\nAnd the keyword arguments are:")
    for name, value in kwargs.items():
        print("{} is set to {}".format(name, value))
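
To see how a call is split between `*args` and `**kwargs`, a minimal sketch follows. The helper `describe_call` is hypothetical and simply hands back what it received.

```python
def describe_call(*args, **kwargs):
    """Hypothetical helper: return the positional and keyword arguments as received."""
    return args, kwargs

positional, keyword = describe_call("a", "b", colour="red", size=3)
print(positional)  # a tuple holding the positional arguments
print(keyword)     # a dict holding the keyword arguments
```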


h) Below we have a very simple program. Run the first cell. It will succeed. What happens when you run the second cell, and why? In particular, consider the error produced. What does it mean? What value has been returned from the function, and how would you modify the function in order for it to work?

In [8]:
def my_polynomial(x):
    """Return the number x^2 + 30x + 225."""
    print(x**2 + 30*x + 225)

polyval = my_polynomial(100)

13225


In [10]:
double_the_polyval = 2 * my_polynomial(100)

13225


TypeError: unsupported operand type(s) for *: 'int' and 'NoneType'

In [1]:
# Python raises "TypeError: unsupported operand type(s) for *: 'int' and 'NoneType'". This is because my_polynomial does not return a value;
# it only prints the (correct) value and then implicitly returns None. polyval is therefore None rather than a number, and 2 * None is unsupported.
# Replacing print(...) with return ... in my_polynomial makes the multiplication work.
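
A minimal sketch of the fix described in the answer: the function returns the value instead of printing it, and the caller decides whether to print.

```python
def my_polynomial(x):
    """Return the number x^2 + 30x + 225."""
    return x**2 + 30*x + 225

polyval = my_polynomial(100)
double_the_polyval = 2 * polyval
print(double_the_polyval)
```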

## Script/program construction (a tiny example)

Regardless of which programming language we use, we will likely construct programs or scripts that consist of several functions that work in concert. Below we will create a very simple Monte Carlo simulation as a basis for breaking down a larger (though small) problem into sensible, (re)usable discrete pieces. The resulting program will likely utilise control structures that you have read about before.

**Hint: read all of the subtasks related to this task before coding.**

a) The following is a well-known procedure for approximating $\pi$: pick $n$ uniformly randomly selected coordinates in a $2R\times 2R$ square. Count the number of the points that fall within the circle of radius $R$ with its center at $(R,R)$. The fraction of these points to the total number of points is used to approximate $\pi$ (exactly how is for you to figure out). (Note that this is not to be confused with MCMC.)

Write a program consisting of **several (aptly selected and named) functions**, that presents the user with the following simple text user interface. The <span style="background: yellow;">yellow</span> text is an example of user input (the user is prompted, and enters the value). It then prints the results of the simulations:

`pi_simulation()`

<p style="font-family: console, monospace">Welcome to the Monty Carlo PI program!</p>

<p style="font-family: console, monospace">
Please enter a number of points (or the letter "q" to quit): <span style="background: yellow;">100</span><br/>
Using 100 points we (this time) got the following value for pi: 3.08<br/>
This would mean that tau (2xPI) would be: 6.16
</p>

<p style="font-family: console, monospace">
Please enter a number of points (or the letter "q" to quit): <span style="background: yellow;">100</span><br/>
Using 100 points we (this time) got the following value for pi: 3.12<br/>
This would mean that tau (2xPI) would be: 6.24
</p>

<p style="font-family: console, monospace">
Please enter a number of points (or the letter "q" to quit): <span style="background: yellow;">q</span>
</p>

<p style="font-family: console, monospace">
Thank you for choosing Monty Carlo.
</p>

[**Note**: This is a task largely about program structure. Unless there are substantial performance drawbacks, prefer readability over optimisation.]

---
**REMEMBER: YOU DO NOT WRITE CODE FOR THE INTERPRETER. YOU WRITE IT FOR OTHER HUMAN READERS.**

---

An important part of programming is to allow a reader who is perhaps unfamiliar with the code to be able to understand it, and convince themselves that it is correct with respect to specification. There should also be as few surprises as possible.

In [19]:
import random

def Input():
    """Prompt for a number of points; return it as an int, or "q" to quit."""
    In = input("Please enter a number of points (or the letter 'q' to quit): ")
    if In == "q":
        return In
    return int(In)

def Generate_random_numbers(n, R):
    """Return n y-coordinates and n x-coordinates drawn uniformly from [-R, R]."""
    y = [random.uniform(-R, R) for _ in range(n)]
    x = [random.uniform(-R, R) for _ in range(n)]
    return y, x

def Approximate_pi(n):
    """Estimate pi from n random points; return (estimate, n, R)."""
    # The radius cancels out of the estimate, so any positive R works.
    # Here it is drawn at random from 1..14.
    R = random.randint(1, 14)
    Y, X = Generate_random_numbers(n, R)

    # Count the points that fall inside the circle of radius R centred at the origin.
    Circ = 0
    for x, y in zip(X, Y):
        if y**2 + x**2 < R**2:
            Circ += 1

    # Circle area / square area = pi/4, so pi is about 4 * (fraction inside).
    Pi_app = 4 * Circ / n
    return Pi_app, n, R


def pi_simulation():
    print("Welcome to the Monty Carlo PI program!")

    IP = Input()
    while IP != "q":
        Pi_app, n, R = Approximate_pi(IP)
        print("Using", n, "points and", R, "as radius we (this time) got the following value for pi:", round(Pi_app, 4))
        print("This would mean that tau (2xPI) would be:", round(2*Pi_app, 4), "\n")
        IP = Input()
    print("Thank you for choosing Monty Carlo.")

pi_simulation()


Welcome to the Monty Carlo PI program!
Please enter a number of points (or the letter 'q' to quit): 50
Using 50 points and 10 as radius we (this time) got the following value for pi: 3.04
This would mean that tau (2xPI) would be: 6.08 

Please enter a number of points (or the letter 'q' to quit): q
Thank you for choosing Monty Carlo.
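
The factor 4 in `Approximate_pi` can be motivated by a short derivation, assuming the points are uniformly distributed over the square. The probability that a point lands inside the circle equals the ratio of the two areas, so with $k$ of the $n$ points inside the circle:

$$\frac{k}{n} \approx \frac{\pi R^2}{(2R)^2} = \frac{\pi}{4} \quad\Longrightarrow\quad \pi \approx \frac{4k}{n}$$

The radius cancels, which is why the estimate does not depend on the value chosen for $R$.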


[Hint: You might want to consider the function `input`. Try it out and see what type of value it returns.]
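
Since `input` always returns a `str`, the text has to be converted before it can be used as a count, and that conversion can fail. The sketch below shows one way to separate the parsing from the prompting so that it can be tested without a user; `parse_point_count` is a hypothetical helper, and returning `None` for invalid input is an assumption rather than something the task prescribes.

```python
def parse_point_count(text):
    """Hypothetical helper: turn raw user input into "q", a positive int, or None."""
    text = text.strip()
    if text == "q":
        return "q"
    try:
        count = int(text)
    except ValueError:
        return None  # not a whole number, e.g. "abc" or "1.5"
    return count if count > 0 else None

print(type(parse_point_count("100")))
print(parse_point_count("abc"))
```

A prompting function could then call `parse_point_count(input(...))` in a loop and ask again whenever the result is `None`.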

b) One feature of Python's simplicity is the possibility to (comparatively) quickly produce code to try out our intuitions. Let's say we want to compare how well our approximation performs, as compared to some gold standard for pi (here: the version in the standard library). Run 100 simulations. How large is the maximum relative error (using the definition above) in this particular run of simulations, if each simulation has $n=10^4$ points? Is it larger or smaller than 5%? Write code that returns this maximum relative error.

In [24]:
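import math

def rel_error(n_simulations, n_points=10**4):
    """Return the maximum relative error (in percent) over n_simulations runs."""
    errors = []
    for _ in range(n_simulations):
        pi_app = Approximate_pi(n_points)[0]
        errors.append(abs((math.pi - pi_app) / math.pi) * 100)
    return max(errors)

erf = rel_error(100)
print(round(erf, 4))

# The maximum relative error in this run is about 1.73%, i.e. smaller than 5%.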

1.7318


[Note: This is only to show a quick way of testing out your code in a readable fashion. You might want to try to write it in a pythonic way. But in terms of performance, it is very likely that the true bottleneck will still be the approximation function itself.]
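
One possible more compact formulation is sketched below. It assumes that the relative error is defined as $|\hat{\pi} - \pi| / \pi$, and it uses a hypothetical stand-in `estimate_pi` with a fixed unit radius so that the example runs on its own. It is not meant as the definitive version.

```python
import math
import random

def estimate_pi(n_points):
    """Hypothetical stand-in for the lab's approximation function (unit radius)."""
    # Each comparison is True (counted as 1) when the point lies inside the unit circle.
    inside = sum(
        random.uniform(-1, 1)**2 + random.uniform(-1, 1)**2 < 1
        for _ in range(n_points)
    )
    return 4 * inside / n_points

def max_relative_error(n_simulations, n_points):
    """Largest value of |estimate - pi| / pi over the simulations (as a fraction)."""
    return max(
        abs(estimate_pi(n_points) - math.pi) / math.pi
        for _ in range(n_simulations)
    )

print(max_relative_error(100, 10**4))
```

The generator expressions avoid building intermediate lists, but as the note says, almost all of the time is still spent drawing random points.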

## Fault/bugspotting and tests in a very simple setting

It is inevitable that we will make mistakes when programming. An important skill is not only to be able to write code in the first place, but also to be able to figure out where one would start looking for faults. This also involves being able to make the expectations we have on the program more explicit, and at the very least construct some sets of automatic "sanity checks" for the program. The latter will likely not be something done for every piece of code you write, but it is highly useful for code that might be reused or is hard to understand (due either to programming reasons, or because the underlying mathematics is dense). When rewriting or optimising code, having such tests is also highly useful to provide hints that the changes haven't broken the code.

**Task**: The following program is supposed to return the sum of the squares of numbers $0,...,n$.

In [25]:
# Do not modify this code! You'll fix it later.

def update_result(result, i):
    result = result + i*i
    return result

def sum_squares(n):
    """Return the sum of squares 0^2 + 1^2 + ... + (n-1)^2 + n^2."""
    result = 0
    for i in range(n):
        result = update_result(n, result)

a) What mistakes has the programmer made when trying to solve the problem? Name the mistakes in coding or thinking about the issue that you notice (regardless of if they affect the end result). In particular, write down what is wrong (not just "line X should read ..."; fixing the code comes later). Feel free to make a copy of the code (pressing `b` in a notebook creates a new cell below) and try it out, add relevant print statements, assertions or anything else that might help. Note down how you spotted the faults.

In [None]:
"""
First, what do the two functions actually do? sum_squares(n) sets result to 0 and loops over i in range(n). In each iteration it is
supposed to add i^2 to result by calling update_result(result, i). Tracing n = 2 shows what happens instead: the call made is
update_result(n, result), so the first iteration gives result = 2 + 0*0 = 2 and the second iteration gives result = 2 + 2*2 = 6.
The correct sum 0^2 + 1^2 + 2^2 is 5, not 6. For n = 6 the intermediate result becomes very large (6, 42, 1770, 3132906, ...).

So what's wrong?
 - The arguments in the call to update_result are swapped and wrong: n is passed as result and result is passed as i,
   so the loop index i is never used.
 - range(n) stops at n-1, so even with the right arguments the term n^2 would be missing. The loop should use range(n+1).
 - sum_squares has no return statement, so it always returns None, regardless of the value computed in result.
"""

b) Write a few simple assertions that should pass if the code was correct. Don't forget to include the *why* of the test, preferably in the error message provided in the `AssertionError` if the test fails.

In [26]:
def test_sum_squares():
    # Format: assert [condition], message
    assert sum_squares(0) == 0, "sum_squares(0) should be 0: the only term is 0^2"
    assert sum_squares(1) == 1, "sum_squares(1) should be 1: the upper bound n must be included (0^2 + 1^2)"
    assert sum_squares(5) == 55, "sum_squares(5) should be 55 (0 + 1 + 4 + 9 + 16 + 25)"
    assert sum_squares("Hey") is None, "non-integer input should be signalled by returning None"

    print("--- test_sum_squares finished successfully")

Hint: might there be any corner/edge cases here?

c) Write a correct version of the code, which conforms to the specification.

In [27]:
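def sum_squares(n):
    """Return the sum of squares 0^2 + 1^2 + ... + (n-1)^2 + n^2, or None if n is not an integer."""
    if not isinstance(n, int):
        return None
    result = 0
    for i in range(n + 1):  # n + 1 so that n^2 itself is included
        result = update_result(result, i)
    return result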

[Note: This is a rather primitive testing strategy, but it is sometimes enough. If we wanted to provide more advanced testing facilities, we might e.g. use a proper unit test framework, or use tools to do property based testing. This, as well as formal verification, is outside the scope of this course. The interested reader is referred to [pytest](https://docs.pytest.org/en/latest/) or the built-in [unittest](https://docs.python.org/3/library/unittest.html).

Those interested in testing might want to consult the web page for the IDA course [TDDD04 Software testing](https://www.ida.liu.se/~TDDD04/) or the somewhat abbreviation-heavy book by [Ammann & Offutt](https://cs.gmu.edu/~offutt/softwaretest/), which apparently also features video lectures.]
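
As a lightweight step towards checking a property rather than a handful of hand-picked values, the sum can be compared with the closed-form expression $n(n+1)(2n+1)/6$ for a range of inputs. The sketch below is self-contained. `sum_squares_reference` is a hypothetical stand-in for a corrected `sum_squares`, and the range of 50 values is an arbitrary choice.

```python
def sum_squares_reference(n):
    """Sum of squares 0^2 + ... + n^2, computed term by term (stand-in for the fixed function)."""
    return sum(i * i for i in range(n + 1))

def closed_form(n):
    """The well-known formula n(n+1)(2n+1)/6 for the same sum."""
    return n * (n + 1) * (2 * n + 1) // 6

# The two independent computations should agree for every n tested.
for n in range(50):
    assert sum_squares_reference(n) == closed_form(n), f"mismatch for n={n}"
print("--- closed-form check finished successfully")
```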

## Polymorphic behaviour (via duck typing)

In Python we often write functions that can handle several different types of data. A common pattern is writing code which is expected to work with several types of collections of data, for instance. This expectation is however in the mind of the programmer (at least without type annotations), and not something that the interpreter will enforce until runtime. This provides a lot of flexibility, but also requires us to understand what our code means for the different kinds of input. Below we try this out, and in particular return to previously known control structures.

a) Write a function `last_idx` that takes two arguments `seq` and `elem` and returns the index of the last occurrence of the element `elem` in the iterable `seq`. If the sequence doesn't contain the element, return -1. (You may not use built-ins like .find() here.)

In [28]:
def last_idx(seq, elem):
    ind = -1
    for position, item in enumerate(seq):
        if item == elem:
            ind = position
    return ind

b) What does your function require of the input? Also include if it would work with a string, a list or a dictionary. In the latter case, what would `elem` be matched against? What will `last_idx("cat", "a cat catches fish")` yield?

In [None]:
"""
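    The function requires that seq is iterable and that its items can be compared with elem using ==.
    It therefore works with strings, lists and dictionaries. For a string, elem is matched against single characters.
    For a dictionary, elem is matched against the keys (in insertion order), not the values.
    last_idx("cat", "a cat catches fish") yields -1: the arguments are in the wrong order, so the whole sentence is
    compared with each of the characters "c", "a" and "t", and none of them is equal to it.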
"""

c) Add some `assert`-style tests that your code should satisfy. For each test, provide a description of what it tests, and why. That can be made as part of the assert statement itself.

In [29]:
def test_last_idx():
    assert last_idx([1,2,3,2], 2) == 3, "last_idx should return last index, for sequences with several occurrences"
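    assert last_idx("Hello my friend", "e") == 12, "strings are matched character by character; the last 'e' is at index 12"
    assert last_idx({"R":5000, "C++":50, "Python": -5}, "R") == 0, "dictionaries are matched against their keys, in insertion order"
    assert last_idx([1,2,3,1,4,2,12,6], 2) == 5, "the last occurrence need not be the final item of the list"
    assert last_idx([1,2,3,1,4,2,12,6], 22) == -1, "a missing element should give -1"
    assert last_idx([1,2,3,1,4,2,12,6], "e") == -1, "an element of another type never matches, so -1 is expected"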
    
    
    print("--- test_last_idx finished successfully")
        
test_last_idx()

--- test_last_idx finished successfully


The fact that a program doesn't crash when given a certain input doesn't necessarily ensure that the results are what we expect. Thus we need to get a feel for how e.g. iteration over different types of data behaves, in order to understand how our function behaves.
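
A quick way to get that feel is to look at what iteration actually yields for each type. The sketch below repeats the logic of `last_idx` so that it runs on its own; the example values are arbitrary.

```python
def last_idx(seq, elem):
    """Same logic as the function above, repeated so the example is self-contained."""
    ind = -1
    for position, item in enumerate(seq):
        if item == elem:
            ind = position
    return ind

print(list("cat"))                   # a string yields its characters
print(list({"R": 5000, "C++": 50}))  # a dict yields its keys, not its values

# None of these calls crash, but only the last one finds a match.
print(last_idx({"R": 5000, "C++": 50}, 5000))
print(last_idx("cat", "a cat catches fish"))
print(last_idx("a cat catches fish", "c"))
```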

d) Can we use `last_idx` with a text file? What would the program try to match `elem` against? What would the return value signify (e.g. number of words from the start of the file, lines from the start of the file, bytes read...)?

In [32]:
with open("shakespeare.txt", "r", encoding='utf-8') as shake:
    read = shake.read()
    print(last_idx(read, "a"))


# Can we use last_idx with a text file?
# - Yes. Here the contents are first read into a string via shake.read(). Passing the file handle itself also works,
#   since a file handle is iterable, but it then yields whole lines rather than characters.

# What would the program try to match elem against?
# - last_idx loops through 'seq' and compares elem with every item that 'seq' yields.
#   Concretely, the elem 'a' is matched against every character in the string read from shakespeare.txt.
#   With the file handle as 'seq', elem would instead be compared with each line, including its trailing newline.

# What would the return value signify (e.g. number of words from the start of the file, lines from the start of the file, bytes read...)?
# - If some character matches elem, the return value is the index of the last match, i.e. the number of characters
#   (not bytes, since the text has been decoded) that precede it in the file. If there is no match, it returns -1.
#   With the file handle as 'seq', it would instead be the number of lines that precede the last matching line.


5657703


[Hint: Try it out! Open a file like in lab 1, using a `with` statement, and pass the file handle to the function. What is the easiest way for you to check what the function is comparing?]
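
A minimal sketch of the experiment suggested in the hint, using a small temporary file with made-up content in place of `shakespeare.txt`. `last_idx` is repeated so that the example is self-contained.

```python
import os
import tempfile

def last_idx(seq, elem):
    """Same logic as in the lab answer, repeated so the example is self-contained."""
    ind = -1
    for position, item in enumerate(seq):
        if item == elem:
            ind = position
    return ind

# Write a small temporary file (hypothetical content, standing in for shakespeare.txt).
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False, encoding="utf-8") as tmp:
    tmp.write("to be\nor not\nto be\n")
    path = tmp.name

with open(path, "r", encoding="utf-8") as handle:
    # Iterating over the handle yields whole lines, including the trailing "\n".
    line_index = last_idx(handle, "to be\n")

with open(path, "r", encoding="utf-8") as handle:
    # Iterating over the string from read() yields single characters.
    char_index = last_idx(handle.read(), "b")

os.remove(path)
print(line_index, char_index)
```

The same function thus returns a line index in the first case and a character index in the second, purely as a consequence of what the iterable yields.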

### Attribution

Lab created by Anders Märak Leffler (2019), using some material by Johan Falkenjack. Feel free to reuse the material, but do so with attribution. License [CC-BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/).