# Exercise 1: Math Drills

Give an example of a binary relation on a set that is:

1. Reflexive and symmetric, but not transitive  
2. Reflexive, but neither symmetric nor transitive  
3. Symmetric, but neither reflexive nor transitive  
4. Transitive, but neither reflexive nor symmetric  

Recall the definitions from the lectures if you need to!
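As a way to check candidate answers, the sketch below tests the three properties on a small finite set. Recall that reflexivity requires $(a, a)$ for every $a$, symmetry requires $(b, a)$ whenever $(a, b)$ is present, and transitivity requires $(a, c)$ whenever $(a, b)$ and $(b, c)$ are present. The set `{1, 2, 3}`, the helper names, and the four example relations are illustrative assumptions, not official solutions.

```python
# Hypothetical helpers for checking properties of a relation R on a finite set S.
# R is represented as a set of ordered pairs (a, b).

def is_reflexive(S, R):
    return all((a, a) in R for a in S)

def is_symmetric(R):
    return all((b, a) in R for (a, b) in R)

def is_transitive(R):
    return all((a, d) in R
               for (a, b) in R
               for (c, d) in R
               if b == c)

S = {1, 2, 3}
diagonal = {(a, a) for a in S}

# One candidate answer per drill (assumed examples; many others work)
examples = {
    "1. reflexive, symmetric, not transitive": diagonal | {(1, 2), (2, 1), (2, 3), (3, 2)},
    "2. reflexive only": diagonal | {(1, 2), (2, 3)},
    "3. symmetric only": {(1, 2), (2, 1)},
    "4. transitive only": {(1, 2)},
}

for name, R in examples.items():
    print(name, is_reflexive(S, R), is_symmetric(R), is_transitive(R))
```

Each printed row shows the reflexive, symmetric, and transitive checks in that order.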

# Exercise 2: A bunch of Math!

## Polynomial

Consider the polynomial

$$
p(x)
= a_0 + a_1 x + a_2 x^2 + \cdots + a_n x^n
= \sum_{i=0}^n a_i x^i \tag{1}
$$

Write a function `p` such that `p(x, coeff)` computes the value of the polynomial in (1), given a point `x` and a list of coefficients `coeff`, where `coeff[i]` holds $a_i$.

```
p(5, [1, 1]) = 1 + 5 = 6
p(5, [2, 1, 1]) = 2 + 5 + 25 = 32
```

In [7]:
def p(x, coeff):
    # coeff[i] is the coefficient a_i of x**i
    value = 0
    for i in range(len(coeff)):
        value += coeff[i]*(x**i)
    return value
print(p(5, [1,1]))
print(p(5, [2,1,1]))

6
32
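An alternative worth knowing is Horner's method, which rewrites (1) as $a_0 + x(a_1 + x(a_2 + \cdots))$ and so avoids computing each power of `x` separately. The function name `p_horner` is hypothetical; this is a sketch of the technique, not the expected solution.

```python
def p_horner(x, coeff):
    """Evaluate a_0 + a_1 x + ... + a_n x^n, with coeff = [a_0, a_1, ..., a_n]."""
    value = 0
    # Work from the highest-degree coefficient down to a_0
    for a in reversed(coeff):
        value = value * x + a
    return value

print(p_horner(5, [1, 1]))     # 1 + 5
print(p_horner(5, [2, 1, 1]))  # 2 + 5 + 25
```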


## Variance

Define a function named `var` that takes a list of numbers and computes the variance. The variance is:

$$variance(x) = \frac{\sum_{i=1}^N (x_i - average(x))^2}{N-1}$$

Don't cheat by using `numpy.var`! Use that function only to check that yours is correct. Note that `numpy.var` divides by $N$ by default; pass `ddof=1` to compare against the $N-1$ formula above.

In [33]:
import numpy as npy

# Why the two printed values differ: npy.var defaults to ddof=0 and divides by N,
# while var() below divides by N - 1. npy.var(l, ddof=1) uses the N - 1 denominator.
def var(numbers):
    N = len(numbers)
    avge = npy.mean(numbers)
    def numerator():
        value = 0
        for number in numbers: value += (number - avge)**2
        return value
    return numerator() / (N - 1)

l = [2, 4, 6, 3, 5, 4, 5, 3, 2, 6, 2, 5, 4]

print(npy.var(l))
print(var(l))
        

1.9171597633136095
2.0769230769230766
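The two printed values above differ because they use different denominators: `numpy.var` divides by $N$ unless `ddof=1` is passed, whereas the formula in this exercise divides by $N-1$. The sketch below illustrates this with the standard library's `statistics` module instead of NumPy; `variance_with_ddof` is a hypothetical helper name.

```python
import statistics

def variance_with_ddof(numbers, ddof=1):
    """Sum of squared deviations from the mean, divided by (N - ddof)."""
    n = len(numbers)
    mean = sum(numbers) / n
    return sum((x - mean) ** 2 for x in numbers) / (n - ddof)

l = [2, 4, 6, 3, 5, 4, 5, 3, 2, 6, 2, 5, 4]

# ddof=1 gives the sample variance (N - 1), ddof=0 the population variance (N)
print(variance_with_ddof(l, ddof=1), statistics.variance(l))
print(variance_with_ddof(l, ddof=0), statistics.pvariance(l))
```

Each line prints the hand-written result next to the library value for the same denominator, so the pairs should agree up to floating-point rounding.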


## RMSE

Calculate the root mean squared error (RMSE) of a machine learning model's output. The function takes in two lists: one with actual values, one with predictions. The formula for RMSE is:

$$RMSE(y_1, y_2) = \sqrt{\dfrac{1}{N} \sum_{i=1}^N (y_{1i} - y_{2i})^2}$$

```
    rmse([1, 2], [1, 2]) = 0
    rmse([1, 2, 3], [3, 2, 1]) = 1.63
```

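To test your function, you can use

```
sklearn.metrics.mean_squared_error(y_actual, y_predicted, squared=False)
```

Note that the `squared` argument is deprecated in recent scikit-learn releases, which provide `sklearn.metrics.root_mean_squared_error(y_actual, y_predicted)` instead.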

In [36]:
import math

def rmse(actuals, predics):
    N = len(predics)
    # mismatched lengths: return -1 as a sentinel (a real RMSE is never negative)
    if len(predics) != len(actuals): return -1 
    def square():
        sums = 0
        for i in range(N): sums += (actuals[i] - predics[i])**2
        return sums
    return math.sqrt(square() / N)

print(rmse([1, 2], [1, 2]))
print(rmse([1, 2, 3], [3, 2, 1]))

rmse([1, 2, 4, 6], [3, 2, 1])  # lengths differ, so this returns -1
            

0.0
1.632993161855452


-1
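Returning `-1` for mismatched lengths can be mistaken for a real result by calling code. A variant that raises an exception instead might look like the following; the name `rmse_strict` and the choice of `ValueError` are assumptions made for illustration.

```python
import math

def rmse_strict(actuals, predictions):
    """RMSE of two equal-length, non-empty sequences; raises ValueError otherwise."""
    if len(actuals) != len(predictions):
        raise ValueError("actuals and predictions must have the same length")
    if len(actuals) == 0:
        raise ValueError("inputs must not be empty")
    # Pair up the values and square each difference
    squared_errors = [(a - p) ** 2 for a, p in zip(actuals, predictions)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

print(rmse_strict([1, 2], [1, 2]))
print(round(rmse_strict([1, 2, 3], [3, 2, 1]), 2))

try:
    rmse_strict([1, 2, 4, 6], [3, 2, 1])
except ValueError as err:
    print("error:", err)
```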

## Jaccard Similarity

The Jaccard similarity between two sets is the size of their intersection divided by the size of their union. Write a function that computes it:

$$jaccard(A, B) = \dfrac{|A \cap B|}{|A \cup B|}$$


```
jaccard({'a', 'b', 'c'}, {'a', 'd'}) = 1 / 4
```



In [38]:
def jaccard(s1, s2): return len(s1.intersection(s2)) / len(s1.union(s2))

print(jaccard({'a', 'b', 'c'}, {'a', 'd'}))

0.25
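The one-liner above divides by zero when both sets are empty. A sketch that handles that case, using the set operators `&` and `|`, is shown here. Returning `1.0` for two empty sets is a convention assumed for this example, not something the exercise specifies, and `jaccard_safe` is a hypothetical name.

```python
def jaccard_safe(s1, s2):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    union = s1 | s2
    if not union:
        # Assumed convention: two empty sets are treated as identical
        return 1.0
    return len(s1 & s2) / len(union)

print(jaccard_safe({'a', 'b', 'c'}, {'a', 'd'}))
print(jaccard_safe(set(), set()))
```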


# Exercise 3

First, write a function that returns one realization of the following random device:

1. Flip an unbiased coin 10 times.  
1. If a head occurs `k` or more times consecutively within this sequence at least once, pay one dollar.  
1. If not, pay nothing.  


Second, write another function that does the same task except that the second rule of the above random device becomes

- If a head occurs `k` or more times within this sequence, pay one dollar.  


Use no import besides `from numpy.random import uniform`.

In [58]:
from numpy.random import uniform

def is_heads(number): return number > 0.5

"""
Convention: 
- flip values <=0.5 are considered tails
- flip values > 0.5 are considered heads
"""
def rand_thing(k):
    # a run of more than 10 heads is impossible in 10 flips
    if k < 0 or k > 10: return "Paid nothing"
    # map each flip to 1 (heads) or 0 (tails) to simplify the tests
    flips = [ (1 if is_heads(flip) else 0) for flip in uniform(0, 1, 10) ]
    print("Flips:", flips)
    # the shortest run of heads that earns the payout
    sublist = [1] * k
    l1 = len(flips)
    # only start indices i with i + k <= l1 leave room for a full window of k flips;
    # this includes the final window flips[l1-k:l1], which must also be checked
    for i in range(l1 - k + 1):
        if flips[i:i+k] == sublist: return "Paid a Dollar"
    return "Paid nothing"
            
        
print(rand_thing(2))

Flips: [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
Paid a Dollar
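The second variant of the device, where the `k` heads need not be consecutive, is not implemented in the cell above. A possible sketch of both variants follows. It returns the payout as `1` or `0` rather than a message, and the names `flip_coins`, `longest_run`, `payout_consecutive`, and `payout_total` are hypothetical. The optional `flips` argument is an addition that makes the payout rules checkable on a fixed sequence.

```python
from numpy.random import uniform

def flip_coins(n=10):
    """Return n fair coin flips as a list of 1 (heads) and 0 (tails)."""
    return [1 if u > 0.5 else 0 for u in uniform(0, 1, n)]

def longest_run(flips):
    """Length of the longest run of consecutive heads."""
    best = current = 0
    for flip in flips:
        current = current + 1 if flip == 1 else 0
        best = max(best, current)
    return best

def payout_consecutive(k, flips=None):
    """Pay 1 if heads occurs k or more times in a row, else 0."""
    flips = flip_coins() if flips is None else flips
    return 1 if longest_run(flips) >= k else 0

def payout_total(k, flips=None):
    """Pay 1 if heads occurs k or more times in total, else 0."""
    flips = flip_coins() if flips is None else flips
    return 1 if sum(flips) >= k else 0

# Fixed sequence: longest run of heads is 3, total number of heads is 4
sample = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
print(payout_consecutive(3, sample), payout_consecutive(4, sample))
print(payout_total(4, sample), payout_total(5, sample))
```

Calling `payout_consecutive(k)` or `payout_total(k)` without `flips` draws a fresh sequence of 10 flips, which gives one realization of the device.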


# Exercise 4: Logistic Map fixed point

The **Logistic Map** is a famous function from Chaos Theory which is defined as:

$$x_{t+1} = r \cdot x_t (1 - x_t)$$

with the conditions:

$$x_0 \in [0,1], \quad r \in [0,4]$$

Write a lambda $f = logistic(x, r)$ that is applied to its own output $n$ times by a second function `logistic_n_times(x0, f, r, n)`, starting from the initial point $x_0$.

Make a few runs of this for various values of `x0` and `r`. Answer the following:

- Can you find a fixed point? 

- At what values of `r` are there fixed points? 

- Are there any ranges of input for which a fixed point acts as an attractor?
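For orientation, setting $x = r x (1 - x)$ gives two candidate fixed points, $x^* = 0$ and $x^* = 1 - 1/r$ (the latter lies in $[0, 1]$ only for $r \ge 1$). The sketch below iterates the map using the `logistic_n_times(x0, f, r, n)` signature from the exercise text and compares the result with these values. The starting point `0.2`, the iteration count, and the sample `r` values are arbitrary choices for illustration.

```python
# The lambda requested by the exercise
logistic = lambda x, r: r * x * (1 - x)

def logistic_n_times(x0, f, r, n):
    """Apply f to x0 exactly n times and return the final value."""
    x = x0
    for _ in range(n):
        x = f(x, r)
    return x

for r in [0.5, 1.5, 2.5, 3.2]:
    x = logistic_n_times(0.2, logistic, r, 500)
    # x is (numerically) a fixed point if one more application leaves it unchanged
    is_fixed = abs(logistic(x, r) - x) < 1e-9
    print(f"r={r}: x after 500 steps = {x:.6f}, "
          f"1 - 1/r = {1 - 1 / r:.6f}, fixed point reached: {is_fixed}")
```

In these runs the iterates settle on 0 for `r = 0.5`, on $1 - 1/r$ for `r = 1.5` and `r = 2.5`, and keep alternating between two values for `r = 3.2`, which is consistent with the nonzero fixed point being attracting for $1 < r < 3$.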

In [28]:
'''
Methods from 
https://www.reddit.com/r/learnpython/comments/zzh28/a_simple_python_implementation_of_the_logistic_map/
'''

from random import randint

iterations = 10   # Number of iterations per point
seed = 0.5        # Seed value for x in (0, 1)
spacing = .01     # Spacing between points on domain (r-axis)
res = 8           # Largest n-cycle visible

# Argument order is (x, r), matching the call in logistic_n_times
def logistic(x, r): return r * x * (1 - x)

# Return the nth iterate of logistic(x, r)
def logistic_n_times(n, x, r):
    for i in range(n):
        x = logistic(x, r)
    return x

# r_range is a (low, high) pair; one x value is sampled per r in [low, high)
def gen_lists(xo, r_range):
    rlist = []
    xlist = []
    for r in [i * spacing for i in range(int(r_range[0]/spacing), int(r_range[1]/spacing))]:
        rlist.append(r)
        # randint needs integer bounds, hence res // 2 instead of res / 2
        n = randint(iterations - res // 2, iterations + res // 2)
        xlist.append(logistic_n_times(n, xo, r))
    return {"rlist": rlist, "xlist": xlist}

# The original call omitted the r range. (1, 4) is an assumption, chosen because
# the printed rlist starts at 1.0 and r is restricted to [0, 4].
print(gen_lists(seed, (1, 4)))
        

{'rlist': [1.0, 1.01, 1.02, 1.03, 1.04, 1.05, 1.06, 1.07, 1.08, 1.09, 1.1, 1.11, 1.12, 1.1300000000000001, 1.1400000000000001, 1.1500000000000001, 1.16, 1.17, 1.18, 1.19, 1.2, 1.21, 1.22, 1.23, 1.24, 1.25, 1.26, 1.27, 1.28, 1.29, 1.3, 1.31, 1.32, 1.33, 1.34, 1.35, 1.36, 1.37, 1.3800000000000001, 1.3900000000000001, 1.4000000000000001, 1.41, 1.42, 1.43, 1.44, 1.45, 1.46, 1.47, 1.48, 1.49, 1.5, 1.51, 1.52, 1.53, 1.54, 1.55, 1.56, 1.57, 1.58, 1.59, 1.6, 1.61, 1.62, 1.6300000000000001, 1.6400000000000001, 1.6500000000000001, 1.6600000000000001, 1.67, 1.68, 1.69, 1.7, 1.71, 1.72, 1.73, 1.74, 1.75, 1.76, 1.77, 1.78, 1.79, 1.8, 1.81, 1.82, 1.83, 1.84, 1.85, 1.86, 1.87, 1.8800000000000001, 1.8900000000000001, 1.9000000000000001, 1.9100000000000001, 1.92, 1.93, 1.94, 1.95, 1.96, 1.97, 1.98, 1.99, 2.0, 2.0100000000000002, 2.02, 2.0300000000000002, 2.04, 2.05, 2.06, 2.07, 2.08, 2.09, 2.1, 2.11, 2.12, 2.13, 2.14, 2.15, 2.16, 2.17, 2.18, 2.19, 2.2, 2.21, 2.22, 2.23, 2.24, 2.25, 2.2600000000000002, 

# Exercise 5 (stretch): Famous Chaos Theory Plot 

There is a famous plot in chaos theory of the logistic map that relates values of the attractors in $x_t$ for values of $r$, detailing where the function tends to "end up" for each value of $r$.

<img src="logistic map.png" style="width: 400px;">

Reproduce this plot using the `matplotlib` package.

**Hint:** Produce samples from the function to fill arrays on the x and y axis!

**Hint:** Take the final 50 values in a series of data points produced by the function!
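Following the two hints, one possible sketch samples the map over a grid of `r` values, discards the early iterates, and scatters the last 50 values against `r`. The grid of 400 values on $[2.5, 4]$, the 200 warm-up iterations, the starting point `0.2`, and the function names are assumptions chosen for illustration, not the reference solution.

```python
def sample_attractor(r, x0=0.2, warmup=200, keep=50):
    """Iterate the logistic map and return the last `keep` values."""
    x = x0
    for _ in range(warmup):          # discard transient behaviour
        x = r * x * (1 - x)
    tail = []
    for _ in range(keep):            # record where the map "ends up"
        x = r * x * (1 - x)
        tail.append(x)
    return tail

def bifurcation_points(r_min=2.5, r_max=4.0, steps=400, keep=50):
    """Build parallel lists of r values (x axis) and attractor values (y axis)."""
    rs, xs = [], []
    for i in range(steps):
        r = r_min + (r_max - r_min) * i / (steps - 1)
        for x in sample_attractor(r, keep=keep):
            rs.append(r)
            xs.append(x)
    return rs, xs

def plot_bifurcation():
    # Imported here so the sampling functions can be used without matplotlib
    import matplotlib.pyplot as plt
    rs, xs = bifurcation_points()
    plt.scatter(rs, xs, s=0.1, color="black")
    plt.xlabel("r")
    plt.ylabel("x")
    plt.title("Logistic map")
    plt.show()
```

Calling `plot_bifurcation()` in a notebook cell draws the figure. Widening the `r` range or raising `steps` trades run time for detail.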