# Warmup: Count to 50

Use an RNG to generate rolls of a 12-sided die. 
Write a function that counts the number of rolls taken until the running total reaches 50 or more.

```
rollto50() -> 5
rollto50() -> 6
```

In [6]:
import numpy as np
import random

def rollto50():
  
    # start an empty list
    ttl = []

    # 50 rolls always suffice, since every roll is at least 1
    for i in range(1, 51):
        # roll the 12-sided die; random.randint includes both endpoints
        ttl.append(random.randint(1, 12))
        # stop as soon as the running total reaches 50
        if sum(ttl) >= 50:
            return i

print('It took {} rolls to reach a total of 50 or more'.format(rollto50()))
print('It took {} rolls to reach a total of 50 or more'.format(rollto50()))
print('It took {} rolls to reach a total of 50 or more'.format(rollto50()))
print('It took {} rolls to reach a total of 50 or more'.format(rollto50()))

It took 8 rolls to reach a total of 50 or more
It took 6 rolls to reach a total of 50 or more
It took 11 rolls to reach a total of 50 or more
It took 6 rolls to reach a total of 50 or more
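
The same count can also be kept with a running total instead of a list. The sketch below is one possible variant, not the notebook's solution; the names `rolls_to_target`, `target`, `sides`, and `rng` are hypothetical and introduced only for illustration.

```python
import random

def rolls_to_target(target=50, sides=12, rng=random):
    """Count rolls of a fair die until the running total reaches `target`.

    `target`, `sides`, and `rng` are illustrative parameters (assumptions),
    not part of the original exercise.
    """
    total = 0
    rolls = 0
    while total < target:
        # randint includes both endpoints, so this yields 1..sides
        total += rng.randint(1, sides)
        rolls += 1
    return rolls

random.seed(0)  # fixed seed only so repeated runs give the same result
print('It took {} rolls to reach a total of 50 or more'.format(rolls_to_target()))
```

With a 12-sided die the result always lies between 5 (since four rolls total at most 48) and 50 (every roll is at least 1).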


# Problem 1: Monte Carlo Sampling

Data scientists are often lazy. Instead of calculating the exact probability of complex events, we simulate samples with an RNG and average the results. This is called **Monte Carlo Sampling** after the casino in Monaco (yes, really).

Write a function `monte_carlo_dice(n)` that rolls a 6-sided die $n$ times and returns the average of the rolls.

The result should get closer to the true expected value (3.5) as $n$ increases:

```
n: 100 Trial average 3.39 
n: 1000 Trial average 3.576 
n: 10000 Trial average 3.5054 
n: 100000 Trial average 3.50201 
n: 500000 Trial average 3.495568
```
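
For reference, the true expected value quoted above follows directly from the definition, assuming a fair die where each face has probability $1/6$:

$$E[X] = \sum_{k=1}^{6} k \cdot \frac{1}{6} = \frac{1 + 2 + 3 + 4 + 5 + 6}{6} = \frac{21}{6} = 3.5$$

The variance of a single fair roll is $35/12$, so the standard error of an average of $n$ rolls is roughly $\sqrt{35/12} / \sqrt{n} \approx 1.71 / \sqrt{n}$, which is why the trial averages tighten around 3.5 as $n$ grows.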

In [7]:
import numpy as np
import random

def monte_carlo_dice(n):
  
    # start an empty list
    ttl = []

    # roll the die exactly n times
    for i in range(n):
        # roll the 6-sided die
        ttl.append(random.randint(1, 6))
    # return the average
    return sum(ttl) / n

for n in [100, 1000, 10000, 100000, 500000, 1000000]:
    print('n: {} Trial average is {}'.format(n, monte_carlo_dice(n)))

n: 100 Trial average is 3.33
n: 1000 Trial average is 3.466
n: 10000 Trial average is 3.4642
n: 100000 Trial average is 3.50694
n: 500000 Trial average is 3.502156
n: 1000000 Trial average is 3.500471
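
For large $n$, the Python loop can be replaced by a single NumPy call. The following is a minimal sketch of that idea, assuming NumPy's `Generator` API is acceptable for the exercise; the name `monte_carlo_dice_np` and the `seed` parameter are hypothetical additions, not part of the original problem.

```python
import numpy as np

def monte_carlo_dice_np(n, seed=None):
    """Average of n rolls of a fair 6-sided die, drawn in one vectorized call.

    `seed` is an illustrative parameter (an assumption) for repeatable runs.
    """
    rng = np.random.default_rng(seed)
    # Generator.integers excludes the upper bound, so 7 gives faces 1..6
    rolls = rng.integers(1, 7, size=n)
    return rolls.mean()

for n in [100, 10000, 1000000]:
    print('n: {} Trial average is {}'.format(n, monte_carlo_dice_np(n, seed=0)))
```

Note the differing conventions: `random.randint(1, 6)` includes 6, while `rng.integers(1, 7)` stops before 7.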


# Problem 2: Estimating the Area of a Circle

Consider a dartboard with a circle of radius $r$ inscribed in a square with side $2r$. Now let’s say you start throwing a large number of darts at it. 

Some of these, say $N$, will land inside the circle, and the others, say $M$, will land outside it. Consider the fraction of darts that land inside the circle:

$$f = \dfrac{N}{N + M}$$

Then $f \cdot A$, where $A$ is the area of the square, approximates the actual area of the circle (which is $\pi r^2$).
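
To see why this works, assume the darts land uniformly at random over the square. The probability that a dart lands inside the circle is then the ratio of the two areas, and $f$ estimates that probability:

$$f \approx P(\text{inside}) = \frac{\pi r^2}{(2r)^2} = \frac{\pi}{4}, \qquad f \cdot A \approx \frac{\pi}{4} \cdot 4r^2 = \pi r^2$$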

<img src="Circle Target.png" style="width: 200px;">

Write a function `circle_estimate(radius, trials)` which will estimate the area of a circle by throwing `trials` random darts at the square.



```
Radius: 2
Area: 12.566370614359172, Estimated (1000 darts): 12.576
Area: 12.566370614359172, Estimated (100000 darts): 12.58176
Area: 12.566370614359172, Estimated (1000000 darts): 12.560128
```

**Hint:** Generate 2 random numbers for each dart throw, one for the `x` axis and one for the `y` axis. Use the [Pythagorean Theorem](https://en.wikipedia.org/wiki/Pythagorean_theorem) to find whether the dart landed outside the circle.

In [8]:
import numpy as np
import random

def circle_estimate(radius, trials):
  
    # generate the x and y values using the uniform 
    # distribution of random numbers over a 
    # range of [-radius, +radius)
    # this simulates `trials` (x, y) dart hits
    # in a square of side 2r centered on the origin
    coord = np.random.rand(2, trials) * 2 * radius - radius

    # initialize counter
    hits = 0
    
    # loop over the dart throws
    for i in range(trials):
        # calculate the distance of the dart hit from the center
        dist = np.sqrt(coord[0, i]**2 + coord[1, i]**2)
        # check if within the circle
        if dist <= radius:
            hits += 1
    # return the hits / total * square area
    return hits / trials * (2 * radius) ** 2

r = 2
print('Radius: {}'.format(r))
area = np.pi * r ** 2
for n in [100, 1000, 10000, 100000, 1000000]:
    print('Area: {}, Estimated ({} darts): {}'.format(area, n, circle_estimate(r, n)))

Radius: 2
Area: 12.566370614359172, Estimated (100 darts): 12.8
Area: 12.566370614359172, Estimated (1000 darts): 12.464
Area: 12.566370614359172, Estimated (10000 darts): 12.6128
Area: 12.566370614359172, Estimated (100000 darts): 12.5936
Area: 12.566370614359172, Estimated (1000000 darts): 12.570448
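
The per-dart loop can also be written as array operations, and comparing squared distances avoids the square root entirely. This is a sketch of one possible vectorized variant, not the notebook's solution; `circle_estimate_np` and `seed` are hypothetical names introduced here.

```python
import numpy as np

def circle_estimate_np(radius, trials, seed=None):
    """Estimate a circle's area from uniform dart throws at its bounding square.

    `seed` is an illustrative parameter (an assumption) for repeatable runs.
    """
    rng = np.random.default_rng(seed)
    # one row per dart: (x, y) drawn uniformly from [-radius, radius)
    xy = rng.uniform(-radius, radius, size=(trials, 2))
    # a dart is inside when x^2 + y^2 <= radius^2 (Pythagorean theorem)
    inside = (xy ** 2).sum(axis=1) <= radius ** 2
    # fraction of hits times the area of the square
    return inside.mean() * (2 * radius) ** 2

r = 2
est = circle_estimate_np(r, 1_000_000, seed=0)
print('Area: {}, Estimated: {}'.format(np.pi * r ** 2, est))
print('Implied value of pi: {}'.format(est / r ** 2))
```

Dividing the estimate by $r^2$ turns the same experiment into a Monte Carlo estimate of $\pi$.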


# Problem 3: Binomial Distribution

The [binomial random variable](https://en.wikipedia.org/wiki/Binomial_distribution) $ Y \sim Bin(n, p) $ represents the number of successes in $ n $ independent trials (coin flips, for example), where each trial succeeds with probability $ p $.

Without any import besides `from numpy.random import uniform`, write a function
`binomial_rv` such that `binomial_rv(n, p)` generates one draw of $ Y $.

Hint: If $ U $ is uniform on $ (0, 1) $ and $ p \in (0,1) $, then the expression `U < p` evaluates to `True` with probability $ p $.
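
Spelled out, the hint suggests building $ Y $ as a sum of indicator variables, one per trial, assuming the uniform draws $ U_1, \dots, U_n $ are independent:

$$Y = \sum_{i=1}^{n} \mathbf{1}\{U_i < p\}, \qquad P(U_i < p) = p$$

Each indicator is a Bernoulli trial with success probability $ p $, so the sum has mean $ np $ and variance $ np(1 - p) $.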

In [13]:
from numpy.random import uniform

def binomial_rv(n, p):
    
    # generate n draws of uniform distribution and count
    # the number of results less than p
    draw_uni = sum(uniform(0, 1, n) < p)
    
    return draw_uni

# generate print output for testing
p_vals = [0.1, 0.5, 0.8]
n_vals = [20, 50, 100]

for n in n_vals:
    for p in p_vals:
        print('n: {}, p: {}, draw binomial: {}'.format(n, p, binomial_rv(n, p)))

n: 20, p: 0.1, draw binomial: 2
n: 20, p: 0.5, draw binomial: 10
n: 20, p: 0.8, draw binomial: 17
n: 50, p: 0.1, draw binomial: 6
n: 50, p: 0.5, draw binomial: 26
n: 50, p: 0.8, draw binomial: 44
n: 100, p: 0.1, draw binomial: 13
n: 100, p: 0.5, draw binomial: 49
n: 100, p: 0.8, draw binomial: 73
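
One way to sanity-check a sampler like this is to compare the sample mean and variance of many draws against the theoretical values $ np $ and $ np(1 - p) $. The sketch below restates `binomial_rv` so that it runs on its own; the values of `n`, `p`, and `draws` are arbitrary choices for illustration.

```python
from numpy.random import uniform

def binomial_rv(n, p):
    # count how many of n uniform draws fall below p
    return int((uniform(0, 1, n) < p).sum())

# illustrative parameters (assumptions), not taken from the exercise
n, p, draws = 50, 0.3, 20000

samples = [binomial_rv(n, p) for _ in range(draws)]
mean = sum(samples) / draws
var = sum((s - mean) ** 2 for s in samples) / draws

print('sample mean: {:.3f}, theoretical n*p: {:.3f}'.format(mean, n * p))
print('sample variance: {:.3f}, theoretical n*p*(1-p): {:.3f}'.format(var, n * p * (1 - p)))
```

Both sample statistics should land close to the theoretical values, with the gap shrinking as `draws` increases.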
