# Homework 06: Expected Value and Variance
***

**Name**: Matthew Riopelle

***

This assignment is due on Canvas by **6:00PM on Friday October 7**. Your solutions to theoretical questions should be done in Markdown directly below the associated question.  Your solutions to computational questions should include any specified Python code and results as well as written commentary on your conclusions.  Remember that you are encouraged to discuss the problems with your classmates, but **you must write all code and solutions on your own**.

**NOTES**: 

- Any relevant data sets should be available in the Homework 01 assignment write-up on Canvas. To make life easier on the grader if they need to run your code, do not change the relative path names here. Instead, move the files around on your computer.
- If you're not familiar with typesetting math directly into Markdown then by all means, do your work on paper first and then typeset it later.  Remember that there is a [reference guide](https://math.meta.stackexchange.com/questions/5020/mathjax-basic-tutorial-and-quick-reference) linked on Canvas on writing math in Markdown. **All** of your written commentary, justifications and mathematical work should be in Markdown.
- Because you can technically evaluate notebook cells in a non-linear order, it's a good idea to do $\color{red}{\text{Kernel}}$ $\color{red}\rightarrow$ $\color{red}{\text{Restart & Run All}}$ as a check before submitting your solutions.  That way if we need to run your code you will know that it will work as expected. 
- It is **bad form** to make your reader interpret numerical output from your code.  If a question asks you to compute some value from the data you should show your code output **AND** $\color{red}{\text{write a summary of the results}}$ in Markdown directly below your code. 
- This probably goes without saying, but... For any question that asks you to calculate something, you **must show all work and justify your answers to receive credit**. Sparse or nonexistent work will receive sparse or nonexistent credit. 

---

Import Pandas, NumPy, and matplotlib.pylab.

In [1]:
import pandas as pd
import numpy as np
import matplotlib.pylab as plt
%matplotlib inline 

# Problem 1
***
Consider a continuous random variable $X$ with a PDF given by:

$f(x)=
    \begin{cases}
        \frac{1}{10}(1-\frac{1}{20}x) & \text{if } 0\leq x\leq20\\
        0 & \text{elsewhere} 
    \end{cases}$
    
Furthermore, consider $Y$ to be a random variable with values $Y=3X+5$.

### Part A

***(2 points)*** Show that $f(x)$ is a valid PDF.

***solution:*** Put your solution to Part A here:

A PDF is valid if 

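1) $f(x) \geq 0$ for all $x$

2) $\int_{-\infty}^{\infty}f(x) dx = 1$

For condition 1: on $0 \leq x \leq 20$, $\frac{1}{10}\cdot(1-\frac{1}{20}x)$ decreases from $\frac{1}{10}$ at $x=0$ to $0$ at $x=20$, so it is never negative there, and $f(x) = 0$ everywhere else.

For condition 2: $f(x)$ is zero outside $[0, 20]$, so we only need to integrate from 0 to 20: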

$\int_{-\infty}^{\infty}f(x) dx = \frac{1}{10} \cdot \int_{0}^{20}(1-\frac{x}{20}) dx$

$= \frac{1}{10} \cdot [x - \frac{x^{2}}{40}]_{0}^{20}$

$= \frac{1}{10} \cdot [20 - \frac{400}{40} - (0-0)]$

$= \frac{1}{10} \cdot [20-10]$

$= 1$

Both conditions hold, so $f(x)$ is a valid PDF.
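
As a sanity check on the two conditions above, the integral can also be approximated numerically. The sketch below uses a midpoint Riemann sum; the helper name `f`, the bin count, and the test grid are illustrative choices, not part of the assignment.

```python
import numpy as np

def f(x):
    """PDF from Problem 1: (1/10)(1 - x/20) on [0, 20], zero elsewhere."""
    x = np.asarray(x, dtype=float)
    return np.where((x >= 0) & (x <= 20), 0.1 * (1 - x / 20), 0.0)

# Midpoint rule on [0, 20]; n_bins is an arbitrary illustrative choice.
n_bins = 100_000
width = 20 / n_bins
midpoints = (np.arange(n_bins) + 0.5) * width
total_area = np.sum(f(midpoints)) * width

# Spot-check nonnegativity on a grid that extends past both ends of [0, 20].
is_nonnegative = bool(np.all(f(np.linspace(-5, 25, 3001)) >= 0))

print(f"area under f: {total_area:.6f}")
print(f"f(x) >= 0 on the test grid: {is_nonnegative}")
```

A grid check like this cannot prove nonnegativity for every $x$, so it only supplements the argument in the written solution.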

### Part B

***(2 points)*** Find $E[X]$.

***solution:*** Put your solution to Part B here:

The expected value of a continuous random variable is defined as follows:

$E[X] = \int_{-\infty}^{\infty} (x \cdot f(x)) dx$

Since $f(x)$ is zero outside $[0, 20]$, the bounds become 0 and 20. Multiplying $f(x)$ by $x$ and integrating:

$E[X] = \int_{0}^{20}x \cdot (\frac{1}{10} - \frac{x}{200}) dx$

$= \int_{0}^{20} (\frac{x}{10} - \frac{x^{2}}{200}) dx$

$= [\frac{x^2}{20} - \frac{x^3}{600}]_{0}^{20}$

$= \frac{20^2}{20} - \frac{20^3}{600} - (0 - 0)$

$= 20 - \frac{8000}{600}$

$= 20 - 13.\overline{3}$

$= \frac{20}{3} \approx 6.67$

The expected value of $X$ is $\frac{20}{3} \approx 6.67$

### Part C

***(2 points)*** Find $Var(X)$. 

***solution:*** Put your solution to part C here:

We know the following:

$Var(X) = E[X^2] - (E[X])^2$

We will first find $E[X^2]$:

$E[X^2] = \frac{1}{10} \int_{0}^{20}x^2 \cdot (1-\frac{1}{20}x)dx$

$=\frac{1}{10} \int_{0}^{20}(x^2-\frac{x^3}{20})dx$

$=\frac{1}{10}\cdot[\frac{x^3}{3} - \frac{x^4}{80}]_0^{20}$

$=\frac{1}{10} \cdot [\frac{20^3}{3} - \frac{20^4}{80} - (0-0)]$

$=\frac{1}{10} \cdot [\frac{8000}{3} - \frac{160000}{80}]$

$=\frac{1}{10} \cdot [2666.\overline{6} - 2000]$

$=\frac{200}{3} \approx 66.67$

Now we square $E[X] = \frac{20}{3} \approx 6.67$ from Part B to get $(E[X])^2$. Using the exact fraction avoids rounding error:

$(E[X])^2 = (\frac{20}{3})^2 = \frac{400}{9} \approx 44.44$

Now we will plug in values:

$Var(X) = E[X^2] - (E[X])^2$

$= \frac{200}{3} - \frac{400}{9}$

$= \frac{200}{9} \approx 22.22$

$Var(X)$ is $\frac{200}{9}$, roughly 22.22
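
The moments computed in Parts B and C can be cross-checked numerically. The following is a minimal sketch using a midpoint Riemann sum on $[0, 20]$; the variable names and bin count are illustrative assumptions.

```python
import numpy as np

# Midpoint rule on [0, 20]; n_bins is an arbitrary illustrative choice.
n_bins = 200_000
width = 20 / n_bins
x = (np.arange(n_bins) + 0.5) * width
density = 0.1 * (1 - x / 20)  # f(x) on [0, 20]

mean_x = np.sum(x * density) * width            # approximates E[X]
second_moment = np.sum(x**2 * density) * width  # approximates E[X^2]
var_x = second_moment - mean_x**2               # Var(X) = E[X^2] - (E[X])^2

print(f"E[X]   ~ {mean_x:.4f}")
print(f"E[X^2] ~ {second_moment:.4f}")
print(f"Var(X) ~ {var_x:.4f}")
```

The results should sit very close to the exact fractions $\frac{20}{3}$, $\frac{200}{3}$, and $\frac{200}{9}$.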

### Part D

***(2 points)*** Find $E[Y]$.

***solution:*** Put your solution to part D here:

We have 

$Y = 3X + 5$

For any linear transformation of the form:

$Y = rX + s$

linearity of expectation gives:

$E[Y] = rE[X] + s$

Plugging into the formula, with $E[X] = \frac{20}{3}$ from Part B:

$E[Y] = 3E[X] + 5$

$= 3 \cdot \frac{20}{3} + 5$

$= 20 + 5$

$= 25$

Therefore:

$E[Y] = 25$

### Part E

***(2 points)*** Find $Var(Y)$.

***solution:*** Put your solution to part E here:

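We know that adding a constant does not change the variance and scaling by $r$ multiplies it by $r^2$:

$Var(rX + s) = r^2\cdot Var(X)$

And we are looking for:

$Var(3X + 5)$

Plugging in the exact value $Var(X) = \frac{200}{9} \approx 22.22$ from Part C:

$Var(Y) = 3^2 \cdot Var(X)$

$=9 \cdot \frac{200}{9}$

$= 200$

$Var(Y)$ is exactly 200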
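
The results for $Y = 3X + 5$ can also be checked by simulation. The sketch below draws samples of $X$ by inverse-transform sampling, which is an approach chosen here for illustration and not something the assignment requires. On $[0, 20]$ the CDF is $F(x) = \frac{x}{10} - \frac{x^2}{400}$, and solving $F(x) = u$ gives $x = 20(1 - \sqrt{1-u})$. The seed and sample size are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)  # seed chosen arbitrarily for reproducibility
n_samples = 200_000

# Inverse-transform sampling: F(x) = x/10 - x^2/400 on [0, 20],
# so F^{-1}(u) = 20 * (1 - sqrt(1 - u)) for u in [0, 1).
u = rng.random(n_samples)
x_samples = 20 * (1 - np.sqrt(1 - u))
y_samples = 3 * x_samples + 5

mean_y = y_samples.mean()
var_y = y_samples.var()

print(f"simulated E[Y]   ~ {mean_y:.2f}")
print(f"simulated Var(Y) ~ {var_y:.2f}")
```

With this many samples the simulated mean should land near 25 and the simulated variance near 200, up to sampling noise.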

# Problem 2
***

![image](wheel.png)

Consider the American roulette wheel as pictured above. 

### Part A

Suppose you decide to bet on red23 over and over and over again UNTIL you win, then you'll stop playing.

Winning means the ball lands on the red23 slot - and it has an equal chance of landing in any of the slots. 

So, you might play only once (if you win in one game.) Or, perhaps you'll play twice, or three times, etc. before you win. 

Let $X$ be the random variable, "Number of times you play till you win."

***(2 points)*** How many times do you expect to play before you win?

***solution:*** Put your answer to Part A here:

There are 36 numbers from 1-36, then a 0 and a 00 on the roulette board. That means that at each spin, we have a $\frac{1}{38}$ chance of landing on the number 23 (the color is irrelevant here).

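Each spin is independent with the same success probability, so $X$ is a geometric random variable with $p = \frac{1}{38}$, and its expected value is $\frac{1}{p}$. You should therefore expect to play 38 times, counting the winning spin.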

### Part B

***(3 points)*** What is the probability that you don't win until your 4th attempt?

***solution:*** Put your answer to Part B here:

This is the probability that the first win comes on exactly the 4th attempt, that is, three losses followed by a win. We will take $p = \frac{1}{38}$ and $n = 4$ and plug into the geometric PMF:

$P(X = n) = (1-p)^{n-1}\cdot p$

$P(X = 4) = (1-\frac{1}{38})^3 \cdot \frac{1}{38}$

$\approx 0.0243$

Therefore, you have roughly a 2.4% chance of winning for the first time on your 4th attempt.
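
The same calculation can be done exactly with rational arithmetic. This is a small sketch; `geometric_pmf` is a helper name introduced here for illustration.

```python
from fractions import Fraction

def geometric_pmf(n, p):
    """P(X = n): first success on trial n, i.e. n - 1 failures then a success."""
    return (1 - p) ** (n - 1) * p

p_win = Fraction(1, 38)  # one winning slot out of 38
prob_fourth = geometric_pmf(4, p_win)

print(prob_fourth)         # exact fraction
print(float(prob_fourth))  # decimal approximation
```

Using `Fraction` keeps the result exact, so the decimal only appears at the final conversion.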

### Part C

Recall from Calculus that a geometric series with ratio $r$ diverges if $|r|>1$, but if $0<|r|<1$ then the series converges:

$\displaystyle{\sum_{n=0}^{\infty}ar^n=\frac{a}{1-r}}$.

Therefore, for $0<p<1$, we have $\displaystyle{\sum_{k=0}^{\infty}p(1-p)^k=p\cdot\frac{1}{1-(1-p)}=1}.$

***(5 points)*** Explain (show) why $\displaystyle{E[X]=\sum_{k=1}^{\infty}kp(1-p)^{k-1}=\frac{1}{p}}$.

***solution:*** Put your answer to Part C here:

Here, we have:

$E[X] = p \cdot \sum_{k=1}^{\infty}k \cdot (1-p)^{k-1}$

If we multiply both sides by $(1-p)$, we have:

$(1-p) \cdot E[X] = p \cdot \sum_{k=1}^{\infty}k \cdot (1-p)^{k}$

Multiplying by $(1-p)$ shifts every term one power higher. When we subtract this shifted series from the original, the coefficients $k$ and $k-1$ on each power of $(1-p)$ differ by exactly 1, so the factor of $k$ disappears.

Writing this all out algebraically:

$E[X] = p \cdot \sum_{k=1}^{\infty}k \cdot (1-p)^{k-1}$

$E[X] = p\cdot[1 + 2(1-p) + 3(1-p)^2 + ...]$

Subtracting from this:

$(1-p) \cdot E[X] = p \cdot[(1-p) + 2(1-p)^2 + 3(1-p)^3 + ...]$

Now we have: 

$E[X](1-(1-p)) = p \cdot[1 + (1-p) + (1-p)^2 + (1-p)^3 + ...]$

The bracket is a geometric series with $a = 1$ and $r = 1-p$, which converges because $0 < 1-p < 1$. Applying the geometric series formula:

$E[X] \cdot p = p \cdot \frac{1}{1-(1-p)}$

$E[X] \cdot p = p \cdot \frac{1}{p} = 1$

$E[X] = \frac{1}{p}$
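
A quick numerical check of this identity is to sum the first several thousand terms of the series and compare against $\frac{1}{p}$. The sketch below uses $p = \frac{1}{38}$ from the roulette setting; the cutoff `n_terms` is an arbitrary choice, large enough that the remaining tail is negligible.

```python
p = 1 / 38
n_terms = 5000  # arbitrary cutoff; (1 - p)^5000 is vanishingly small

# Partial sum of sum_{k>=1} k * p * (1 - p)^(k - 1)
partial_sum = sum(k * p * (1 - p) ** (k - 1) for k in range(1, n_terms + 1))

print(f"partial sum: {partial_sum:.10f}")
print(f"1 / p:       {1 / p:.10f}")
```

The agreement is only numerical evidence for one value of $p$; the algebraic argument above is what establishes the result for all $0 < p < 1$.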

### Part D

***(3 points)*** What is $E[X]$ ?

***solution:*** Put your answer to Part D here:

We know the probability is $p = \frac{1}{38}$

We know that

$E[X] = \frac{1}{p}$

Plugging in we have:

$E[X] = \frac{1}{\frac{1}{38}}$

$E[X] = 38$
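
This expected value can be compared with a simulation. The sketch below uses NumPy's `Generator.geometric`, which returns the number of trials up to and including the first success; the seed and number of simulated sessions are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(1)  # arbitrary seed for reproducibility
p_win = 1 / 38

# Each entry is the number of spins needed to hit the chosen slot once.
plays_until_win = rng.geometric(p_win, size=200_000)
mean_plays = plays_until_win.mean()

print(f"average number of plays until the first win: {mean_plays:.2f}")
```

The sample average should be close to 38, though individual sessions vary widely because the geometric distribution has a long right tail.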

### Part E

You are interested in knowing how much money you should expect to win $\textbf{each time you play.}$ After all, you have decided to play over and over again till you win.

Now, let $X$ be the random variable, "Amount of money you win."

You are still betting on 23red, and each bet costs \$1.

If you lose, you lose your dollar.

If you win, you get your dollar back $\textbf{and}$ you get an additional \$35 for winning.

***(3 points)*** What is $E[X]$ ?

***solution:*** Put your answer to Part E here:

We have the following:

Probability of winning: $\frac{1}{38}$

Net outcome of winning: you receive \$36 after paying \$1, so $36 - 1 = 35$ dollars

Probability of losing: $\frac{37}{38}$

Net outcome of losing: $-1$ dollar

Therefore:

$E[X] = (\frac{1}{38}) \cdot 35 + (\frac{37}{38}) \cdot (-1)$

$= \frac{35}{38} - \frac{37}{38}$

$= -\frac{2}{38} = -\frac{1}{19}$

$\approx -0.0526$

Each time you play, you can expect to lose about 5 cents on average!
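
The expected value above is just a probability-weighted sum of the two net outcomes, which can be written out exactly. In this sketch the dictionary `outcomes` (net dollars mapped to probability) is an illustrative structure.

```python
from fractions import Fraction

# Net winnings in dollars mapped to their probabilities for a single bet.
outcomes = {
    35: Fraction(1, 38),   # win: stake returned plus 35
    -1: Fraction(37, 38),  # lose: stake is gone
}

total_probability = sum(outcomes.values())
expected_winnings = sum(value * prob for value, prob in outcomes.items())

print(expected_winnings)         # exact fraction
print(float(expected_winnings))  # decimal approximation
```

Checking that the probabilities sum to 1 is a cheap guard against leaving out an outcome.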

# Problem 3
***

***Using the definition*** of expected value and variance,

Discrete: $E[X] = \sum_ia_i\cdot P(X=a_i)$.

Continuous: $E[X] = \int_{-\infty}^{\infty}xf(x)\phantom{x}dx$

$Var(X) = E\left[(X-E[X])^2\right] = E[X^2]-(E[X])^2$

Find the following:

### Part A

***(3 points)*** Suppose $X$~$U[\alpha, \beta]$, find $E[X]$. $X$ takes on all real values between $\alpha$ and $\beta$.

***solution:*** Put your answer to Part A here:

For a uniform distribution on $[a, b]$ (here $a = \alpha$ and $b = \beta$), we have the following density:

\begin{equation}
f(x) = 
\left\{
    \begin{array}{lr}
        \frac{1}{b-a}, & \text{if } a \leq x \leq b\\
        0, & \text{everywhere else}
    \end{array}
\right.
\end{equation}

We also know that:

$E[X] = \int_{-\infty}^{\infty}(x \cdot f(x)) dx$

Plugging in:

$E[X] = \frac{1}{b-a}\int_{a}^{b}(x) dx$

$=\frac{1}{b-a}[\frac{x^2}{2}]_a^b$

$=\frac{1}{b-a} \cdot [\frac{b^2-a^2}{2}]$

$=\frac{1}{b-a} \cdot (\frac{(b+a)(b-a)}{2})$

$=\frac{b+a}{2}$

Therefore, with $a = \alpha$ and $b = \beta$, $E[X] = \frac{\alpha+\beta}{2}$ for $X \sim U[\alpha, \beta]$

### Part B

***(3 points)*** Suppose $X$~$U[\alpha, \beta]$, find $Var[X]$.

***solution:*** Put your answer to Part B here:

We will use the following formula:

$Var[X] = E[X^2] - (E[X])^2$

By definition, the second moment is:

$E[X^2] = \int_{-\infty}^{\infty}(x^2 \cdot f(x)) dx$

Plugging in the uniform density $f(x) = \frac{1}{b-a}$ on $[a, b]$:

$E[X^2] = \frac{1}{b-a}\int_{a}^{b}(x^2) dx$

$=\frac{1}{b-a}[\frac{x^3}{3}]_a^b$

$=\frac{1}{b-a} \cdot [\frac{b^3-a^3}{3}]$

$=\frac{1}{b-a} \cdot (b-a) \cdot (\frac{b^2+ ba + a^2}{3})$

$E[X^2] = \frac{b^2+ba+a^2}{3}$

Now, plugging in we have:

$Var[X] = \frac{b^2+ba+a^2}{3} - (\frac{b+a}{2})^2$

$= \frac{b^2+ba+a^2}{3} - \frac{b^2+2ab+a^2}{4}$

$= \frac{4b^2+4ba+4a^2}{12} - \frac{3b^2 + 6ab + 3a^2}{12}$

$= \frac{b^2-2ab+a^2}{12}$

$= \frac{(b-a)^2}{12}$

Therefore, $Var[X]$ for a uniform distribution on $[a, b]$ is $\frac{(b-a)^2}{12}$
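
The two uniform formulas can be spot-checked by sampling. The endpoints `a = 2.0` and `b = 10.0` below are hypothetical values picked only for illustration, as are the seed and sample size.

```python
import numpy as np

rng = np.random.default_rng(2)  # arbitrary seed for reproducibility
a, b = 2.0, 10.0                # hypothetical endpoints for illustration

samples = rng.uniform(a, b, size=200_000)
sample_mean = samples.mean()
sample_var = samples.var()

theory_mean = (a + b) / 2
theory_var = (b - a) ** 2 / 12

print(f"mean: simulated {sample_mean:.4f}, formula {theory_mean:.4f}")
print(f"var:  simulated {sample_var:.4f}, formula {theory_var:.4f}")
```

Agreement for one choice of endpoints does not prove the general formulas, but a mismatch would flag an algebra slip in the derivation.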

### Part C

***(3 points)*** Suppose $X$~$Ber(p)$, find $E[X]$.

***solution:*** Put your answer to Part C here:

We know that the PMF of a Bernoulli random variable is:

\begin{equation}
p(x) = 
\left\{
    \begin{array}{lr}
        1-p, & \text{if } x = 0\\
        p, & \text{if } x = 1\\
        0, & \text{elsewhere}\\
    \end{array}
\right.
\end{equation}

We also know that:

$E[X] = \sum_ia_ip(a_i)$

Plugging in:

$E[X] = (1-p)\cdot 0 + (p) \cdot 1$

$E[X] = p$

Therefore, $E[X]$ for $X \sim Ber(p)$ is $p$

### Part D

***(3 points)*** Suppose $X$~$Ber(p)$, find $Var[X]$.

***solution:*** Put your answer to Part D here:

Using the same PMF as above, and knowing that:

$Var[X] = E[X^2] - (E[X])^2$

We will have:

$E[X^2] = \sum_ia_i^2p(a_i)$

$= (0^2)(1-p) + (1^2)(p)$

$E[X^2] = p$

Now plugging in, with $E[X] = p$ from Part C:

$Var[X] = (p) - (p^2)$

$Var[X] = p(1-p)$

Therefore, $Var[X]$ for $X \sim Ber(p)$ is $p(1-p)$
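
Both Bernoulli results follow directly from the discrete definitions, which the sketch below applies with exact fractions. The helper `bernoulli_moments` and the test value $p = \frac{3}{10}$ are illustrative assumptions.

```python
from fractions import Fraction

def bernoulli_moments(p):
    """Return (E[X], Var[X]) for X ~ Ber(p) using the discrete definitions."""
    pmf = {0: 1 - p, 1: p}
    mean = sum(a * prob for a, prob in pmf.items())
    second_moment = sum(a**2 * prob for a, prob in pmf.items())
    return mean, second_moment - mean**2

p = Fraction(3, 10)  # hypothetical value of p for illustration
mean, variance = bernoulli_moments(p)

print(mean)      # should equal p
print(variance)  # should equal p * (1 - p)
```

Because $0^2 = 0$ and $1^2 = 1$, the first and second moments coincide, which is why the variance reduces to $p - p^2$.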

# Problem 4
***

Consider a card game played with a standard deck of 52 cards.

The cards are shuffled, a card is chosen, recorded, and returned to the deck.

This is done three times and the record of three choices is observed.

The game costs \$1 to play.

If all three cards have the same number, then you get your dollar back plus \$3.

If you only have two cards with the same number, then you get your dollar back plus \$2.

If all three cards are of the same suit, then you get your dollar back plus \$2.

Examples:

Ace of spades, Ace of clubs, 5 of diamonds: dollar back plus \$2.

Ace of spades, 2 of spades, 7 of spades: dollar back plus \$2.

5 of diamonds, 8 of diamonds, 8 of diamonds: dollar back plus \$2, plus \$2.

4 of hearts, 4 of hearts, 4 of hearts: dollar back plus \$3, plus \$2.

6 of diamonds, 6 of hearts, 6 of clubs: dollar back plus \$3.

3 of hearts, 5 of diamonds, Queen of clubs: Lose your dollar.

***(7 points)*** Write a function or functions that will create a random draw of three cards as described above. Simulate this game (at least 1000 times) and determine from the simulation the expected winnings per dollar of this game. 

In [2]:
# Put your code for Problem 4 here:
def PickCards():
    """Draw three cards with replacement and return the net winnings in dollars."""
    # All possible ranks and suits; strings throughout so NumPy keeps a single dtype
    card_num = np.array(["Ace","2","3","4","5","6","7","8","9","10","Jack","Queen","King"])
    card_suit = np.array(["Diamond","Club","Heart","Spade"])
    # Rank and suit are drawn independently, which is equivalent to picking
    # one of the 52 cards uniformly at random and returning it to the deck
    picked_nums = np.random.choice(card_num, size=3)
    picked_suits = np.random.choice(card_suit, size=3)
    # Count the distinct ranks and suits among the three cards
    unique_nums = len(set(picked_nums))
    unique_suits = len(set(picked_suits))
    bonus = 0
    if unique_nums == 1:
        # all three cards have the same number
        bonus += 3
    elif unique_nums == 2:
        # exactly two cards have the same number
        bonus += 2
    if unique_suits == 1:
        # all three cards have the same suit
        bonus += 2
    # Any bonus means the dollar is returned, so the net gain is the bonus;
    # otherwise the dollar is lost
    return bonus if bonus > 0 else -1

n_games = 100000
winnings = np.array([PickCards() for k in range(n_games)])
print("You can expect to win: $",np.mean(winnings), " every time you play!")


You can expect to win: $ -0.1653  every time you play!


Therefore, based on a simulation of 100,000 games, you can expect to lose roughly 16 to 17 cents every time you play.
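
Because the deck is small, the simulated value can be compared against an exact answer obtained by enumerating all $52^3$ equally likely ordered draws. The sketch below is one way to do that; `net_winnings` and the constants are names introduced here, and the payout rules are read from the problem statement (number bonus of 3 or 2, suit bonus of 2, dollar returned on any win).

```python
from fractions import Fraction
from itertools import product

RANKS = range(1, 14)  # 1 = Ace, 11 to 13 = Jack, Queen, King
SUITS = ("Diamond", "Club", "Heart", "Spade")
DECK = [(rank, suit) for rank in RANKS for suit in SUITS]

def net_winnings(hand):
    """Net dollars for one three-card draw under the rules of Problem 4."""
    distinct_ranks = len({rank for rank, _ in hand})
    distinct_suits = len({suit for _, suit in hand})
    bonus = 0
    if distinct_ranks == 1:
        bonus += 3  # all three cards share a number
    elif distinct_ranks == 2:
        bonus += 2  # exactly two cards share a number
    if distinct_suits == 1:
        bonus += 2  # all three cards share a suit
    # A winning hand returns the dollar, so the net gain equals the bonus.
    return bonus if bonus > 0 else -1

# Draws are with replacement, so every ordered triple is equally likely.
total = sum(net_winnings(hand) for hand in product(DECK, repeat=3))
expected_value = Fraction(total, len(DECK) ** 3)

print(expected_value, float(expected_value))
```

The enumeration gives an expected loss of a little over 16 cents per game, consistent with the simulated value once sampling noise is taken into account.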

### Rubric Check
***
***(5 points)*** Make sure your answers are thorough but not redundant. Explain your answers, don't just put a number. Make sure you have matched your questions on Gradescope. Make sure your PDF is correct and your LaTeX is correct. etc. etc. BE NEAT.