### 2.1

Give a real-world example of a joint distribution $Pr(x,y)$ where $x$ is discrete and $y$ is continuous.

##### answer

Let $x$ be a discrete random variable that represents a weather forecast, with two possible values $\{rainy, sunny\}$. Let $y$ be a continuous random variable that represents temperature. We could then define a joint distribution $P(x,y)$ that models the interaction between weather forecasts and temperature. The definition of $P(x,y)$ depends on the assumptions we (as the modeler) make.

Suppose we assume that the weather forecast and the temperature are independent. An appropriate course of action would then be:

1. define $P(x)$ and $P(y)$ individually
2. define $P(x,y) = P(x)P(y)$

Suppose instead we assume that the weather forecast and the temperature depend on each other in the following way: "if the forecast is rainy, then the temperature is relatively lower". We could then define $P(x,y)$ by:

1. defining a prior probability over forecasts $P(x) = Bern_x[\lambda]$, where $P(x=0) = 1 - \lambda$ gives the probability of a rainy forecast and $P(x=1) = \lambda$ gives the probability of a sunny forecast.
2. defining two Gaussian distributions $P(y|x=0)$ and $P(y|x=1)$, where the expected value of the first is less than the expected value of the second, so that $P(x,y) = P(y|x)P(x)$.
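The dependent model described above can be sketched as a two-step sampler: draw the forecast first, then draw the temperature from the Gaussian selected by that forecast. This is only an illustration; the function name `sample_joint` and all parameter values ($\lambda = 0.7$, means of 12 and 22 degrees, standard deviation 3) are assumptions made up for the example, not part of the exercise.

```python
import random

def sample_joint(lam=0.7, mu_rainy=12.0, mu_sunny=22.0, sigma=3.0, rng=random):
    """Draw one (x, y) pair from P(x, y) = P(y | x) P(x).

    All default parameter values are hypothetical, chosen for illustration.
    """
    # Step 1: x ~ Bernoulli(lam), with 0 = rainy and 1 = sunny.
    x = 1 if rng.random() < lam else 0
    # Step 2: y | x ~ Normal(mu_x, sigma), with a lower mean when rainy.
    mu = mu_sunny if x == 1 else mu_rainy
    y = rng.gauss(mu, sigma)
    return x, y

rng = random.Random(0)  # fixed seed so the run is repeatable
samples = [sample_joint(rng=rng) for _ in range(20000)]

rainy = [y for x, y in samples if x == 0]
sunny = [y for x, y in samples if x == 1]
mean_rainy = sum(rainy) / len(rainy)
mean_sunny = sum(sunny) / len(sunny)
frac_sunny = len(sunny) / len(samples)

print(mean_rainy, mean_sunny, frac_sunny)
```

The empirical mean temperature for rainy forecasts should come out below the one for sunny forecasts, which is exactly the dependence the model was built to express.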


### 2.2

What remains if I marginalize a joint distribution $Pr(v,w,x,y,z)$ over five variables with respect to variables $w$ and $y$? What remains if I marginalize the resulting distribution with respect to $v$?

##### answer

note: I'm interpreting "with respect to" as naming the variables that are summed (or integrated) out in the marginalization procedure.

1. $Pr(v,x,z)$ remains: $w$ and $y$ are the variables summed out.
2. $Pr(x,z)$ remains: $v$ is the variable summed out.
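As a small sanity check of the two answers, here is a sketch that marginalizes a randomly generated joint table over five binary variables. The binary domain, the random weights, and the helper name `marginalize` are assumptions for illustration only.

```python
import itertools
import random

names = ("v", "w", "x", "y", "z")

# Hypothetical joint distribution: random positive weights, normalized to sum to 1.
rng = random.Random(0)
weights = {k: rng.random() for k in itertools.product([0, 1], repeat=5)}
total = sum(weights.values())
joint = {k: wt / total for k, wt in weights.items()}

def marginalize(table, names, drop):
    """Sum out the variables in `drop`; return the new table and its variable names."""
    keep = [i for i, n in enumerate(names) if n not in drop]
    out = {}
    for outcome, p in table.items():
        key = tuple(outcome[i] for i in keep)
        out[key] = out.get(key, 0.0) + p
    return out, tuple(names[i] for i in keep)

# Step 1: sum over w and y, leaving a distribution over (v, x, z).
p_vxz, names_vxz = marginalize(joint, names, {"w", "y"})
# Step 2: sum over v, leaving a distribution over (x, z).
p_xz, names_xz = marginalize(p_vxz, names_vxz, {"v"})

print(names_vxz, len(p_vxz))
print(names_xz, len(p_xz))
```

Each marginal is still a valid distribution (its entries sum to 1), and only the variables that were not summed out remain.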



### 2.3

Show that the following relation is true:

$$
Pr(w,x,y,z) = Pr(x,y)Pr(z|w,x,y)Pr(w|x,y)
$$

proof:

$$
\begin{align}
Pr(x,y)Pr(z|w,x,y)Pr(w|x,y)
& = Pr(z|w,x,y)Pr(w|x,y)Pr(x,y) &&\text{rearrange} \tag 1\\
& = Pr(z|w,x,y)Pr(w,x,y) &&\text{prince 2.5} \tag 2\\
& = \frac{Pr(w,x,y,z)}{Pr(w,x,y)}Pr(w,x,y) &&\text{definition of conditional probability on left factor of (2)} \tag 3\\
& = Pr(w,x,y,z) &&\text{terms cancel} \tag 4
\end{align}
$$
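The identity can also be checked numerically. The sketch below builds a hypothetical random joint table over four binary variables, computes each factor on the right-hand side from that table, and compares the product to the joint. Because the conditionals are derived from the same table, this is a consistency check of the algebra rather than an independent proof.

```python
import itertools
import random

# Hypothetical joint over binary (w, x, y, z); strictly positive so no division by zero.
rng = random.Random(1)
outcomes = list(itertools.product([0, 1], repeat=4))  # tuples ordered as (w, x, y, z)
weights = [rng.random() + 0.01 for _ in outcomes]
total = sum(weights)
joint = {o: wt / total for o, wt in zip(outcomes, weights)}

def pr_wxy(w, x, y):
    """Pr(w, x, y): marginalize the joint over z."""
    return sum(joint[(w, x, y, z)] for z in (0, 1))

def pr_xy(x, y):
    """Pr(x, y): marginalize Pr(w, x, y) over w."""
    return sum(pr_wxy(w, x, y) for w in (0, 1))

max_error = 0.0
for (w, x, y, z), p in joint.items():
    pr_z_given_wxy = p / pr_wxy(w, x, y)              # Pr(z | w, x, y)
    pr_w_given_xy = pr_wxy(w, x, y) / pr_xy(x, y)     # Pr(w | x, y)
    rhs = pr_xy(x, y) * pr_z_given_wxy * pr_w_given_xy
    max_error = max(max_error, abs(rhs - p))

print(max_error)
```

The largest discrepancy should be at the level of floating-point rounding.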



### 2.4

In my pocket there are two coins. Coin 1 is unbiased, so the likelihood $Pr(h = 1 | c = 1)$ of getting heads is 0.5 and the likelihood $Pr(h = 0 | c = 1)$ of getting tails is 0.5. Coin 2 is biased so the likelihood $Pr(h = 1 | c = 2)$ of getting heads is 0.8 and the likelihood $Pr(h = 0 | c = 2)$ of getting tails is 0.2. I reach into my pocket and draw one of the coins at random. There is an equal prior probability I might have picked either coin. I flip the coin and observe a head. Use Bayes' rule to compute the posterior probability that I chose coin 2.

##### answer

for clarity, let's rewrite Bayes' rule in terms of $h$ and $c$:

$$
P(c|h) = \frac{P(h|c)P(c)}{P(h)}
$$

the task is to compute the following probability:

$$
P(c=2|h=1)
$$

first let's compute the denominator $P(h=1)$

$$
\begin{align}
P(h=1)
& = \sum_{C} P(h=1,c) &&\text{sum rule} \tag 1\\
& = \sum_{C} P(h=1|c)P(c) &&\text{product rule} \tag 2\\
& = P(h=1|c=1)P(c=1) + P(h=1|c=2)P(c=2) &&\text{expand sum} \tag 3\\
& = (0.5)(0.5) + (0.8)(0.5) &&\text{substitute given likelihood and prior probabilities} \tag 4\\
& = 0.65
\end{align}
$$

even though it's not necessary, let's compute $P(h=0)$ as a sanity check (it should equal $1 - 0.65 = 0.35$)

$$
\begin{align}
P(h=0)
& = \sum_{C} P(h=0,c) &&\text{sum rule} \tag 1\\
& = \sum_{C} P(h=0|c)P(c) &&\text{product rule} \tag 2\\
& = P(h=0|c=1)P(c=1) + P(h=0|c=2)P(c=2) &&\text{expand sum} \tag 3\\
& = (0.5)(0.5) + (0.2)(0.5) &&\text{substitute given likelihood and prior probabilities} \tag 4\\
& = 0.35
\end{align}
$$

next we compute the numerator

$$
\begin{align}
P(h=1|c=2)P(c=2)
& = (0.8)P(c=2) &&\text{substitute given likelihood probability} \tag 1\\
& = (0.8)(0.5) &&\text{substitute given prior probability} \tag 2\\
& = 0.4
\end{align}
$$

putting it all together:

$$
P(c=2|h=1) = \frac{P(h=1|c=2)P(c=2)}{P(h=1)} = \frac{0.4}{0.65} \approx 0.62
$$

so there is roughly a 62% chance that the coin drawn from the pocket is coin 2, given that it was flipped and heads was observed.
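The same calculation can be reproduced in a few lines of Python. This sketch uses exact fractions so the result is not affected by rounding. The variable names are chosen here for illustration.

```python
from fractions import Fraction

# Prior P(c) and likelihood of heads P(h=1 | c), as given in the exercise.
prior = {1: Fraction(1, 2), 2: Fraction(1, 2)}
likelihood_heads = {1: Fraction(1, 2), 2: Fraction(4, 5)}

# Denominator P(h=1): sum rule followed by product rule.
evidence = sum(likelihood_heads[c] * prior[c] for c in prior)

# Bayes' rule: P(c | h=1) = P(h=1 | c) P(c) / P(h=1).
posterior = {c: likelihood_heads[c] * prior[c] / evidence for c in prior}

print(evidence, posterior[2], float(posterior[2]))
```

The exact posterior for coin 2 is $8/13$, which rounds to the 0.62 obtained by hand.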

### 2.5

If variables $x$ and $y$ are independent and variables $x$ and $z$ are independent, does it follow that variables $y$ and $z$ are independent?

here I interpret "independent" as "pairwise independent"

proof



In [9]:
import itertools
import pandas as pd

# Enumerate all 2^3 joint outcomes of three binary variables (x, y, z).
list(itertools.product([0, 1], repeat=3))


[(0, 0, 0),
 (0, 0, 1),
 (0, 1, 0),
 (0, 1, 1),
 (1, 0, 0),
 (1, 0, 1),
 (1, 1, 0),
 (1, 1, 1)]

In [12]:
# One row per (x, y, z) outcome; 'prob' is a placeholder column initialized to 0.
df = pd.DataFrame(list(itertools.product([0, 1], [0, 1], [0, 1], [0])),
                  columns=['x', 'y', 'z', 'prob'])
df

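|   | x | y | z | prob |
|---|---|---|---|------|
| 0 | 0 | 0 | 0 | 0 |
| 1 | 0 | 0 | 1 | 0 |
| 2 | 0 | 1 | 0 | 0 |
| 3 | 0 | 1 | 1 | 0 |
| 4 | 1 | 0 | 0 | 0 |
| 5 | 1 | 0 | 1 | 0 |
| 6 | 1 | 1 | 0 | 0 |
| 7 | 1 | 1 | 1 | 0 |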
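
The `prob` column above is still a placeholder. One possible way to fill it in, offered as a sketch rather than the intended solution, is a counterexample: let $x$ and $y$ be independent fair bits and let $z$ be a copy of $y$. Then $x$ is independent of $y$ and of $z$, but $y$ and $z$ are completely dependent, so the answer to the question is no. The code uses plain dictionaries instead of the DataFrame, and the helper names `marginal` and `independent` are hypothetical.

```python
import itertools
from fractions import Fraction

# Counterexample joint over (x, y, z): x and y are independent fair bits, z = y.
joint = {}
for x, y, z in itertools.product([0, 1], repeat=3):
    joint[(x, y, z)] = Fraction(1, 4) if z == y else Fraction(0)

def marginal(keep):
    """Marginal distribution over the variable positions listed in `keep`."""
    out = {}
    for outcome, p in joint.items():
        key = tuple(outcome[i] for i in keep)
        out[key] = out.get(key, Fraction(0)) + p
    return out

def independent(i, j):
    """True if P(a, b) = P(a) P(b) for every pair of values of variables i and j."""
    p_ij = marginal((i, j))
    p_i = marginal((i,))
    p_j = marginal((j,))
    return all(p_ij[(a, b)] == p_i[(a,)] * p_j[(b,)] for a, b in p_ij)

x_y = independent(0, 1)  # x vs y
x_z = independent(0, 2)  # x vs z
y_z = independent(1, 2)  # y vs z

print(x_y, x_z, y_z)
```

Here $P(y=0, z=0) = 1/2$ while $P(y=0)P(z=0) = 1/4$, so the factorization fails for $y$ and $z$ even though it holds for the other two pairs.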
