# Statistical Power & Bayes

Chris Overton  
2016-09-22  

Adapted from versions by several other lecturers

Morning: winding up our frequentist statistics
* Recap: significance vs. causality
* Statistical Power

Afternoon:
* Bayesian Inference

## Afternoon: Bayesian Inference
* Frequentist vs. Bayesian approaches
* Work with Bayes rule to compute posterior probability
* Prior, likelihood and posterior distributions

# Three coins problem - a second look

Three coins are in an urn.  

Coin $B_1$ has sides HH (i.e. heads on each side)  
Coin $B_2$ has sides HT (like a normal fair coin)  
Coin $B_3$ has sides TT (tails on each side)

Pull out a coin and flip it. It comes up H.

What is the probability the same coin comes out H if you flip it a second time?

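# Check by simulation: draw a coin at random, then flip it twice
import random
import pandas as pd

coins = ['HH', 'HT', 'TT']
results = []
for _ in range(100):
    coin = random.choice(coins)
    # each flip shows one of the coin's two sides, chosen at random
    results.append([random.choice(coin) for _ in range(2)])
# True where a flip came up heads
df = pd.DataFrame(results, columns=['first', 'second']) == 'H'
# fraction of second flips that are H, grouped by whether the first flip was H
df.groupby('first').mean()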

| first | second |
| ----- | ------ |
| False | 0.173913 |
| True | 0.851852 |


# Our old solution

$ P(X_2 = H) = 1/2 $

$ P(X_2 = H | X_1 = H) = \frac{5}{6} \ne \frac{1}{2} = P(X_2=H) $

Originally, each coin had a probability 1/3 of being picked. Once we have seen an H, the coin picked cannot be the third (TT), and the first (HH) is now twice as likely as the second (HT).

# Old solution uses conditional probability

$ P(B|A) = P(A \cap B) / P(A) $

in other words

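$ P(X_2=H | X_1=H) = \frac{P(X_2 = H \cap X_1 = H)}{P(X_1=H)} = \frac{\frac{1}{3} + \frac{1}{3} \cdot \frac{1}{4}}{\frac{1}{2}} = \frac{5}{6}$

In the numerator, the first term is the HH coin (which always gives two heads) and the second is the HT coin landing heads twice.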
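The ratio above can also be checked exactly by enumerating every equally likely outcome. This is a minimal sketch using the standard-library `fractions` module. The variable names are illustrative and not taken from the original notebook.

```python
from fractions import Fraction
from itertools import product

coins = ['HH', 'HT', 'TT']

# Every (coin, side shown first, side shown second) triple is equally likely:
# 3 coins x 2 sides x 2 sides = 12 outcomes.
outcomes = [(c[i], c[j]) for c in coins for i, j in product(range(2), repeat=2)]

first_h = [o for o in outcomes if o[0] == 'H']
both_h = [o for o in first_h if o[1] == 'H']

p_first_h = Fraction(len(first_h), len(outcomes))   # P(X_1 = H)
p_both_h = Fraction(len(both_h), len(outcomes))     # P(X_1 = H and X_2 = H)
p_second_given_first = p_both_h / p_first_h         # P(X_2 = H | X_1 = H)

print(p_first_h, p_both_h, p_second_given_first)    # 1/2 5/12 5/6
```

Using `Fraction` rather than floats keeps the result directly comparable with the hand calculation.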

Now let's look at this using Bayes' formula.

## Bayes Rule
Allows us to compute $P(B|A)$ using information about $P(A|B)$

$$P(B|A) = \frac{P(A|B)P(B)}{P(A)}$$

### Proof (remember this if nothing else):  

The probability for the intersection can be obtained from either end of the equation below:

$$P(B|A) \cdot P(A) = P(A \cap B) = P(A|B) \cdot P(B)$$

### Why this is helpful: it is often easier to compute conditional probabilities in one direction, when what you really want is the conditional probability in the other, "hard" direction

## Bayes Rule
Bayes' Rule helps when all you know about $P(B)$ is an initial guess, the **prior**, and you want to work out how additional evidence $A$ alters that guess.

It is usually easier to reason from $B$ to $A$, which gives the **likelihood** $P(A|B)$, than to compute directly what you actually want, the **posterior probability** $P(B|A)$:

$$P(B|A) = \frac{P(A|B)P(B)}{P(A)}$$

Here, the denominator $P(A)$ might seem hard to compute, but it can be obtained using the **Law of Total Probability**.


# Law of Total Probability (LOTP)

If $\{B_i\}$ is a partition of a sample space $X$, meaning $\cup_i B_i = X$ and $B_i \cap B_j=\emptyset$ for all $i \ne j$,

then for any event $A \subset X$  

$ P(A) = \sum_i P(A\cap B_i) $

or

$ P(A) = \sum_i P(A|B_i) P(B_i)$  
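As a quick illustration, the second form of the law can be evaluated for the three coins, taking the partition to be "which coin was drawn" and $A$ to be "the first flip shows heads". The dictionary names below are assumptions made for this sketch.

```python
from fractions import Fraction

# P(B_i): each coin is equally likely to be drawn
prior = {'HH': Fraction(1, 3), 'HT': Fraction(1, 3), 'TT': Fraction(1, 3)}
# P(A | B_i): chance of heads on one flip of each coin
p_heads_given = {'HH': Fraction(1), 'HT': Fraction(1, 2), 'TT': Fraction(0)}

# P(A) = sum_i P(A | B_i) P(B_i)
p_heads = sum(p_heads_given[coin] * prior[coin] for coin in prior)
print(p_heads)  # 1/2
```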


## Back to Bayes Rule
Assuming a partition $\{B_j\}$, you can thus rewrite the denominator as follows (for any $i$):

$$P(B_i|A) = \frac{P(A|B_i)P(B_i)}{P(A)}$$

$$ = \frac{P(A|B_i)P(B_i)}{\sum_j P(A|B_j) P(B_j)}$$

Now, all of the conditional probabilities go in the 'easy' direction, from the $B_i$ to $A$.
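The partition form of the rule translates almost line for line into code. The helper below is a hypothetical sketch (the name `posterior` and the dict-based interface are choices made here, not part of the lecture), and the numbers in the toy example are made up purely for illustration.

```python
from fractions import Fraction

def posterior(prior, likelihood):
    """Return P(B_i | A) for each B_i, given P(B_i) and P(A | B_i).

    Both arguments are dicts keyed by the same hypothesis labels.
    """
    joint = {b: likelihood[b] * prior[b] for b in prior}   # P(A | B_i) P(B_i)
    evidence = sum(joint.values())                          # P(A), by the LOTP
    if evidence == 0:
        raise ValueError("A has probability zero under this prior")
    return {b: joint[b] / evidence for b in joint}

# Toy example with two hypotheses (illustrative numbers only)
example = posterior(
    prior={'B1': Fraction(1, 4), 'B2': Fraction(3, 4)},
    likelihood={'B1': Fraction(1, 2), 'B2': Fraction(1, 6)},
)
print(example)
```

Here the unlikely hypothesis `B1` explains the evidence three times better than `B2`, which exactly offsets its smaller prior, so both end up with posterior 1/2.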


## Back to the coin problem

Our question asks which of three disjoint events has occurred: whether the coin chosen is $B_1$ (HH), $B_2$ (HT) or $B_3$ (TT).

We want to know about the outcome A = 'the coin comes up heads'

This might seem tricky, but it is easier to reason from $B_i$ to $A$:  
$P(A|B_1) = 1$, $P(A|B_2) = 1/2$, and $P(A|B_3) = 0$

Our prior probability for each coin is $P(B_i) = \frac{1}{3}$.

Plugging this into Bayes' formula gives:

$$P(B_i|A) = \frac{P(A|B_i)P(B_i)}{\sum_j P(A|B_j) P(B_j)} = \frac{P(A|B_i)P(B_i)}{1 \cdot \frac{1}{3} + \frac{1}{2} \cdot \frac{1}{3} + 0 \cdot \frac{1}{3}} = 2 \, P(A|B_i)P(B_i)$$


## Back to the coin problem 
Note that we had already evaluated each possibility for the numerator when computing the denominator.  

It follows that $P(B_1|A) = 2/3$, $P(B_2|A) = 1/3$, and $P(B_3|A) = 0$.

We can now use our *posterior* probabilities of the $B_i$ to calculate the probability of a second H coin flip:

$$P(X_2 = H | X_1 = H) = \sum_i P(H|B_i) P(B_i|A) = \frac{2}{3} \cdot 1 + \frac{1}{3} \cdot \frac{1}{2} + 0 \cdot 0 = \frac{5}{6}$$

This might seem like a long path to a result that took us fewer lines earlier, but now we also have probability estimates for each of the coins.

Further coin flip results would continue to alter these!
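To see how further flips keep shifting the beliefs, here is a minimal sketch that applies Bayes' rule once per observed flip. The function name `update` and the flip sequence `'HHH'` are assumptions chosen for illustration.

```python
from fractions import Fraction

def update(belief, flip):
    """One Bayesian update of the coin beliefs after observing flip ('H' or 'T')."""
    # Likelihood of the flip under each coin = fraction of its sides showing that face
    joint = {coin: Fraction(coin.count(flip), 2) * p for coin, p in belief.items()}
    total = sum(joint.values())
    return {coin: p / total for coin, p in joint.items()}

belief = {'HH': Fraction(1, 3), 'HT': Fraction(1, 3), 'TT': Fraction(1, 3)}
history = []
for flip in 'HHH':
    belief = update(belief, flip)
    # Predictive probability that the next flip is heads, under the updated beliefs
    p_next_heads = sum(Fraction(coin.count('H'), 2) * p for coin, p in belief.items())
    history.append((belief['HH'], belief['HT'], p_next_heads))
    print(flip, belief['HH'], belief['HT'], p_next_heads)
```

After one head the beliefs are 2/3 and 1/3 with a 5/6 chance of another head, matching the result above. Each additional head moves more weight onto the HH coin.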

## 'Reliable' test for rare disease - a famously counterintuitive example

A fairly reliable diagnostic test T exists for a rare disease D. The result of the test is either positive ($T_+$) or negative ($T_-$)

| Event | Probability |
| --------- | ----------- |
| $P(T_+ \mid D)$ | 0.99 |
| $P(T_+ \mid \neg D)$ | 0.05 |
| $P(D)$ | 0.005 |

So for someone who tests positive, what is their probability of having the disease ($ P(D | T_+) $)?

First, give a quick rough answer!  
In particular, are they more likely to have the disease or not?

## 'Reliable' test for rare disease: Rough answer $P(D|T_+) \approx 1/11$
There are two ways to test positive: $T_+ \cap D$ and $T_+ \cap \neg D$.  

The small probabilities gating these are respectively $P(D) = 0.005$ and $P(T_+ | \neg D) = 0.05$.

Because $D$ and $\neg D$ partition the space, Bayes' theorem says:

$$P(D|T_+) = \frac{P(T_+|D)P(D)}{P(T_+|D)P(D) + P(T_+| \neg D)P( \neg D)} \approx \frac{0.005}{0.005 + 0.05} = \frac{1}{11}$$

The quick estimate comes from ignoring the factors close to 1, namely $P(T_+|D) = 0.99$ and $P(\neg D) = 0.995$.

If the test were less reliable ($P(T_+|D) \ll 1$), we would need to keep that factor in the estimate as well.
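For comparison, the exact value can be computed from the three numbers in the table without dropping any factors. This is a small sketch, and the variable names are chosen here for readability.

```python
p_pos_given_d = 0.99       # P(T+ | D)
p_pos_given_not_d = 0.05   # P(T+ | not D)
p_d = 0.005                # P(D)

# Law of Total Probability over the partition {D, not D}
p_pos = p_pos_given_d * p_d + p_pos_given_not_d * (1 - p_d)

p_d_given_pos = p_pos_given_d * p_d / p_pos   # exact Bayes' rule
rough = p_d / (p_d + p_pos_given_not_d)       # the 1/11 shortcut

print(f"exact: {p_d_given_pos:.4f}, rough: {rough:.4f}")
# exact: 0.0905, rough: 0.0909
```

The shortcut is off only in the third decimal place, so the rough answer is good enough to settle the "more likely or not" question.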

## 'Reliable' test for rare disease
This probability update of $D$ goes in the right direction, from $0.005$ to about $0.09$.

Even so, the update may seem surprisingly small: someone who tests positive is still far more likely not to have the disease.

## Bayesian Updating: Accumulation of evidence

In today's pairs sprint, you'll implement a discrete approximation to the following:

![Bayesian updating](images/bayesianUpdate.png)

## Bayesian Updating: Accumulation of evidence (II)

Observe how values ruled out by the data are updated to 0 (e.g. $p=1$ once any tails have been seen).

After many updates, the posterior distribution starts to resemble a normal distribution centered near the estimate given by the method of moments (MOM) or maximum likelihood estimation (MLE).
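One possible shape for such a discrete approximation is sketched below, assuming a uniform prior over a grid of 101 candidate values of $p = P(\text{heads})$. The grid size, the helper name `grid_update` and the flip sequence are all made up for illustration. The sprint may ask for a different interface.

```python
def grid_update(prior, grid, flip):
    """Update a discrete prior over p = P(heads) after one flip ('H' or 'T')."""
    likelihood = [p if flip == 'H' else 1 - p for p in grid]
    unnormalized = [l * w for l, w in zip(likelihood, prior)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]

n = 101
grid = [i / (n - 1) for i in range(n)]   # candidate values p = 0.00, 0.01, ..., 1.00
belief = [1 / n] * n                     # uniform prior over the grid

for flip in 'HHTHHTHH':                  # made-up data: 6 heads, 2 tails
    belief = grid_update(belief, grid, flip)

mode = grid[belief.index(max(belief))]   # most probable value of p
print(mode, belief[0], belief[-1])
```

Both endpoints end up with zero weight because the data contain at least one head and one tail, and the most probable grid value is the observed fraction of heads, 6/8.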

## Frequentist vs. Bayesian

Frequentist probability: **long-run** probability of an outcome

Subjective probability: a measure of the degree of **belief**

Bayesians consider both types

## Frequentist vs. Bayesian

**Experiment 1:**  ![Musician](images/musicMan.png) 

A fine classical musician says he’s able to distinguish Haydn from Mozart.  
Small excerpts are selected at random and played for the musician.
The musician makes 10 correct guesses in exactly 10 trials.

**Experiment 2:**  ![Drunk](images/drunk.png)

A drunken man says he can correctly guess which face a coin will land on while it is still in mid-air.
Coins are tossed and the drunken man shouts out guesses while the coins are in mid-air.
The drunken man correctly guesses the outcomes of all 10 throws.


## Frequentist vs. Bayesian

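Frequentist:  “They’re both so skilled!  I have as much confidence in the musician’s ability to distinguish Haydn and Mozart as I do in the drunk’s ability to predict coin tosses”

Bayesian:  “I’m not convinced by the drunken man…”

The Bayesian approach is to incorporate prior knowledge (perhaps very subjective) into the experimental results:

$$P(\text{psychic} | \text{correct}) = \frac{P(\text{correct} | \text{psychic}) P(\text{psychic})}{P(\text{correct})} \approx \frac{1 \cdot 10^{-5}}{0.5^{10}} \approx 10^{-2}$$

Here $10^{-5}$ is the subjective prior that the man is psychic, and $P(\text{correct}) \approx 0.5^{10}$ because nearly all of the probability of 10 correct calls comes from lucky guessing.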
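The same estimate can be reproduced with the full denominator from the Law of Total Probability, rather than approximating $P(\text{correct})$ by the guessing term alone. This is a sketch using the slide's subjective prior of $10^{-5}$. The variable names are assumptions.

```python
p_psychic = 1e-5                       # subjective prior from the slide
p_correct_given_psychic = 1.0          # a real psychic always calls it right
p_correct_given_guessing = 0.5 ** 10   # 10 lucky guesses in a row

# P(correct) = P(correct | psychic) P(psychic) + P(correct | guessing) P(guessing)
p_correct = (p_correct_given_psychic * p_psychic
             + p_correct_given_guessing * (1 - p_psychic))

p_psychic_given_correct = p_correct_given_psychic * p_psychic / p_correct
print(round(p_psychic_given_correct, 4))  # 0.0101
```

Ten correct calls raise the belief by a factor of about a thousand, yet it still sits near 1% because the prior was so small.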




## The fierce ideological war between Bayesians and 'Frequentists': XKCD#1132
    
![xkcd Bayes vs Freq](images/xkcd1132.png)    

## The fierce ideological war between Bayesians and 'Frequentists': XKCD#1132
    
![xkcd Bayes vs Freq](images/xkcd1132b.png)    

## The Monty Hall problem: an interesting use for Bayesian logic

Setup: three doors. Behind two, there's a goat. Behind one, there's a car.  

After you pick one, the game show host will open another door with a goat.  

You are then allowed to change your choice of door.  

**Question**: should you?

![Monty Hall game](images/montyHall.png)
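One way to explore the question empirically is to simulate many rounds under each strategy. The sketch below assumes the host always opens a goat door other than your pick and chooses at random when two such doors are available. The function name `play`, the seed and the number of trials are arbitrary choices.

```python
import random

def play(switch, rng):
    """Play one round; return True if the final pick hides the car."""
    doors = [0, 1, 2]
    car = rng.choice(doors)
    pick = rng.choice(doors)
    # The host opens a goat door that is neither the contestant's pick nor the car
    opened = rng.choice([d for d in doors if d != pick and d != car])
    if switch:
        # Move to the one remaining closed door
        pick = next(d for d in doors if d != pick and d != opened)
    return pick == car

rng = random.Random(0)   # fixed seed so the run is repeatable
trials = 10_000
stay = sum(play(False, rng) for _ in range(trials)) / trials
swap = sum(play(True, rng) for _ in range(trials)) / trials
print(f"stay: {stay:.3f}, switch: {swap:.3f}")
```

With this many rounds the two win rates settle close to 1/3 for staying and 2/3 for switching, which is the split that Bayes' rule gives when conditioning on the door the host opened.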

## Frequentist vs. Bayesian: Closing thoughts  

* Frequentist tools certainly can be and are misused
* Probabilities may change faster than long-run models can be assembled

***However***

* Bayesians use frequentist tools too.
* Frequentist reasoning is nearly as entrenched as phone lines, so it's useful to master
* The supposed 'war' is more of a tongue-in-cheek way to promote discussion

#  Summary


* Frequentist vs. Bayesian approaches
* Work with Bayes rule to compute posterior probability
* Prior, likelihood and posterior distributions

![Losing Monty Hall game](images/wonAGoat.png)