# Advanced Certification Program in Computational Data Science

##  A program by IISc and TalentSprint

### Case Study (ungraded): Odds and Bayes Factors

## Learning Objectives

At the end of the experiment, you will be able to:

* understand the odds of an event
* understand the Bayes factor
* compute the Bayes factor without first computing the posterior odds

### Information

The odds of an event is the ratio of the probability that the event occurs to the probability that the event does not occur:

$$odds(A) = \frac{P(A)}{P(A^c)} = \frac{P(A)}{1 - P(A)}$$

 
In many situations (e.g. gambling) odds are reported as odds against $A$, that is, the odds of $A^c:\ P(A^c)/P(A).$ 

The probability of an event can be obtained from its odds:

$$P(A) = \frac{odds(A)}{1 + odds(A)}$$
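The two conversions above are easy to mix up, so a minimal sketch may help. The helper names `odds_from_prob` and `prob_from_odds` are illustrative only and are not used elsewhere in this notebook.

```python
def odds_from_prob(p):
    """Odds in favor of an event with probability p (requires 0 <= p < 1)."""
    if not 0 <= p < 1:
        raise ValueError("p must satisfy 0 <= p < 1")
    return p / (1 - p)


def prob_from_odds(odds):
    """Probability of an event whose odds in favor are `odds` (odds >= 0)."""
    if odds < 0:
        raise ValueError("odds must be non-negative")
    return odds / (1 + odds)


# A probability of 0.75 corresponds to odds of 3 in favor (3 to 1).
example_odds = odds_from_prob(0.75)
# Converting the odds back should recover the original probability.
example_prob = prob_from_odds(example_odds)
print(example_odds, example_prob)
```

The odds against an event are simply the reciprocal of the odds in favor, so `1 / odds_from_prob(p)` gives $P(A^c)/P(A)$.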

### Import required packages

In [None]:
from math import factorial

**Example 1:** The ELISA test for HIV was widely used in the mid-1990s for screening blood donations. As with most medical diagnostic tests, the ELISA test is not perfect. If a person actually carries the HIV virus, experts estimate that this test gives a positive result $97.7\%$ of the time. (This number is called the sensitivity of the test.) If a person does not carry the HIV virus, ELISA gives a negative (correct) result $92.6\%$ of the time (the specificity of the test). Estimates at the time were that $0.5\%$ of the American public carried the HIV virus (the base rate).

Suppose that a randomly selected American tests positive; we are interested in the conditional probability that the person actually carries the virus.

Let $H$ denote the event that the selected person actually carries HIV,

$E$ denote the event that the selected person tests positive on the ELISA test,

$H^c$ denote the event that the person does not carry HIV, and 

$E^c$ denote the event that the person tests negative on the ELISA test.

In [None]:
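# According to the problem statement.
# Note: despite the "_and_" in their names, the four test-accuracy variables
# below hold conditional probabilities (e.g. P(E|H)), not joint probabilities.
p_H = 0.005                     # P(H): base rate
p_Hc = 1 - p_H                  # P(H^c)
p_H_and_E = 0.977               # P(E|H): sensitivity
p_H_and_Ec = 1 - p_H_and_E      # P(E^c|H): false negative rate
p_Hc_and_Ec = 0.926             # P(E^c|H^c): specificity
p_Hc_and_E = 1 - p_Hc_and_Ec    # P(E|H^c): false positive rate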

In [None]:
# probability that a person tests positive: P(E) = P(H)P(E|H) + P(H^c)P(E|H^c)
p_E = (p_H * p_H_and_E) + (p_Hc * p_Hc_and_E)
p_E

In [None]:
# Bayes rule: P(H|E) = P(H) P(E|H) / P(E)
p_H_given_E = (p_H * p_H_and_E) / p_E
p_H_given_E
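The calculation in Example 1 can also be wrapped in a small reusable function. This is only a sketch: the function and argument names (`prob_positive`, `posterior_given_positive`, `base_rate`, `sensitivity`, `specificity`) are assumptions introduced here for illustration, not part of the original notebook.

```python
def prob_positive(base_rate, sensitivity, specificity):
    """P(E): total probability of a positive test result."""
    false_positive_rate = 1 - specificity  # P(E | H^c)
    return base_rate * sensitivity + (1 - base_rate) * false_positive_rate


def posterior_given_positive(base_rate, sensitivity, specificity):
    """P(H | E): probability of carrying the virus given a positive test."""
    return base_rate * sensitivity / prob_positive(base_rate, sensitivity, specificity)


# ELISA numbers from Example 1: base rate 0.5%, sensitivity 97.7%, specificity 92.6%.
elisa_posterior = posterior_given_positive(0.005, 0.977, 0.926)
print(round(elisa_posterior, 4))
```

Because the base rate is so low, false positives from the large uninfected group outnumber true positives, which is why the posterior probability stays small even for a fairly accurate test.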

**Example 2:** Continuing Example 1

**(i)** Calculate the *prior odds* of a randomly selected American having the HIV virus, before taking an ELISA test.

- If $P(H)$ is the prior probability of $H$, the prior odds (in favor) of $H$ are $P(H)/P(H^c)$.

In [None]:
prior_odds = p_H / (1 - p_H)
prior_odds

**(ii)** Calculate the *posterior odds* of a randomly selected American having the HIV virus, given a positive test result.

- If $P(H|E)$ is the posterior probability of $H$ given $E$, the posterior odds (in favor) of $H$ given $E$ are $P(H|E)/P(H^c|E)$.

In [None]:
# Bayes rule: P(H^c|E) = P(H^c) P(E|H^c) / P(E)
p_Hc_given_E = (p_Hc * p_Hc_and_E) / p_E
p_Hc_given_E

In [None]:
posterior_odds = p_H_given_E / p_Hc_given_E
posterior_odds

**(iii)** By what factor have the odds of carrying HIV increased, given a positive test result, as compared to before the test? This is called the *Bayes factor*.

- The **Bayes factor (BF)** is defined to be the ratio of the posterior odds to the prior odds 

$$BF= \frac{posterior\ odds}{prior\ odds} = \frac{P(H|E)/P(H^c|E)}{P(H)/P(H^c)}$$


In [None]:
bayes_factor = posterior_odds / prior_odds
bayes_factor

**(iv)** Suppose you were given the prior odds and the Bayes factor. How could you compute the posterior odds?

The odds form of Bayes rule says

$$posterior\ odds = prior\ odds\ ×\ Bayes\ factor$$

$$\frac{P(H|E)}{P(H^c|E)} = \frac{P(H)}{P(H^c)} × BF$$

**(v)** Compute the ratio of the likelihoods of testing positive, for those who carry HIV and for those who do not carry HIV. What do you notice?

Apply Bayes rule to $P(H|E)$ and $P(H^c|E)$

$$\begin{aligned}
\frac{P(H|E)}{P(H^c|E)} &= \frac{P(E|H)P(H)/P(E)}{P(E|H^c)P(H^c)/P(E)} \\
&= \frac{P(H)}{P(H^c)}\ × \frac{P(E|H)}{P(E|H^c)}
\end{aligned}$$

$$posterior\ odds = prior\ odds\ × \frac{P(E|H)}{P(E|H^c)}$$

Therefore, the Bayes factor for hypothesis $H$ given evidence $E$ can be calculated as the *ratio of the likelihoods*

$$BF = \frac{P(E|H)}{P(E|H^c)}$$

Notice that the Bayes factor can be computed directly from the likelihoods, without first computing posterior probabilities or odds.
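As a quick numerical check of this result, the sketch below computes the Bayes factor for the ELISA example in two ways: from the likelihoods alone, and as the ratio of posterior odds to prior odds. The variable names are chosen here for illustration and deliberately differ from those used in the notebook cells.

```python
sensitivity = 0.977   # P(E | H)
specificity = 0.926   # P(E^c | H^c)
base_rate = 0.005     # P(H)

# Route 1: ratio of likelihoods, P(E|H) / P(E|H^c). No posterior is needed.
bf_from_likelihoods = sensitivity / (1 - specificity)

# Route 2: posterior odds divided by prior odds.
p_pos = base_rate * sensitivity + (1 - base_rate) * (1 - specificity)  # P(E)
post_h = base_rate * sensitivity / p_pos                               # P(H|E)
bf_from_odds = (post_h / (1 - post_h)) / (base_rate / (1 - base_rate))

print(bf_from_likelihoods, bf_from_odds)
```

Both routes should agree up to floating-point rounding, at roughly $13.2$. Note that the first route never uses the base rate, so the Bayes factor depends only on the properties of the test.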

**Odds form of Bayes rule**

$$\frac{P(H|E)}{P(H^c|E)} = \frac{P(H)}{P(H^c)} × \frac{P(E|H)}{P(E|H^c)}$$

$$posterior\ odds = prior\ odds × Bayes\ factor$$

**Example 3:** Now suppose that $5\%$ of individuals in a high-risk group carry the HIV virus. Consider a randomly selected person from this group who takes the test. Suppose the sensitivity and specificity of the test are the same as in Example 1.

**(i)** Compute and interpret the prior odds that a person carries HIV.

In [None]:
p_H_ = 0.05
p_Hc_ = 1 - p_H_
prior_odds_ = p_H_ / p_Hc_
prior_odds_

In [None]:
1 / prior_odds_

Therefore, a person in this group is $19$ times more likely to not carry HIV than to carry HIV.

**(ii)** Use the odds form of Bayes rule to compute the posterior odds that the person carries HIV given a positive test, and interpret the posterior odds.

The *posterior odds* are the product of the *prior odds* and the *Bayes factor*. The Bayes factor is the ratio of the likelihoods. Since the *sensitivity* and *specificity* are the same as in the previous example, the likelihoods are the same, and the Bayes factor is the same.

In [None]:
bayes_factor

In [None]:
posterior_odds_ = prior_odds_ * bayes_factor
posterior_odds_

In [None]:
1 / posterior_odds_

Given a positive test, a person in this group is $1.44$ times more likely to not carry HIV than to carry HIV.

**(iii)** Use the posterior odds to compute the posterior probability that the person carries HIV given a positive test.

The posterior odds are the ratio of the posterior probabilities, $P(H|E)/P(H^c|E)$, so the posterior probability can be recovered as odds$/(1 + $odds$)$:

In [None]:
p_H_given_E_ = posterior_odds_ / (1 + posterior_odds_)
p_H_given_E_
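One way to gain confidence in the odds-form calculation is to compare it with Bayes rule applied directly. The following sketch does both for the high-risk group; the variable names (`high_risk_rate`, `prob_via_odds`, `prob_direct`) are assumptions made for this illustration.

```python
sens, spec = 0.977, 0.926   # same test characteristics as Example 1
high_risk_rate = 0.05       # P(H) in the high-risk group

# Odds form: prior odds times the likelihood ratio, then convert to a probability.
odds_route = (high_risk_rate / (1 - high_risk_rate)) * (sens / (1 - spec))
prob_via_odds = odds_route / (1 + odds_route)

# Direct form: Bayes rule with the law of total probability in the denominator.
prob_direct = (high_risk_rate * sens) / (
    high_risk_rate * sens + (1 - high_risk_rate) * (1 - spec)
)

print(prob_via_odds, prob_direct)
```

Both values should come out near $0.41$, far higher than in Example 1, even though the test (and hence the Bayes factor) is unchanged. Only the prior differs.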

**Example 4:** Most people are right-handed, and even the right eye is dominant for most people. This preference for the right side manifests itself in other ways as well. In particular, a researcher investigated if people have a tendency to slightly lean their heads to the right when talking. The researcher observed talking pairs in public places and recorded whether the pair leaned their heads to the right or left. (We’ll assume this represents a randomly selected, representative sample of talking pairs.)

The parameter of interest in this study is the population proportion of talking pairs who lean their heads to the right. Denote this unknown parameter $θ$. For now we’ll only consider two potential values for $θ$: $1/2$ or $2/3$. We could write this as a pair of competing hypotheses.

$$H_1 = \{\theta = 1/2\}$$

$$H_2 = \{\theta = 2/3\}$$

**(i)** Let $Y$ be the number of pairs in a random sample of $n$ talking pairs that lean their heads to the right. What is the distribution of $Y$? Identify it by name and its relevant parameters.

$Y$, the number of pairs in a random sample of $n$ talking pairs that lean their heads to the right, has a *Binomial distribution* with parameters $n$ and $θ$. The probability that $y$ pairs in the sample lean right is

$$P(Y = y) = \frac{n!}{y!(n-y)!}\theta^y(1-\theta)^{n-y}, \quad y = 0, 1, 2, \ldots, n$$
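The binomial formula above translates directly into code. The sketch below uses `math.comb` (available in Python 3.8 and later) for the binomial coefficient instead of explicit factorials; the function name `binomial_pmf` is an assumption introduced for this illustration.

```python
from math import comb


def binomial_pmf(y, n, theta):
    """P(Y = y) for Y ~ Binomial(n, theta)."""
    if not 0 <= y <= n:
        return 0.0  # outcomes outside 0..n have probability zero
    return comb(n, y) * theta**y * (1 - theta) ** (n - y)


# Example: probability of exactly 5 successes in 10 fair trials.
p_five_of_ten = binomial_pmf(5, 10, 0.5)

# Sanity check: the probabilities over y = 0, ..., n should sum to 1.
total_prob = sum(binomial_pmf(k, 10, 0.5) for k in range(11))
print(p_five_of_ten, total_prob)
```

Summing the probabilities over all possible values of $y$ is a cheap way to catch mistakes in the exponents or the coefficient.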

**(ii)** Suppose that the researcher observed $12$ talking pairs, $8$ of whom leaned their heads to the right (a proportion of $8/12=0.667$). Compute the relevant likelihoods and the corresponding Bayes factor.

The evidence is the event of observing $8$ pairs leaning to the right in a sample of $12$, that is, $E = \{Y = 8\}$ where $Y$ has a Binomial($12, θ$) distribution. If $H_1$ is true, $Y$ has a Binomial($12, 1/2$) distribution, so the likelihood is

$$P(E|H_1) = P(Y = 8|\theta = 1/2) = \frac{12!}{8!(12 - 8)!}(1/2)^8(1-1/2)^{12-8} = 0.121,$$

In [None]:
n = 12
y = 8
theta = 1/2
P_E_given_H1 = (factorial(n)/(factorial(n - y)*factorial(y)))*(theta**y)*((1 - theta)**(n-y))
P_E_given_H1

If $H_2$ is true, $Y$ has a Binomial($12, 2/3$) distribution, so the likelihood is

$$P(E|H_2) = P(Y = 8|\theta = 2/3) = \frac{12!}{8!(12 - 8)!}(2/3)^8(1-2/3)^{12-8} = 0.238,$$

In [None]:
n = 12
y = 8
theta = 2/3
P_E_given_H2 = (factorial(n)/(factorial(n - y)*factorial(y)))*(theta**y)*((1 - theta)**(n-y))
P_E_given_H2

The Bayes factor is

$$BF = \frac{P(E|H_1)}{P(E|H_2)} = \frac{0.1208}{0.2384} \approx 0.507$$

In [None]:
bf = P_E_given_H1 / P_E_given_H2
bf

Observing $8$ pairs leaning right in a sample of $12$ talking pairs is about $2$ times more likely if $θ = 2/3\ (H_2)$ than if $θ = 1/2\ (H_1)$.
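Since both likelihoods share the same binomial coefficient, it cancels in the ratio, leaving $BF = (\theta_1/\theta_2)^y \, ((1-\theta_1)/(1-\theta_2))^{n-y}$. The sketch below evaluates this simplified form; the function name `binomial_bayes_factor` is an assumption introduced here and does not appear in the notebook.

```python
def binomial_bayes_factor(y, n, theta_1, theta_2):
    """Bayes factor for H1: theta = theta_1 versus H2: theta = theta_2, given Y = y.

    The binomial coefficient is identical under both hypotheses, so it cancels
    and only the ratio of the theta terms remains.
    """
    return (theta_1 / theta_2) ** y * ((1 - theta_1) / (1 - theta_2)) ** (n - y)


# 8 of 12 pairs leaning right, comparing theta = 1/2 against theta = 2/3.
bf_closed_form = binomial_bayes_factor(8, 12, 1 / 2, 2 / 3)
print(bf_closed_form)
```

For these values the ratio reduces to $3^{12}/2^{20}$, which should match the Bayes factor computed from the two likelihoods.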

**(iii)** Suppose that our prior belief is that the two hypotheses are equally likely. Determine the posterior probabilities for the two hypotheses.

If the prior probabilities are equal, then the posterior probabilities will be in proportion to the likelihoods. So the posterior probability of $H_2$ will be about $2$ times greater than the posterior probability of $H_1$. In terms of odds: the prior odds of $H_1$ are $0.5/0.5 = 1$, so the posterior odds of $H_1$ given $E$ are $1×0.507 = 0.507$.

$$posterior\ odds = prior\ odds\ ×\ Bayes\ factor$$

$$ = \frac{0.5}{0.5} × 0.507 = 0.507$$

Also, 
$$posterior\ probability = \frac{posterior\ odds}{1 + posterior\ odds}$$

In [None]:
posterior_odds = 1 * bf
posterior_prob_for_H1 = posterior_odds / (1 + posterior_odds)
posterior_prob_for_H1

In [None]:
posterior_prob_for_H2 = 1 - posterior_prob_for_H1
posterior_prob_for_H2

**(iv)** Repeat the previous part but with a prior probability of $0.9$ for $H_1$.

Now the prior odds of $H_1$ are $0.9/0.1 = 9$; the prior probability of $H_1$ is $9$ times greater than the prior probability of $H_2$. The posterior odds given $E$ are $9×0.507 \approx 4.56$; the posterior probability of $H_1$ is $4.56$ times greater than the posterior probability of $H_2$.

In [None]:
posterior_odds = 0.9/0.1 * bf
posterior_odds

In [None]:
posterior_prob_for_H1 = posterior_odds / (1 + posterior_odds)
posterior_prob_for_H1

In [None]:
posterior_prob_for_H2 = 1 - posterior_prob_for_H1
posterior_prob_for_H2

Even though observing $8$ out of $12$ pairs leaning right is more likely if $θ=2/3\ (H_2)$ than if $θ=1/2\ (H_1)$, the posterior probability of $H_1$ is greater than the posterior probability of $H_2$ because of the large discrepancy in the prior probabilities.

**(v)** The full study actually used a sample of $124$ talking pairs, of which $80$ leaned their heads to the right (a proportion of $80/124 = 0.645$). Compute the relevant likelihoods and the corresponding Bayes factor.

Now the evidence is the event of observing $80$ pairs leaning to the right in a sample of $124$, that is, $E = \{Y = 80\}$ where $Y$ has a Binomial($124, θ$) distribution. If $H_1$ is true, $Y$ has a Binomial($124, 1/2$) distribution, so the likelihood is

$$P(E|H_1) = P(Y = 80|\theta = 1/2) = \frac{124!}{80!(124 - 80)!}(1/2)^{80}(1-1/2)^{124-80} = 0.00037,$$

In [None]:
n = 124
y = 80
theta = 1/2
P_E_given_H1 = (factorial(n)/(factorial(n - y)*factorial(y)))*(theta**y)*((1 - theta)**(n-y))
P_E_given_H1

If $H_2$ is true, $Y$ has a Binomial($124, 2/3$) distribution, so the likelihood is

$$P(E|H_2) = P(Y = 80|\theta = 2/3) = \frac{124!}{80!(124 - 80)!}(2/3)^{80}(1-2/3)^{124-80} = 0.0658,$$

In [None]:
n = 124
y = 80
theta = 2/3
P_E_given_H2 = (factorial(n)/(factorial(n - y)*factorial(y)))*(theta**y)*((1 - theta)**(n-y))
P_E_given_H2

The Bayes factor is

$$BF = \frac{P(E|H_1)}{P(E|H_2)} = \frac{0.00037}{0.0658} \approx 0.00566 \approx \frac{1}{176.64}$$

In [None]:
bf_ = P_E_given_H1 / P_E_given_H2
bf_

In [None]:
1 / bf_

Observing $80$ pairs leaning right in a sample of $124$ talking pairs is about $177$ times more likely if $θ=2/3\ (H_2)$ than if $θ=1/2\ (H_1)$.
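The factorial-based cells work here because Python integers have arbitrary precision, but for much larger samples the powers of $θ$ can underflow and the coefficient can exceed the floating-point range. A common workaround, sketched below as an optional alternative rather than the notebook's method, is to work on the log scale using `math.lgamma`, since $\log n! = $ `lgamma(n + 1)`. The function name `binomial_log_pmf` is an assumption introduced for this illustration.

```python
from math import exp, lgamma, log


def binomial_log_pmf(y, n, theta):
    """log P(Y = y) for Y ~ Binomial(n, theta), assuming 0 < theta < 1."""
    # log of the binomial coefficient n! / (y! (n - y)!)
    log_coeff = lgamma(n + 1) - lgamma(y + 1) - lgamma(n - y + 1)
    return log_coeff + y * log(theta) + (n - y) * log(1 - theta)


# Log Bayes factor for H1 (theta = 1/2) versus H2 (theta = 2/3) with 80 of 124.
log_bf_large = binomial_log_pmf(80, 124, 1 / 2) - binomial_log_pmf(80, 124, 2 / 3)

# Exponentiate to recover the Bayes factor and its reciprocal.
print(exp(log_bf_large), exp(-log_bf_large))
```

On the log scale the Bayes factor becomes a difference of log-likelihoods, and the reciprocal is obtained by flipping the sign before exponentiating.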

**(vi)** Suppose that our prior belief is that the two hypotheses are equally likely. Determine the posterior probabilities for the two hypotheses given the data from the sample of $124$ talking pairs.

If the prior probabilities are equal, then the posterior probabilities will be in proportion to the likelihoods. So the posterior probability of $H_2$ will be about $177$ times greater than the posterior probability of $H_1$. In terms of odds: the prior odds of $H_1$ are $0.5/0.5 = 1$, so the posterior odds of $H_1$ given $E$ are $1×0.00566 = 0.00566$.

In [None]:
posterior_odds = 0.5/0.5 * bf_
posterior_prob_for_H1 = posterior_odds / (1 + posterior_odds)
posterior_prob_for_H1

In [None]:
posterior_prob_for_H2 = 1 - posterior_prob_for_H1
posterior_prob_for_H2

**(vii)** Repeat the previous part but with a prior probability of $0.9$ for $H_1$.

Now the prior odds of $H_1$ are $0.9/0.1 = 9$; the prior probability of $H_1$ is $9$ times greater than the prior probability of $H_2$. The posterior odds given $E$ are $9×(1/176.64)=1/19.63$; the posterior probability of $H_2$ is $19.63$ times greater than the posterior probability of $H_1$.

In [None]:
posterior_odds = 0.9/0.1 * bf_
posterior_prob_for_H1 = posterior_odds / (1 + posterior_odds)
posterior_prob_for_H1

In [None]:
posterior_prob_for_H2 = 1 - posterior_prob_for_H1
posterior_prob_for_H2

Even though our prior probability for $H_1$ was very large, the likelihood of the data is so small under $H_1$ compared with $H_2$ that the posterior probability for $H_1$ is small.

**(viii)** Compare the results of the two samples ($n=12$ versus $n=124$). What do you observe about the influence of the prior?

When $n = 12$ and prior probabilities are equal,

| theta | prior | likelihood | product | posterior |
|:--------------|:-----------|:-----------|:-----------|:-----------|
|  $\frac{1}{2}$ |0.5|0.1208|0.0604|0.3364|
|$\frac{2}{3}$|	0.5|0.2384|0.1192|0.6636|
|sum|1.0|NA|0.1796|1.0000|
|ratio|1.0|0.5068|0.5068|0.5068|

When $n = 12$ and prior odds of $H_1$ are $0.9/0.1 = 9$,

| theta | prior | likelihood | product | posterior |
|:--------------|:-----------|:-----------|:-----------|:-----------|
|$\frac{1}{2}$|	0.9|	0.1208|	0.1088|	0.8202|
|$\frac{2}{3}$|	0.1|	0.2384|	0.0238|	0.1798|
|sum|	1.0|	NA|	0.1326|	1.0000|
|ratio|	9.0|	0.5068|	4.5614|	4.5614|

When $n = 124$ and prior probabilities are equal,

| theta | prior | likelihood | product | posterior |
|:--------------|:-----------|:-----------|:-----------|:-----------|
|$\frac{1}{2}$|	0.5|	0.0004|	0.0002|	0.0056|
|$\frac{2}{3}$|	0.5|	0.0658|	0.0329|	0.9944|
|sum|	1.0|	NA|	0.0331|	1.0000|
|ratio|	1.0|	0.0057|	0.0057|	0.0057|

When $n = 124$ and prior odds of $H_1$ are $0.9/0.1 = 9$,

| theta | prior | likelihood | product | posterior |
|:--------------|:-----------|:-----------|:-----------|:-----------|
|$\frac{1}{2}$|	0.9|	0.0004|	0.0003|	0.0485|
|$\frac{2}{3}$|	0.1|	0.0658|	0.0066|	0.9515|
|sum|	1.0|	NA|	0.0069|	1.0000|
|ratio|	9.0|	0.0057|	0.0510|	0.0510|

From the above tables we can see that the prior had much more influence with the smaller sample size. When the sample size was large, the data, represented by the likelihoods, had much more weight in determining the posterior probabilities.
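The four tables can be reproduced with a short "Bayes table" routine: multiply each prior by its likelihood, then divide by the sum of the products. The sketch below is one possible way to do this; the names `bayes_table`, `likelihoods_for`, and `table_results` are assumptions introduced for illustration.

```python
from math import comb


def bayes_table(priors, likelihoods):
    """Return (products, posteriors) for a discrete set of hypotheses."""
    products = [prior * like for prior, like in zip(priors, likelihoods)]
    total = sum(products)  # normalizing constant, P(E)
    posteriors = [prod / total for prod in products]
    return products, posteriors


def likelihoods_for(y, n, thetas=(1 / 2, 2 / 3)):
    """Binomial likelihood of y successes in n trials under each theta."""
    return [comb(n, y) * t**y * (1 - t) ** (n - y) for t in thetas]


table_results = {}
for n_pairs, y_right in [(12, 8), (124, 80)]:
    for prior_h1 in (0.5, 0.9):
        priors = [prior_h1, 1 - prior_h1]
        _, post = bayes_table(priors, likelihoods_for(y_right, n_pairs))
        table_results[(n_pairs, prior_h1)] = post
        print(n_pairs, prior_h1, [round(v, 4) for v in post])
```

Each printed row gives the posterior probabilities of $H_1$ and $H_2$ for one combination of sample size and prior, which can be compared against the posterior columns of the tables above.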