In [1]:
import sympy as sp

def factorial(n):
    
    if n<1:
        return 1
    else:
        return n*factorial(n-1)
    
def binomial(n, m):
    
    # choosing fewer than 0 or more than n items is impossible
    if m<0:
        return 0
    elif m>n:
        return 0
    else:  
        return factorial(n)/( factorial(n-m)*factorial(m) )

## Marginal Probability and Density

1. Let $Y_1$ and $Y_2$ be discrete random variables with joint probability function $p$. Then the *marginal probability functions* of $Y_1$ and $Y_2$ are

$$ p_1(y_1) = \sum_{y_2} p(y_1, y_2) \quad\mbox{and}\quad p_2(y_2) = \sum_{y_1} p(y_1, y_2) $$

2. Let $Y_1$ and $Y_2$ be continuous random variables with joint PDF $f$, then the *marginal density functions* of $Y_1$ and $Y_2$ are

$$ f_1(y_1) = \int_{-\infty}^\infty f(y_1, y_2) dy_2 \quad\mbox{and}\quad f_2(y_2) = \int_{-\infty}^\infty f(y_1, y_2) dy_1 $$

## Example 

A congressional committee formed of 4 democrats, 3 republicans, and 1 independent is forming a subcommittee of three people to work on some legislation. Suppose the subcommittee is selected completely at random, with each member equally likely to be chosen. Let $Y_1$ be the number of democrats on the subcommittee and $Y_2$ the number of republicans.

Find the marginal probabilities for $Y_1$ and $Y_2$.  


In [2]:
def P(y1, y2):

  if y1+y2 < 2:
    return 0
  elif y1+y2 > 3:
    return 0
  else:
    return binomial(4, y1)*binomial(3, y2)*binomial(1, 3-(y1+y2))/binomial(8, 3)

In [7]:
P(1, 0), P(1, 1), P(1, 2), P(1, 3)

(0, 0.21428571428571427, 0.21428571428571427, 0)

In [9]:
# marginal probability p1(2): sum the joint probabilities P(2, y2) over y2
sum( [ P(2, y) for y in range(4) ] )

0.4285714285714286

In [10]:
p1 = {}

for y1 in range(4):
  p1[y1] = sum( [ P(y1, y2) for y2 in range(4) ] )

p2 = {}
for y2 in range(4):
  p2[y2] = sum( [ P(y1, y2) for y1 in range(4) ] )

In [11]:
p1

{0: 0.07142857142857142,
 1: 0.42857142857142855,
 2: 0.4285714285714286,
 3: 0.07142857142857142}

In [12]:
p2

{0: 0.17857142857142855,
 1: 0.5357142857142857,
 2: 0.26785714285714285,
 3: 0.017857142857142856}
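
As a sanity check on the marginals above, the following sketch rebuilds the joint probabilities with exact fractions and compares each marginal with the hypergeometric formula for drawing 3 members out of 8. It uses `math.comb` and `fractions.Fraction` from the standard library rather than the `binomial` helper defined earlier; the names `joint`, `marg1`, and `marg2` are introduced here purely for illustration.

```python
from fractions import Fraction
from math import comb

def joint(y1, y2):
    """Exact P(Y1 = y1, Y2 = y2) for 3 seats drawn from 4 D, 3 R, 1 I."""
    ind = 3 - y1 - y2  # seats left over for the single independent
    if ind < 0 or ind > 1:
        return Fraction(0)
    return Fraction(comb(4, y1) * comb(3, y2) * comb(1, ind), comb(8, 3))

marg1 = {y1: sum(joint(y1, y2) for y2 in range(4)) for y1 in range(4)}
marg2 = {y2: sum(joint(y1, y2) for y1 in range(4)) for y2 in range(4)}

# Each marginal is a probability distribution, so it must sum to 1.
assert sum(marg1.values()) == 1 and sum(marg2.values()) == 1

# Each marginal should be hypergeometric: k from the group, 3 - k from the rest.
assert all(marg1[k] == Fraction(comb(4, k) * comb(4, 3 - k), comb(8, 3)) for k in range(4))
assert all(marg2[k] == Fraction(comb(3, k) * comb(5, 3 - k), comb(8, 3)) for k in range(4))

print(marg1)
print(marg2)
```

Working with `Fraction` avoids the small floating-point differences visible in the decimal outputs, so the comparisons can use exact equality.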

## Conditional Probability

This brings us to conditional probability. Suppose $Y_1$ and $Y_2$ are discrete random variables with joint probability function $p$. The *conditional probability that $Y_1 = y_1$ given that $Y_2 = y_2$* is

$$ p(y_1 | y_2) = P(Y_1 = y_1 | Y_2 = y_2) = \frac{P(Y_1 = y_1, Y_2 = y_2)}{P(Y_2 = y_2)} = \frac{p(y_1, y_2)}{p_2(y_2)} $$

provided that $P(Y_2 = y_2) > 0$. Note that the denominator is the marginal probability function of $Y_2$.


### Example

We roll two dice and let $Z_1$ be the value of the first die, and $Z_2$ the sum of the two dice. 

- What is the probability that the first die is a 1 given that the sum on the two dice is 2?

- What is the probability that the first die is a 1 given that the sum on the two dice is 7?

In [None]:
# The only way to roll a sum of 2 is (1, 1), so P( Z_1 = 1 | Z_2 = 2) = 1

In [13]:
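# Six equally likely rolls give a sum of 7: (6, 1), (1, 6), (5, 2), (2, 5), (4, 3), (3, 4).
# Only (1, 6) has a 1 on the first die, so P( Z_1 = 1 | Z_2 = 7) is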

1/6

0.16666666666666666
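
Both answers can be confirmed by brute-force counting over the 36 equally likely rolls. The sketch below is one way to do this; the helper name `cond_first_given_sum` is made up for this illustration.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes (first die, second die).
outcomes = list(product(range(1, 7), repeat=2))

def cond_first_given_sum(z1, z2):
    """P(Z1 = z1 | Z2 = z2), computed by counting outcomes (z2 between 2 and 12)."""
    given = [(a, b) for a, b in outcomes if a + b == z2]
    both = [(a, b) for a, b in given if a == z1]
    return Fraction(len(both), len(given))

print(cond_first_given_sum(1, 2))  # 1
print(cond_first_given_sum(1, 7))  # 1/6
```

Because all outcomes are equally likely, the ratio of counts equals the ratio of probabilities in the definition of $p(y_1 | y_2)$.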

### Example

Consider the subcommittee being formed above. What is the probability distribution for the number of democrats on the subcommittee given that 1 republican is on the subcommittee?

In [17]:
P(0, 1)/p2[1], P(1, 1)/p2[1], P(2, 1) / p2[1], P(3, 1)/p2[1]

(0.0, 0.39999999999999997, 0.6000000000000001, 0.0)
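
One way to double-check these values: given that exactly 1 republican is on the subcommittee, the other 2 seats are a random pair drawn from the 5 non-republicans (4 democrats and 1 independent). The sketch below computes the conditional distribution from that reduced pool with exact fractions; the name `cond` is introduced only for this illustration.

```python
from fractions import Fraction
from math import comb

cond = {}
for k in range(4):
    ind = 2 - k  # seats left for the independent after k democrats
    if ind < 0:
        cond[k] = Fraction(0)  # 3 democrats cannot fit in 2 remaining seats
    else:
        # math.comb(1, 2) is 0, so k = 0 correctly gets probability 0
        cond[k] = Fraction(comb(4, k) * comb(1, ind), comb(5, 2))

assert sum(cond.values()) == 1
print(cond)
```

The exact values $2/5$ and $3/5$ agree with the floating-point results `0.39999999999999997` and `0.6000000000000001` up to rounding.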

### Conditional Probability for Continuous Random Variables

Suppose $Y_1$ and $Y_2$ are continuous random variables with a joint PDF $f(y_1, y_2)$. Then, wherever the marginal density in the denominator is positive, the *conditional density functions* are given by

$$ f(y_1 | y_2) = \frac{ f(y_1, y_2) }{f_2(y_2) } \quad\mbox{and}\quad f(y_2|y_1) = \frac{f(y_1, y_2)}{f_1(y_1)} $$

### Example 

Consider the random variables $Y_1$ and $Y_2$ with joint density given by:

$$ f(y_1, y_2) = \left\{ \begin{matrix} 2 & 0 \leq y_2 \leq y_1 \leq 1 \\ 0 & \mbox{otherwise} \end{matrix}\right. $$

Find the probability that $Y_1 \leq \frac12$ given that $Y_2 \leq \frac34$.

In [18]:
# First find the marginal f2(y2) = int f(y1, y2) dy1.
# The density is 2 only where y2 <= y1 <= 1, so integrate y1 from y2 to 1.
y1 = sp.Symbol('y1')
y2 = sp.Symbol('y2')

sp.integrate(2, (y1, y2, 1))

# valid for 0 <= y2 <= 1

-2*y2 + 2

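In [19]:
# P(Y1 <= 1/2 and Y2 <= 3/4): integrate the joint density f = 2 over
# 0 <= y2 <= y1 <= 1/2 (the bound y2 <= 3/4 holds automatically there)
joint = sp.integrate(2, (y2, 0, y1), (y1, 0, sp.Rational(1, 2)))
joint

1/4

In [20]:
# Divide by P(Y2 <= 3/4), the integral of the marginal f2(y2) = 2 - 2*y2
joint / sp.integrate(2 - 2*y2, (y2, 0, sp.Rational(3, 4)))

4/15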
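
As a supplementary check on this example, the sketch below builds the conditional density $f(y_1 | y_2) = f(y_1, y_2) / f_2(y_2)$ symbolically and verifies that it integrates to 1 over $y_1$ for a fixed $y_2$. The variable names `f_joint`, `f_cond`, and `total` are chosen for illustration, and the symbols are declared positive as a simplifying assumption.

```python
import sympy as sp

y1, y2 = sp.symbols('y1 y2', positive=True)

f_joint = sp.Integer(2)                   # joint density on 0 <= y2 <= y1 <= 1
f2 = sp.integrate(f_joint, (y1, y2, 1))   # marginal of Y2: 2 - 2*y2
f_cond = sp.simplify(f_joint / f2)        # f(y1 | y2) on y2 <= y1 <= 1

# A conditional density must integrate to 1 over y1 for each fixed y2 in [0, 1).
total = sp.simplify(sp.integrate(f_cond, (y1, y2, 1)))
print(f_cond, total)
```

The conditional density does not depend on $y_1$, which says that, given $Y_2 = y_2$, the variable $Y_1$ is uniformly distributed on the interval from $y_2$ to 1.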

### Example

Back to our example from the beginning of class. Suppose we are testing our virus test on 1000 people and we find the following:

|     | Infected | Not Infected | 
| --- | --- | --- |
| Tested Positive | 120 | 10 | 
| Tested Negative | 30  | 840 | 

Note that in this example what we really would mean by *infected* is *showing symptoms*, *hospitalized with symptoms*, or *tested positive with some other test*.

How likely is it that someone who tested negative is in fact infected? (What we really mean is *how likely is it for someone from our study*; we will come back to how this data can be used to extrapolate to the broader application of this test.)

In [21]:
# Marginal Probability they tested negative
870 /1000 

0.87

In [22]:
# Probability they tested negative and are infected
30 / 1000

0.03

In [24]:
# Probability they are infected given that they tested negative

0.03 / 0.87

0.034482758620689655
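
The same calculation can be organized so that it works directly from the table of counts. This is only a sketch: the dictionary layout and the helper name `conditional` are assumptions made for illustration, not part of the original notebook.

```python
from fractions import Fraction

# Study counts, keyed by (test result, true status).
counts = {
    ("positive", "infected"): 120,
    ("positive", "not infected"): 10,
    ("negative", "infected"): 30,
    ("negative", "not infected"): 840,
}
total = sum(counts.values())

def conditional(status, result):
    """P(status | test result): joint probability over marginal probability."""
    joint = Fraction(counts[(result, status)], total)
    marginal = Fraction(sum(n for (r, _), n in counts.items() if r == result), total)
    return joint / marginal

print(conditional("infected", "negative"))         # 1/29
print(float(conditional("infected", "negative")))
```

Keeping the joint and marginal probabilities as separate steps mirrors the definition $p(y_1 | y_2) = p(y_1, y_2) / p_2(y_2)$, even though the factor of 1000 cancels in the end.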

# Independent Random Variables

The question of whether two random variables are independent has come up a few times already in class. Let's make it formal now.

## Definition

Let $Y_1$ and $Y_2$ be two jointly distributed random variables with marginal cumulative distribution functions $F_1(y_1)$ and $F_2(y_2)$. Then the two variables are *independent* if, for every pair $(y_1, y_2)$, the joint distribution function satisfies

$$ F(y_1, y_2) = F_1(y_1) F_2(y_2) $$

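If $Y_1$ and $Y_2$ are not independent they are called *dependent*.

The definition can equivalently be phrased in terms of probability functions for discrete random variables, $p(y_1, y_2) = p_1(y_1) p_2(y_2)$, or in terms of density functions for continuous random variables, $f(y_1, y_2) = f_1(y_1) f_2(y_2)$, in each case for all $(y_1, y_2)$.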

## What does this mean

Two random variables are independent if we can compute the probability of any joint outcome by computing the probability of each variable's outcome separately and then taking the product. *Think dice*

### Example

A class at the university has 15 mathematics majors, 8 software engineering majors, and 3 students from other majors. The instructor is going to choose a team of 5 students for a project. If the students are chosen at random, we let $Y_1$ be the number of mathematics majors and $Y_2$ the number of software engineering majors on the team.

Is $Y_1$ independent of $Y_2$?

In [33]:
def P(y1, y2):

  if y1+y2 < 2:
    return 0
  elif y1+y2 > 5:
    return 0 
  else:
    return binomial(15, y1)*binomial(8, y2) * binomial(3, 5- y1-y2)/ binomial(26, 5)

In [34]:
p1 = {}

for y1 in range(6):
  p1[y1] = sum( [ P(y1, y2) for y2 in range(6)] )

p2 = {}
for y2 in range(6):
  p2[y2] = sum( [ P(y1, y2) for y1 in range(6) ] )

In [35]:
p1

{0: 0.007023411371237458,
 1: 0.07525083612040134,
 2: 0.2633779264214047,
 3: 0.3804347826086957,
 4: 0.2282608695652174,
 5: 0.04565217391304348}

In [36]:
p2

{0: 0.13025235633931287,
 1: 0.37214958954089394,
 2: 0.3473396169048343,
 3: 0.13025235633931287,
 4: 0.019154758285193068,
 5: 0.0008513225904530252}

In [38]:
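# If Y1 and Y2 were independent these two values would agree; they differ, so Y1 and Y2 are dependent
P(0, 3), p1[0]*p2[3]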

(0.002553967771359076, 0.0009148158806440034)
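
A single mismatch is enough to rule out independence, but it can be instructive to check every pair. The sketch below repeats the calculation with exact fractions and lists the pairs where the joint probability differs from the product of the marginals. It uses `math.comb` in place of the notebook's `binomial` helper, and the names `joint`, `marg1`, `marg2`, and `violations` are illustrative.

```python
from fractions import Fraction
from math import comb

def joint(y1, y2):
    """Exact P(Y1 = y1, Y2 = y2) for a team of 5 from 15 math, 8 software, 3 other."""
    other = 5 - y1 - y2  # team members from other majors
    if other < 0 or other > 3:
        return Fraction(0)
    return Fraction(comb(15, y1) * comb(8, y2) * comb(3, other), comb(26, 5))

marg1 = {a: sum(joint(a, b) for b in range(6)) for a in range(6)}
marg2 = {b: sum(joint(a, b) for a in range(6)) for b in range(6)}

# Independence needs joint == product of marginals at every (y1, y2).
violations = [(a, b) for a in range(6) for b in range(6)
              if joint(a, b) != marg1[a] * marg2[b]]
print(len(violations), "of 36 pairs violate the product rule")
```

The dependence is also visible without any arithmetic: a team of 5 cannot contain 5 mathematics majors and 5 software engineering majors, so $p(5, 5) = 0$ even though $p_1(5)$ and $p_2(5)$ are both positive.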

### Example

Let two continuous random variables have a joint density function given by:

$$ f(y_1, y_2) = \left\{ \begin{matrix} 2 y_1 & 0 \leq y_1 \leq 1 \quad\mbox{and}\quad 0 \leq y_2 \leq 1 \\ 0 & \mbox{otherwise} \end{matrix} \right. $$

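Are $Y_1$ and $Y_2$ independent?

Show independence by showing that the density factors over a rectangular region: $$ f(y_1, y_2) = g(y_1) h(y_2) $$

Here $g(y_1) = 2 y_1$ on $0 \leq y_1 \leq 1$ and $h(y_2) = 1$ on $0 \leq y_2 \leq 1$ work, so $Y_1$ and $Y_2$ are independent.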
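
The factorization can also be checked against the marginal densities with SymPy. This is a minimal sketch: the symbols are declared nonnegative as a simplifying assumption, and the names `f`, `f1`, and `f2` are chosen for illustration.

```python
import sympy as sp

y1, y2 = sp.symbols('y1 y2', nonnegative=True)

f = 2 * y1                           # joint density on the unit square
f1 = sp.integrate(f, (y2, 0, 1))     # marginal of Y1
f2 = sp.integrate(f, (y1, 0, 1))     # marginal of Y2

# On the unit square the joint density equals the product of the marginals.
print(f1, f2, sp.simplify(f - f1 * f2) == 0)
```

Because the support is the full unit square, the equality $f = f_1 f_2$ holds everywhere, not just at the points where the formula $2 y_1$ applies.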

### Example 

Let two continuous random variables have a joint density function given by:

$$ f(y_1, y_2) = \left\{ \begin{matrix} 2 & 0 \leq y_2 \leq y_1 \leq 1 \\ 0 & \mbox{otherwise} \end{matrix} \right. $$

Are $Y_1$ and $Y_2$ independent?
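
One way to approach this question is to compute both marginals and compare their product with the joint density. The sketch below does that with SymPy; the symbols are declared nonnegative as a simplifying assumption, and `gap` is an illustrative name for the difference between the joint density and the product of the marginals inside the support.

```python
import sympy as sp

y1, y2 = sp.symbols('y1 y2', nonnegative=True)

f = sp.Integer(2)                    # joint density on 0 <= y2 <= y1 <= 1
f1 = sp.integrate(f, (y2, 0, y1))    # marginal of Y1: 2*y1
f2 = sp.integrate(f, (y1, y2, 1))    # marginal of Y2: 2 - 2*y2

# Compare the joint density with the product of marginals inside the support.
gap = sp.simplify(f - f1 * f2)
print(f1, f2, gap)
```

Since `gap` is not identically zero, the density does not equal the product of its marginals and the variables are dependent. The triangular support already signals this: at a point such as $(y_1, y_2) = (\frac14, \frac12)$ the joint density is 0 while both marginals are positive.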

## Independence and Conditional Probability

Independence has a direct consequence for conditional probability. By definition,

$$ p(y_1 | y_2 ) = \frac{ p(y_1, y_2) }{ p_2(y_2) } $$

If $Y_1$ and $Y_2$ are independent, then $p(y_1, y_2) = p_1(y_1) p_2(y_2)$, so

$$ p(y_1 | y_2) = \frac{ p_1(y_1) p_2(y_2) }{ p_2(y_2) } = p_1(y_1) $$

That is, the probability of $Y_1$ conditioned on $Y_2$ is the same as the marginal probability of $Y_1$. In other words, $Y_2$ contributes no information to the probability we assign to $Y_1$.

### Example 

Back to our example from the beginning of class. Suppose we are testing our virus test on 1000 people and we find the following:

|     | Infected | Not Infected | 
| --- | --- | --- |
| Tested Positive | 5 | 120 | 
| Tested Negative | 35  | 840 | 

Show that the test and the infection are independent (and conclude that this is not a very useful test).

In [40]:
# Not infected, given that they test positive

(120 / 1000) / ( 125 / 1000)

0.96

In [41]:
# Marginal probability of not infected: the same value, so the test result carries no information
960 / 1000

0.96
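
The comparison above covers one cell of the table. To confirm independence in full, every cell should equal the product of its row and column marginals. The sketch below checks all four cells with exact fractions; the dictionary layout and the helper names `p_result` and `p_status` are assumptions made for illustration.

```python
from fractions import Fraction

# Study counts, keyed by (test result, true status).
counts = {
    ("positive", "infected"): 5,
    ("positive", "not infected"): 120,
    ("negative", "infected"): 35,
    ("negative", "not infected"): 840,
}
total = sum(counts.values())

def p_result(result):
    """Marginal probability of a test result."""
    return Fraction(sum(n for (r, _), n in counts.items() if r == result), total)

def p_status(status):
    """Marginal probability of an infection status."""
    return Fraction(sum(n for (_, s), n in counts.items() if s == status), total)

# Independence: each joint probability equals the product of its marginals.
independent = all(
    Fraction(n, total) == p_result(r) * p_status(s) for (r, s), n in counts.items()
)
print(independent)  # True
```

Exact fractions matter here: with floating-point division, a product such as `0.125 * 0.04` may differ from `0.005` in the last digit and make an equality test fail spuriously.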