## Joint probability distribution

The joint probability mass function of two discrete random variables $X,Y$ is:

${\displaystyle p_{X,Y}(x,y)=\mathrm {P} (X=x\ \mathrm {and} \ Y=y)}$


or, written in terms of conditional distributions,

${\displaystyle p_{X,Y}(x,y)=\mathrm {P} (Y=y\mid X=x)\cdot \mathrm {P} (X=x)=\mathrm {P} (X=x\mid Y=y)\cdot \mathrm {P} (Y=y)}$


where ${\displaystyle \mathrm {P} (Y=y\mid X=x)}$ is the probability of ${\displaystyle Y=y}$ given that ${\displaystyle X=x}$.
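The factorization above can be checked numerically. The following is a minimal sketch, assuming a small made-up joint pmf on $\{0,1\}\times\{0,1\}$; the table values and the helper names `marginal_x`, `marginal_y`, and `conditional` are illustrative and not part of the original text.

```python
from fractions import Fraction

# Hypothetical joint pmf p_{X,Y}(x, y); the numbers are illustrative only.
joint = {
    (0, 0): Fraction(1, 10),
    (0, 1): Fraction(3, 10),
    (1, 0): Fraction(2, 10),
    (1, 1): Fraction(4, 10),
}

def marginal_x(x):
    """P(X = x): sum the joint pmf over all values of y."""
    return sum(p for (xi, _), p in joint.items() if xi == x)

def marginal_y(y):
    """P(Y = y): sum the joint pmf over all values of x."""
    return sum(p for (_, yj), p in joint.items() if yj == y)

def conditional(joint_value, marginal_value):
    """Conditional probability as joint / marginal (marginal must be positive)."""
    return joint_value / marginal_value

for (x, y), p in joint.items():
    # P(Y=y | X=x) * P(X=x) recovers the joint probability ...
    assert conditional(p, marginal_x(x)) * marginal_x(x) == p
    # ... and so does P(X=x | Y=y) * P(Y=y).
    assert conditional(p, marginal_y(y)) * marginal_y(y) == p
```

Exact fractions are used instead of floats so that the equalities hold without rounding error.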

### Example

Consider the roll of a fair die and let $A = 1$ if the number is even (i.e., 2, 4, or 6) and $A = 0$ otherwise. Furthermore, let $B = 1$ if the number is prime (i.e., 2, 3, or 5) and $B = 0$ otherwise.


| Outcome | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| $A$ | 0 | 1 | 0 | 1 | 0 | 1 |
| $B$ | 0 | 1 | 1 | 0 | 1 | 0 |

Then, the joint distribution of $A$ and $B$, expressed as a probability mass function, is:

$P(A=0,B=0)=P\{1\}=\frac{1}{6}$

$P(A=0,B=1)=P\{3,5\}=\frac{2}{6}$

$P(A=1,B=0)=P\{4,6\}=\frac{2}{6}$

$P(A=1,B=1)=P\{2\}=\frac{1}{6}$
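The same four probabilities can be obtained by enumerating the six equally likely outcomes. This is a minimal sketch; the variable names (`evens`, `primes`, `joint_pmf`) are chosen here for illustration.

```python
from collections import Counter
from fractions import Fraction

outcomes = range(1, 7)   # faces of a fair die
evens = {2, 4, 6}        # A = 1 on these outcomes
primes = {2, 3, 5}       # B = 1 on these outcomes

# Count how many outcomes map to each (A, B) pair.
counts = Counter((int(n in evens), int(n in primes)) for n in outcomes)

# Each outcome has probability 1/6, so divide the counts by 6.
joint_pmf = {ab: Fraction(c, 6) for ab, c in counts.items()}

for ab in sorted(joint_pmf):
    print(ab, joint_pmf[ab])
```

Note that `Fraction` reduces $\frac{2}{6}$ to $\frac{1}{3}$ when printing; the values agree with the ones listed above.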


## Joint probability mass function (joint pmf)

If discrete random variables $X$ and $Y$ are defined on the same sample space $S$, then their joint probability mass function (joint pmf) is given by
$p(x,y) = P(X=x\ \text{and}\ Y=y),$
 

where $(x,y)$ is a pair of possible values for the pair of random variables $(X,Y)$, and $p(x,y)$ satisfies the following conditions:

- $0 \leq p(x,y) \leq 1$ 
- $\displaystyle{\mathop{\sum\sum}_{(x,y)}p(x,y) = 1}$
- $\displaystyle{P\left((X,Y)\in A\right) = \mathop{\sum\sum}_{(x,y)\in A} p(x,y)}$ for any set $A$ of pairs $(x,y)$
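The three conditions translate directly into code. The sketch below assumes the joint pmf is stored as a dictionary from pairs to probabilities; the table values and the function names `is_valid_pmf` and `prob_event` are illustrative, not taken from the reference.

```python
from fractions import Fraction

# Hypothetical joint pmf stored as {(x, y): probability}.
pmf = {
    (0, 0): Fraction(1, 6),
    (0, 1): Fraction(2, 6),
    (1, 0): Fraction(2, 6),
    (1, 1): Fraction(1, 6),
}

def is_valid_pmf(pmf):
    """Check that every value lies in [0, 1] and that the values sum to 1."""
    in_range = all(0 <= p <= 1 for p in pmf.values())
    return in_range and sum(pmf.values()) == 1

def prob_event(pmf, event):
    """P((X, Y) in A): sum p(x, y) over the pairs that belong to the set A."""
    return sum((p for xy, p in pmf.items() if xy in event), Fraction(0))

# Example event: the two variables take the same value.
same_value = {(0, 0), (1, 1)}
print(is_valid_pmf(pmf), prob_event(pmf, same_value))
```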


Refs: [1](https://stats.libretexts.org/Courses/Saint_Mary's_College_Notre_Dame/MATH_345__-_Probability_(Kuter)/5%3A_Probability_Distributions_for_Combinations_of_Random_Variables/5.1%3A_Joint_Distributions_of_Discrete_Random_Variables#:~:text=Suppose%20that%20X%20and%20Y,p(x%2Cy).)

## Joint cumulative distribution function (joint cdf)
In the discrete case, we can obtain the joint cumulative distribution function (joint cdf) of  $X$  and  $Y$  by summing the joint pmf:

$F(x,y) = P(X\leq x\ \text{and}\ Y\leq y) = \sum_{x_i \leq x} \sum_{y_j \leq y} p(x_i, y_j).$
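As a rough illustration of the double sum, the following sketch evaluates the joint cdf from a joint pmf stored as a dictionary. The pmf values and the name `joint_cdf` are assumptions made for this example.

```python
from fractions import Fraction

# Hypothetical joint pmf stored as {(x_i, y_j): probability}.
pmf = {
    (0, 0): Fraction(1, 6),
    (0, 1): Fraction(2, 6),
    (1, 0): Fraction(2, 6),
    (1, 1): Fraction(1, 6),
}

def joint_cdf(pmf, x, y):
    """F(x, y): sum p(x_i, y_j) over all support points with x_i <= x and y_j <= y."""
    return sum(
        (p for (xi, yj), p in pmf.items() if xi <= x and yj <= y),
        Fraction(0),
    )

for point in [(-1, 1), (0, 0), (0, 1), (1, 0), (1, 1)]:
    print(point, joint_cdf(pmf, *point))
```

The cdf is 0 below the smallest support point and reaches 1 once both arguments are at least as large as the largest values of $X$ and $Y$.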

## Semicolon notation in joint probability

In $p_{\theta} (x|z, y) = f(x; z, y, \theta)$, the expression

$f(x; z, y, \theta)$

is a function of $x$ with "parameters" $z, y, \theta$.
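One way to read the semicolon in code: everything after it is fixed first, and what remains is a function of $x$ alone. The sketch below is purely illustrative. The particular form of `f` (a unit-variance normal density in $x$ whose mean is $\theta(z+y)$) is an assumption made here, not something the notation implies.

```python
import math
from functools import partial

def f(x, z, y, theta):
    """Hypothetical density in x: Normal with mean theta * (z + y) and unit variance."""
    mean = theta * (z + y)
    return math.exp(-0.5 * (x - mean) ** 2) / math.sqrt(2 * math.pi)

# Fix the "parameters" z, y, theta; the result is a function of x only.
f_x = partial(f, z=1.0, y=2.0, theta=0.5)

# As a density in x, it should integrate to about 1 (crude Riemann sum on [-10, 13]).
step = 0.01
total = sum(f_x(-10 + i * step) for i in range(2301)) * step
print(f_x(1.5), total)
```

Changing `z`, `y`, or `theta` gives a different function of `x`, which is exactly the distinction the semicolon draws between the argument and the parameters.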