# Joint Probability Distributions

[Basics of joint probability](https://www.youtube.com/watch?v=CQS4xxz-2s4) - Video with very good visuals

Here's a clean, intuitive explanation of joint distributions. It builds on the video above but stands on its own.

---

# 📊 What Are Joint Probability Distributions?

A **joint probability distribution** describes the probability behavior of **two (or more) random variables at the same time**.

Think of it as a full map of how two variables interact.

---

# 🎯 The Core Idea

If you have two random variables:

- $X$ = height  
- $Y$ = weight  

A **joint distribution** tells you:

> What is the probability that *both* $X$ takes a certain value **and** $Y$ takes a certain value?

This is written as:

- **Discrete case:**  
  $$
  P(X = x, Y = y)
  $$

- **Continuous case:**  
  $$
  f_{X,Y}(x,y)
  $$  
  where $f_{X,Y}$ is the **joint density function**.

---

# 🧠 Why It Matters

Joint distributions let you:

- Understand **relationships** between variables  
- Compute **conditional probabilities**  
- Derive **marginal distributions**  
- Measure **dependence** (correlation, covariance)  
- Build models in machine learning, statistics, and data science

They're the foundation for everything from Bayesian inference to multivariate normal distributions.

---

# 🧩 How Joint Distributions Work

## 1. **Joint PMF (Discrete)**  
If $X$ and $Y$ take discrete values:

| X \ Y | Y=1 | Y=2 |
|-------|-----|-----|
| **X=1**   | 0.1 | 0.2 |
| **X=2**   | 0.3 | 0.4 |

Each cell is $P(X=x, Y=y)$.

All cells must sum to 1.

To find the marginal probability of $X = 1$, you sum over the row:

$P(X = 1) = P(X = 1, Y = 1) + P(X = 1, Y = 2) = 0.1 + 0.2 = 0.3$
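
The table and the row sum above can be checked numerically. This is a minimal sketch, assuming NumPy is available; the variable names (`joint`, `p_x`, `p_y`) are illustrative choices, not part of the original notes.

```python
import numpy as np

# Joint PMF from the table above: rows are X = 1, 2; columns are Y = 1, 2.
joint = np.array([[0.1, 0.2],
                  [0.3, 0.4]])

# A valid joint PMF is non-negative and sums to 1.
assert np.all(joint >= 0)
assert np.isclose(joint.sum(), 1.0)

# Marginal of X: sum each row (over Y). Marginal of Y: sum each column (over X).
p_x = joint.sum(axis=1)   # [P(X=1), P(X=2)]
p_y = joint.sum(axis=0)   # [P(Y=1), P(Y=2)]

print(p_x)  # approximately [0.3, 0.7]
print(p_y)  # approximately [0.4, 0.6]
```

Summing along `axis=1` collapses the columns, which is the "sum over the row" step written out for every value of $X$ at once.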

---

## 2. **Joint PDF (Continuous)**  
If $X$ and $Y$ are continuous:

- The joint density $f_{X,Y}(x,y)$ must satisfy  
  $$
  f_{X,Y}(x,y) \ge 0
  $$
  $$
  \iint f_{X,Y}(x,y)\,dx\,dy = 1
  $$

- Probabilities come from **volumes** under the density surface over a region $A$ of the $(x,y)$ plane:  
  $$
  P((X,Y) \in A) = \iint_A f_{X,Y}(x,y)\,dx\,dy
  $$
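
To make the two conditions and the region probability concrete, here is a rough numerical sketch. The density $f(x,y) = x + y$ on the unit square is a hypothetical example chosen for illustration (it is not from the original notes), and `integrate_rect` is a hand-rolled midpoint-rule helper, not a library function.

```python
import numpy as np

def f(x, y):
    """Hypothetical joint density: f(x, y) = x + y on [0, 1] x [0, 1], 0 elsewhere."""
    inside = (x >= 0) & (x <= 1) & (y >= 0) & (y <= 1)
    return np.where(inside, x + y, 0.0)

def integrate_rect(x_lo, x_hi, y_lo, y_hi, n=400):
    """Midpoint-rule approximation of the double integral of f over a rectangle."""
    dx = (x_hi - x_lo) / n
    dy = (y_hi - y_lo) / n
    xs = x_lo + (np.arange(n) + 0.5) * dx   # cell midpoints along x
    ys = y_lo + (np.arange(n) + 0.5) * dy   # cell midpoints along y
    X, Y = np.meshgrid(xs, ys, indexing="ij")
    return float(np.sum(f(X, Y)) * dx * dy)

total = integrate_rect(0.0, 1.0, 0.0, 1.0)      # whole support, should be close to 1
p_region = integrate_rect(0.0, 0.5, 0.0, 0.5)   # P(X <= 0.5, Y <= 0.5), exact value 1/8
print(total, p_region)
```

For this particular density the exact answers are $1$ and $1/8$, so the printed values should agree with those up to floating-point error.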

---

# 🧮 Marginals and Conditionals

### **Marginal distributions**  
You "sum out" or "integrate out" the other variable.

- Discrete:  
  $$
  P(X=x) = \sum_y P(X=x, Y=y)
  $$

- Continuous:  
  $$
  f_X(x) = \int f_{X,Y}(x,y)\,dy
  $$
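
As a worked example of integrating out a variable, assume the hypothetical joint density $f_{X,Y}(x,y) = x + y$ on $0 \le x \le 1$, $0 \le y \le 1$ (zero elsewhere). This density is an illustration only and does not come from the original notes. Its marginal in $x$ is:

$$
f_X(x) = \int_0^1 (x + y)\,dy = \left[\, xy + \tfrac{y^2}{2} \,\right]_{y=0}^{y=1} = x + \tfrac{1}{2}, \qquad 0 \le x \le 1
$$

As a sanity check, the marginal is itself a valid density:

$$
\int_0^1 \left( x + \tfrac{1}{2} \right) dx = \tfrac{1}{2} + \tfrac{1}{2} = 1
$$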

### **Conditional distributions**  
You "slice" the joint distribution at a fixed value of one variable, then rescale the slice so it sums (or integrates) to 1.

- Discrete (defined when $P(X=x) > 0$):  
  $$
  P(Y=y \mid X=x) = \frac{P(X=x, Y=y)}{P(X=x)}
  $$

- Continuous (defined when $f_X(x) > 0$):  
  $$
  f_{Y\mid X}(y\mid x) = \frac{f_{X,Y}(x,y)}{f_X(x)}
  $$
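
The discrete formula can be applied to the $2 \times 2$ joint PMF table from the Joint PMF section. This is a minimal sketch, assuming NumPy; the name `cond_y_given_x` is an illustrative choice.

```python
import numpy as np

# Joint PMF table: rows are X = 1, 2; columns are Y = 1, 2.
joint = np.array([[0.1, 0.2],
                  [0.3, 0.4]])

p_x = joint.sum(axis=1)   # marginal of X: [P(X=1), P(X=2)]

# Divide each row by its row total: row i becomes P(Y = y | X = x_i).
cond_y_given_x = joint / p_x[:, None]

print(cond_y_given_x[0])  # P(Y | X=1), approximately [0.333, 0.667]
print(cond_y_given_x[1])  # P(Y | X=2), approximately [0.429, 0.571]
```

Each row of `cond_y_given_x` sums to 1, which is what dividing by $P(X=x)$ is for: it turns a slice of the joint table into a proper distribution over $Y$.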

---

# 🎨 Intuition You'll Love

Imagine the joint distribution as a **landscape**:

- The **height** of the surface at $(x,y)$ is the density $f_{X,Y}(x,y)$.
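- **Marginals** are what you get by piling all the mass of the landscape onto one axis, accumulating along the other direction.
- **Conditionals** are vertical slices through the landscape at a fixed $x$ (or $y$), rescaled so the area under the slice is 1.
- **Independence** means the landscape factors into the product of its marginals, so every slice has the same shape up to scaling:  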
  $$
  f_{X,Y}(x,y) = f_X(x)\,f_Y(y)
  $$
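
The discrete analogue of this factorization, $P(X=x, Y=y) = P(X=x)\,P(Y=y)$ for every cell, can be tested on the $2 \times 2$ table from the Joint PMF section. This is a minimal sketch, assuming NumPy; comparing with `np.allclose` is one reasonable way to do the check, not the only one.

```python
import numpy as np

# Joint PMF table: rows are X = 1, 2; columns are Y = 1, 2.
joint = np.array([[0.1, 0.2],
                  [0.3, 0.4]])

p_x = joint.sum(axis=1)   # marginal of X
p_y = joint.sum(axis=0)   # marginal of Y

# Under independence, the joint table would equal the outer product of the marginals.
product = np.outer(p_x, p_y)   # approximately [[0.12, 0.18], [0.28, 0.42]]

independent = np.allclose(joint, product)
print(independent)  # False: P(X=1, Y=1) = 0.1, but P(X=1) * P(Y=1) = 0.12
```

So in that table $X$ and $Y$ are dependent: knowing $X$ changes the distribution of $Y$.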