# Intuition for Joint, Marginal, and Conditional Probability

Understanding probability for a single random variable is relatively straightforward, but things can get a bit complex when you involve two or more variables. In the context of just two variables, you might want to explore:

- **Joint Probability**: This is about the probability of two events happening together.

- **Conditional Probability**: It's about the probability of one event given that another event has already occurred.

- **Marginal Probability**: This is the probability of an event for one variable, irrespective of the outcome of the other variable.

While these types of probability can be formally defined, grasping their real-world meaning might take some time. To truly understand them, you'll benefit from practical examples that you can experiment with.

In this tutorial, we'll walk through the intuition behind calculating joint, marginal, and conditional probability. By the end, you will know:

- How to compute joint, marginal, and conditional probability for independent random variables.

- How to gather observations from joint random variables and construct a joint probability table.

- How to calculate joint, marginal, and conditional probability from a joint probability table.

This tutorial will help you build a solid understanding of these concepts, making them more intuitive and accessible.


---


## Understanding Joint, Marginal, and Conditional Probabilities

When we're dealing with a single random variable, calculating probability is a relatively straightforward task. However, things get more interesting when we study two or more random variables, which is quite common in real-world scenarios. There are three key types of probability that we may want to explore when working with two or more random variables:

  - **Joint Probability**: This is the probability of events happening simultaneously.

  - **Marginal Probability**: It's the probability of an event without considering the other variables.

  - **Conditional Probability**: This is the probability of one event given that another event has occurred.

The meaning and calculation of these different types of probabilities can vary based on whether the two random variables are independent (simpler) or dependent (more complex). In this exploration, we will learn how to calculate and interpret these three types of probability using practical examples.

- In the next section, we'll examine the independent rolls of two dice, and in the following section, we'll delve into the occurrence of weather events in two geographically close cities. This journey will provide you with a solid understanding of how these probabilities work in different scenarios.


---


# Probabilities of Rolling Two Dice

To understand probabilities when rolling two dice, let's start with the basics.
- When you roll a fair six-sided die, each number from 1 to 6 has an equal chance of landing, making it a 1 in 6 or 16.666% probability.

  - Probability of rolling a 1: 16.666%
  - Probability of rolling a 2: 16.666%
  - Probability of rolling a 3: 16.666%
  - Probability of rolling a 4: 16.666%
  - Probability of rolling a 5: 16.666%
  - Probability of rolling a 6: 16.666%

  ---

## Independent Events

When we roll a second die, the probabilities remain the same for each value on that die. The rolls of the two dice are independent and do not affect each other.

- Probability of the first die showing some value from 1 to 6: 100%
- Probability of the second die showing some value from 1 to 6: 100%

  ---

## Even Numbers

Now, let's find the probability of rolling an even number on the first die. We can do this by adding the probabilities of rolling a 2, 4, or 6.

- Probability of rolling an even number on the first die: 50%

  ---

## Joint Probability

To find the joint probability of rolling an even number on both dice, we can use the formula for independent events:

  ```
  P(A ∩ B) = P(A) × P(B)
  ```

- Here, it's the probability of rolling an even number on the first die multiplied by the probability of rolling an even number on the second die (0.5 × 0.5). The product rule applies because the rolls are independent: the outcome of the first die does not affect the outcome of the second.

  - Probability of both dice showing even numbers: 25%
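
The arithmetic above can be sketched in a few lines of Python. This is a minimal illustration, assuming a fair die; the variable names are chosen for this example and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

# A fair six-sided die: every face has probability 1/6.
faces = range(1, 7)
p_face = Fraction(1, 6)

# P(even) is the sum of the probabilities of rolling a 2, 4, or 6.
p_even = sum(p_face for face in faces if face % 2 == 0)

# For independent events, the joint probability is the product.
p_both_even = p_even * p_even

print(p_even)       # 1/2
print(p_both_even)  # 1/4
```

Multiplying by 100 converts these fractions to the 50% and 25% figures quoted in the text.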

  ---

## Joint Probability Table

We can create a table to show the joint probabilities for all possible combinations of dice rolls.
  - For example, the probability of both dice showing a 2 is 2.777%.

    ```
    |       | 1      | 2      | 3      | 4      | 5      | 6      |
    |-------|--------|--------|--------|--------|--------|--------|
    | 1     | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% |
    | 2     | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% |
    | 3     | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% |
    | 4     | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% |
    | 5     | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% |
    | 6     | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% | 2.777% |
    ```

- This table helps us understand the joint and marginal probabilities of independent variables. We can even calculate more complex scenarios, like the probability of the first die showing a 2 and the second die showing an odd number. This is 1/6 × 1/2, or equivalently the sum of three cells in the table.

  - Probability of the first die showing a 2 and the second die showing an odd number: 8.333%
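
As a rough sketch, the joint table can be built by enumerating all 36 ordered pairs. The example below reads the scenario as "the first die shows a 2 and the second die shows an odd number"; the dictionary layout, keyed as `(first die, second die)`, is an assumption made for this illustration.

```python
from fractions import Fraction
from itertools import product

faces = range(1, 7)

# Joint probability of every (first die, second die) pair.
# The rolls are independent, so each cell is 1/6 * 1/6 = 1/36.
joint = {
    (d1, d2): Fraction(1, 6) * Fraction(1, 6)
    for d1, d2 in product(faces, repeat=2)
}

# P(first die = 2 and second die is odd): add up the matching cells.
p_two_and_odd = sum(
    p for (d1, d2), p in joint.items() if d1 == 2 and d2 % 2 == 1
)

print(joint[(2, 2)])   # 1/36
print(p_two_and_odd)   # 1/12
```

Here 1/36 is roughly 2.777% and 1/12 is roughly 8.333%.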

  ---

## Marginal Probability

Marginal probabilities are the probabilities of one variable regardless of the other.
- For example, the probability of rolling a 6 on the second die is 16.666%.

- Importantly, when we sum all the probabilities in the table, the total is 100%. This is a requirement for any table of joint probabilities, whether or not the variables are independent, because the cells are mutually exclusive and together cover every possible outcome.

- Finally, in the case of independent events, conditional probability doesn't offer any new insights, as the probability of rolling a 2 on the first die remains the same no matter what the second die shows.
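
The three observations above (the table sums to 1, the marginal is a sum over the other die, and conditioning changes nothing for independent dice) can be checked with a short sketch. It assumes the same uniform 36-cell table; the names are illustrative only.

```python
from fractions import Fraction
from itertools import product

faces = range(1, 7)
joint = {pair: Fraction(1, 36) for pair in product(faces, repeat=2)}

# All 36 cells together cover every possible outcome, so they sum to 1.
total = sum(joint.values())

# Marginal: P(second die = 6), summing over every value of the first die.
p_d2_six = sum(joint[(d1, 6)] for d1 in faces)

# Conditional: P(first die = 2 | second die = 6) = joint / marginal.
p_d1_two_given_d2_six = joint[(2, 6)] / p_d2_six

print(total)                  # 1
print(p_d2_six)               # 1/6
print(p_d1_two_given_d2_six)  # 1/6
```

The conditional probability equals the unconditional 1/6, which is what independence means in practice.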


---


# Probabilities of Weather in Two Cities

To understand probabilities for two dependent random variables, let's consider two cities, City 1 and City 2.
- These cities are close enough that they are generally affected by similar weather patterns, but not identical weather.
- We can categorize the weather in both cities as sunny, cloudy, or rainy. There's a dependency between the weather in the two cities, and we'll explore different types of probability.

  ---

## Data Collection

To start, we collected data for the weather in both cities over twenty days. For instance:

  ```
  | Day | City 1 | City 2   |
  | --- | ------ | ------- |
  | 1   | Sunny  | Sunny   |
  | 2   | Sunny  | Cloudy  |
  | 3   | ...    | ...     |
  ```

- We have omitted the complete table for brevity, but it helps us explore the probability of weather events in the two cities.

  ---

## Frequency of Weather

Next, we can calculate the frequency of paired events we observed, such as the number of times it was sunny in both cities, sunny in one and cloudy in the other, and so on.
  ```
  | City 1 | City 2 | Total   |
  | ------ | ------ | ------- |
  | Sunny  | Sunny  | 6/20    |
  | Sunny  | Cloudy | 1/20    |
  | Sunny  | Rainy  | 0/20    |
  | ...    | ...    | ...     |
  ```

This data serves as the foundation for exploring the probabilities of weather events in both cities.
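
Counting paired events is straightforward with `collections.Counter`. The sketch below uses a made-up five-day sample, not the twenty-day data set (which is omitted from the text), so the numbers it prints are illustrative only.

```python
from collections import Counter

# Hypothetical (City 1, City 2) observations, one pair per day.
# These five days are invented for illustration; they are not the tutorial's data.
observations = [
    ("sunny", "sunny"),
    ("sunny", "cloudy"),
    ("cloudy", "cloudy"),
    ("rainy", "rainy"),
    ("sunny", "sunny"),
]

# Count how often each pair of weather events occurred together.
counts = Counter(observations)

# Dividing each count by the number of days gives a relative frequency,
# which serves as an estimate of the joint probability.
n_days = len(observations)
joint = {pair: count / n_days for pair, count in counts.items()}

print(counts[("sunny", "sunny")])  # 2
print(joint[("sunny", "sunny")])   # 0.4
```

Pairs that never occur are simply absent from `counts`; a `Counter` returns 0 when they are looked up.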


---


# Joint Probabilities

Let's begin by exploring the probabilities of weather events in two cities. We can create a table to show the probabilities of combined weather events in both cities.
- The table summarizes the likelihood of different weather conditions for both cities, with City 1 represented along the columns and City 2 along the rows.
  ```
  |         | Sunny | Cloudy | Rainy |
  | ------- | ----- | ------ | ----- |
  | Sunny   | 0.3   | 0.1    | 0.0   |
  | Cloudy  | 0.05  | 0.25   | 0.1   |
  | Rainy   | 0.0   | 0.05   | 0.15  |
  ```

- A cell in the table describes the joint probability of a weather event in both cities.
- Together, these probabilities provide a comprehensive view of the joint probability distribution for weather events in the two cities. The sum of the joint probabilities in all cells adds up to 1.0.

  ---

## Calculating Joint Probabilities

- For example, we can determine the joint probability of both cities experiencing a sunny day at the same time. This can be expressed as:

  - Probability of both cities being sunny: 30%

- We can also consider more specific scenarios, such as the probability of City 1 being sunny while City 2 is rainy, a combination that was never observed (0/20):

  - Probability of City 1 being sunny and City 2 being rainy: 0%

- The table also allows us to calculate the marginal distribution of events.
  - For instance, we can find the probability of a sunny day in City 1, regardless of what's happening in City 2. This is calculated by summing the probabilities in the first column:

    - Probability of a sunny day in City 1: 35%

- Similarly, we can calculate the probability of a rainy day in City 2 by summing the probabilities in the last row:

  - Probability of a rainy day in City 2: 20%

- Marginal probabilities are useful and often interesting. It's a good practice to include them in the table of joint probabilities for a comprehensive view.

  - Here's an updated table that includes both joint and marginal probabilities:

  ```
  |          | Sunny | Cloudy | Rainy | Marginal |
  | -------- | ----- | ------ | ----- | -------- |
  | Sunny    | 0.3   | 0.1    | 0.0   | 0.4      |
  | Cloudy   | 0.05  | 0.25   | 0.1   | 0.4      |
  | Rainy    | 0.0   | 0.05   | 0.15  | 0.2      |
  | Marginal | 0.35  | 0.4    | 0.25  | 1.0      |
  ```
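
The marginal sums can be sketched in Python as follows. The day counts below are an assumption: the full twenty-day data set is not shown, so they were chosen to agree with the figures quoted in the text (6/20, 1/20, and 0/20 for the sunny City 1 pairs, a 35% marginal for a sunny day in City 1, and a 20% marginal for a rainy day in City 2). Keys are `(city1, city2)`.

```python
from fractions import Fraction

weather = ["sunny", "cloudy", "rainy"]

# Assumed day counts out of 20, keyed as (city1, city2). They are chosen to
# match the figures quoted in the text, not taken from the omitted data set.
counts = {
    ("sunny", "sunny"): 6, ("cloudy", "sunny"): 2, ("rainy", "sunny"): 0,
    ("sunny", "cloudy"): 1, ("cloudy", "cloudy"): 5, ("rainy", "cloudy"): 2,
    ("sunny", "rainy"): 0, ("cloudy", "rainy"): 1, ("rainy", "rainy"): 3,
}
joint = {pair: Fraction(n, 20) for pair, n in counts.items()}

# Marginal for City 1: sum over every City 2 outcome (a column of the table).
p_city1 = {c1: sum(joint[(c1, c2)] for c2 in weather) for c1 in weather}
# Marginal for City 2: sum over every City 1 outcome (a row of the table).
p_city2 = {c2: sum(joint[(c1, c2)] for c1 in weather) for c2 in weather}

print(sum(joint.values()))  # 1
print(p_city1["sunny"])     # 7/20
print(p_city2["rainy"])     # 1/5
```

Here 7/20 is the 35% marginal for a sunny day in City 1 and 1/5 is the 20% marginal for a rainy day in City 2.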

---


# Conditional Probabilities

Conditional probability helps us understand the likelihood of a weather event occurring in one city given the occurrence of a weather event in another city. This can be calculated using joint and marginal probabilities.

- We can express conditional probability using the formula:

    ```
    P(A|B) = P(A ∩ B) / P(B)
    ```

- For example, we might be interested in the probability of it being sunny in City 1, given that it is sunny in City 2:

    ```
    P(city1=sunny|city2=sunny) = P(city1=sunny ∩ city2=sunny) / P(city2=sunny)
    ```

- Substituting the joint probability of both cities being sunny (0.3) and the marginal probability of a sunny day in City 2 (0.4) gives 0.3 / 0.4:

  - Probability of City 1 being sunny when City 2 is sunny: 75%

- This percentage is intuitive, as we would expect sunny weather in one city to often coincide with sunny weather in the other. Note that conditional probability is not symmetric; in general, swapping the event and the condition changes the value:

  ```
  P(A|B) ≠ P(B|A)
  ```

- The probability of it being sunny in City 1 when City 2 is sunny is different from the probability of it being sunny in City 2 when City 1 is sunny:

    ```
    P(city1=sunny|city2=sunny) ≠ P(city2=sunny|city1=sunny)
    ```

- To calculate the latter, the probability of City 2 being sunny given that City 1 is sunny:

    ```
    P(city2=sunny|city1=sunny) = P(city2=sunny ∩ city1=sunny) / P(city1=sunny)
    ```

- Substituting 0.3 / 0.35 gives a higher value, about 85.714%. We can also rearrange the formula to calculate the joint probability from a conditional and a marginal probability:

    ```
    P(A ∩ B) = P(A|B) * P(B)
    ```

- For example, if we only know the conditional probability of City 2 being sunny given that City 1 is sunny and the marginal probability of City 1 being sunny, we can calculate the joint probability as:

    ```
    P(city1=sunny ∩ city2=sunny) = P(city2=sunny|city1=sunny) * P(city1=sunny) = 0.857 * 0.35 ≈ 0.3
    ```

This matches the joint probability of 0.3 for both cities being sunny that we read directly from the table.
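
The conditional calculations in this section can be reproduced with exact fractions. This sketch uses only the sunny/sunny joint probability (0.3) and the two sunny marginals (0.35 for City 1 and 0.4 for City 2); the variable names are illustrative.

```python
from fractions import Fraction

# Values for the sunny/sunny case.
p_joint = Fraction(3, 10)         # P(city1=sunny ∩ city2=sunny)
p_city1_sunny = Fraction(7, 20)   # marginal P(city1=sunny) = 0.35
p_city2_sunny = Fraction(2, 5)    # marginal P(city2=sunny) = 0.4

# Conditional probability: joint divided by the marginal of the condition.
p_c1_given_c2 = p_joint / p_city2_sunny
p_c2_given_c1 = p_joint / p_city1_sunny

# Product rule: conditional times marginal recovers the joint probability.
recovered = p_c2_given_c1 * p_city1_sunny

print(p_c1_given_c2)         # 3/4
print(p_c2_given_c1)         # 6/7
print(recovered == p_joint)  # True
```

The two conditionals differ (3/4 versus 6/7, about 0.857) because they divide the same joint probability by different marginals.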


---


## Further Reading


## Books
- [Probability: For the Enthusiastic Beginner, 2016](https://amzn.to/2jULJsu)
- [Pattern Recognition and Machine Learning, 2006](https://amzn.to/2JwHE7I)
- [Machine Learning: A Probabilistic Perspective, 2012](https://amzn.to/2xKSTCP)

## Articles
- [Probability, Wikipedia](https://en.wikipedia.org/wiki/Probability)
- [Notation in probability and statistics, Wikipedia](https://en.wikipedia.org/wiki/Notation_in_probability_and_statistics)
- [Independence (probability theory), Wikipedia](https://en.wikipedia.org/wiki/Independence_(probability_theory))
- [Independent and identically distributed random variables, Wikipedia](https://en.wikipedia.org/wiki/Independent_and_identically_distributed_random_variables)
- [Mutual exclusivity, Wikipedia](https://en.wikipedia.org/wiki/Mutual_exclusivity)
- [Marginal distribution, Wikipedia](https://en.wikipedia.org/wiki/Marginal_distribution)
- [Joint probability distribution, Wikipedia](https://en.wikipedia.org/wiki/Joint_probability_distribution)
- [Conditional probability, Wikipedia](https://en.wikipedia.org/wiki/Conditional_probability)

---
