# Probability Distributions

- Probability isn't just about determining the chance of one specific event; it can also be used to summarize the likelihood of all possible outcomes.

- In probability, we often talk about something called a "random variable," which is a quantity whose value is determined by the outcome of a random process.
  - The connection between each possible outcome for a random variable and its associated probability is known as a "probability distribution."

  - These distributions are a fundamental concept in probability, and you've probably encountered some of the common ones.

  - The nature and characteristics of a probability distribution depend on the type of random variable it represents, which could be either continuous or discrete.

  - This distinction influences how we describe the distribution and calculate the most probable outcome and its likelihood.

In this tutorial, we'll provide a gentle introduction to probability distributions. By the end of it, you'll know that:

- Random variables in probability come with a specific range and can be either continuous or discrete.

- Probability distributions help us summarize the connection between possible values and their probabilities for a random variable.

- Probability density or mass functions associate values with probabilities, and cumulative distribution functions associate outcomes less than or equal to a value with a probability.

---


# Random Variables

- A **random variable** is a quantity that results from a random process. In probability, a random variable can assume one of several possible values, which can be events from the state space.
  - Specific values or sets of values for a random variable can be assigned probabilities.

- In probability modeling, we often consider example data or instances as events, observations, or realizations of underlying random variables.

  > "A random variable is often denoted as a capital letter, e.g. X, and values of the random variable are denoted as a lowercase letter and an index, e.g. x₁, x₂, x₃."
  > — Page viii, *Probability: For the Enthusiastic Beginner*, 2016.

- The values that a random variable can take are called its **domain**, and this domain can be either **discrete** or **continuous**.

  > "Variables in probability theory are called random variables, and their names begin with an uppercase letter. Every random variable has a domain — the set of possible values it can take on."
  > — Page 486, *Artificial Intelligence: A Modern Approach*, 3rd edition, 2009.

- **Discrete Random Variable**: Values are drawn from a finite (or countably infinite) set of states, for example, the colors of a car.

- **Boolean Random Variable**: Values are drawn from the set {true, false}, for example, a coin toss.

- **Continuous Random Variable**: Values are drawn from a range of real-valued numerical values, for example, the height of humans.

- A value of a random variable can be specified using an equals operator, for example, `X = True`.
  - The probability of a random variable is denoted as a function using the uppercase 'P' or 'Pr'; for instance, `P(X)` represents the probability distribution over all values of the random variable `X`.
  - The probability of a specific value of a random variable can be denoted as `P(X = True)`, indicating the probability of the random variable `X` having the value `True`.
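
The notation above can be sketched in Python. This is a minimal illustration, assuming a Boolean random variable `X` that models a fair coin; the names `P_X` and `estimate_probability` are hypothetical and chosen only for this example.

```python
import random

# Assumed distribution for a Boolean random variable X (a fair coin).
# P(X) is the whole mapping; P(X = True) is a single entry of it.
P_X = {True: 0.5, False: 0.5}


def estimate_probability(value, trials=100_000, seed=0):
    """Estimate P(X = value) by simulating realizations of X."""
    rng = random.Random(seed)
    outcomes = list(P_X.keys())
    weights = list(P_X.values())
    draws = rng.choices(outcomes, weights=weights, k=trials)
    return draws.count(value) / trials


print(P_X[True])                   # exact P(X = True)
print(estimate_probability(True))  # simulated estimate, close to 0.5
```

Each simulated draw is one realization of `X`, and the fraction of draws equal to `True` approaches `P(X = True)` as the number of trials grows.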

---

# Probability Distribution

- A **probability distribution** is a way of summarizing the likelihood of different values for a random variable.
  - Imagine arranging all the possible values of a random variable in a line, and then assigning each value a probability.
  - This arrangement forms a distribution, which has specific properties we can measure.
  - Two key properties are the **expected value** and the **variance**, which are the first moment and the second central moment of the distribution.

- **Expected Value**: This is the probability-weighted average of the values of a random variable. It is not necessarily the most likely value; the most likely value is the mode.
  - It's denoted as E[X], or E[f(X)] when a function f is applied to the variable's values.

- **Variance**: This tells us how the values of a random variable spread out from the average.
  - The variance is usually represented as Var(X) or Var[f(x)]. The square root of the variance gives us the **standard deviation**.
  - The relationship between two random variables is described by **covariance**, which summarizes how they change together.

- In essence:

  - **Expected Value**: The average value of a random variable.

  - **Variance**: The average squared deviation of values from the expected value.

- Every random variable has its own probability distribution.
  - Many different random variables may share the same family of distribution (the same shape), while the parameters, such as the mean and the spread, differ.
  - Common probability distributions are defined by a few parameters and come with procedures for calculating the expected value and variance.
  - The structure of the distribution depends on whether the random variable is discrete or continuous.
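
As a worked example of these two properties, the sketch below computes the expected value and variance of a discrete random variable directly from their definitions. It assumes a fair six-sided die; the helper names `expected_value` and `variance` are hypothetical, and exact fractions are used only to keep the arithmetic readable.

```python
from fractions import Fraction

# Assumed example: a fair six-sided die, each face with probability 1/6.
pmf = {face: Fraction(1, 6) for face in range(1, 7)}


def expected_value(pmf):
    """E[X]: sum of x * P(X = x) over all values x."""
    return sum(x * p for x, p in pmf.items())


def variance(pmf):
    """Var(X): E[(X - E[X])^2], the second central moment."""
    mu = expected_value(pmf)
    return sum((x - mu) ** 2 * p for x, p in pmf.items())


mean = expected_value(pmf)
var = variance(pmf)
std = float(var) ** 0.5  # standard deviation is the square root of the variance

print(mean)  # 7/2
print(var)   # 35/12
print(std)
```

The expected value of 3.5 is not a face of the die, which shows that the expected value does not have to be a likely, or even a possible, outcome.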

---


# Discrete Probability Distributions

- **Discrete probability distributions** summarize the likelihood of different outcomes for a discrete random variable. They are characterized by two main functions:

  - **Probability Mass Function (PMF)**: This function assigns probabilities to specific values of a discrete random variable.

  - **Cumulative Distribution Function (CDF)**: The CDF tells us the probability that the random variable will take a value less than or equal to a specific discrete value.

- In some cases, the values of the random variable have no natural order on a number line: counts can be ordered, but car colors cannot.
  - Without a natural order, the CDF has no direct interpretation, and the PMF need not change smoothly in relative probability from one value to the next.

- The **expected value** for a discrete random variable is calculated as the probability-weighted sum of its values, E[X] = Σ x · P(X = x). The most common value is the mode, which can differ from the expected value. The probabilities in the PMF always sum to one.

- Common examples of discrete probability distributions include:

  - **Bernoulli and Binomial Distributions**
  - **Multinoulli and Multinomial Distributions**
  - **Poisson Distribution**

- These distributions are often associated with specific domains:

  - The probabilities of dice rolls form a **discrete uniform distribution**.
  - The probabilities of coin flips form a **Bernoulli distribution**.
  - The probabilities of car colors form a **multinoulli (categorical) distribution**; counts of each color across many cars form a **multinomial distribution**.

For a deeper understanding of discrete probability distributions, you can refer to Chapter 8.
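
To make the PMF and CDF concrete, here is a small sketch for a binomial random variable using only the standard library. The scenario (10 flips of a fair coin) and the function names `binomial_pmf` and `binomial_cdf` are assumptions made for illustration.

```python
from math import comb


def binomial_pmf(k, n, p):
    """P(X = k): probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def binomial_cdf(k, n, p):
    """P(X <= k): sum of the PMF over the values 0..k."""
    return sum(binomial_pmf(i, n, p) for i in range(k + 1))


n, p = 10, 0.5  # assumed: 10 flips of a fair coin
pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]

print(sum(pmf))               # total probability, 1.0
print(binomial_pmf(5, n, p))  # 252/1024 = 0.24609375
print(binomial_cdf(5, n, p))  # 638/1024 = 0.623046875
```

The CDF is just a running total of the PMF, which is why it only makes sense when the values can be ordered.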

---




# Continuous Probability Distributions

- **Continuous probability distributions** describe the probability for a continuous random variable. They're characterized by two main functions:

  - **Probability Density Function (PDF)**: This function defines the probability distribution for a continuous random variable. Its value at a point is a density, not a probability; probabilities are obtained as the area under the curve over an interval.
    - Unlike discrete random variables, which have a Probability Mass Function (PMF), continuous random variables use the PDF.

  - **Cumulative Distribution Function (CDF)**: The CDF tells us the probability that the random variable will take a value less than or equal to a specific numerical value from its domain.

- In contrast to discrete distributions, continuous distributions are typically represented by smooth curves. Common examples of continuous probability distributions include:

  - **Normal (Gaussian) Distribution**
  - **Exponential Distribution**
  - **Pareto Distribution**

- These distributions are often associated with specific domains:

  - The heights of humans approximately follow a **Normal distribution**.
  - The success of movies can follow a **Power-law distribution**.
  - Income levels can follow a **Pareto distribution**.

For a more detailed exploration of continuous probability distributions, you can refer to Chapter 9.
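
The difference between a density and a probability can be sketched with `statistics.NormalDist` from the Python standard library. The parameters below (mean 170 cm, standard deviation 10 cm) are assumed purely for illustration and are not measured values.

```python
from statistics import NormalDist

# Assumed parameters, for illustration only: heights in cm with
# mean 170 and standard deviation 10.
heights = NormalDist(mu=170, sigma=10)

density = heights.pdf(170)    # a density at one point, not a probability
below_180 = heights.cdf(180)  # P(X <= 180)
between = heights.cdf(180) - heights.cdf(160)  # P(160 <= X <= 180)

print(density)    # about 0.0399
print(below_180)  # about 0.841
print(between)    # about 0.683
```

The probability of an interval is the difference of two CDF values, which equals the area under the PDF between the two endpoints.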


---


# Further Reading

## Books
- [Probability Theory: The Logic of Science, 2003](https://amzn.to/2lnW2pp)
- [Introduction to Probability, 2nd edition, 2019](https://amzn.to/2xPvobK)
- [Probability: For the Enthusiastic Beginner, 2016](https://amzn.to/2jULJsu)

## Articles
- [Random variable, Wikipedia](https://en.wikipedia.org/wiki/Random_variable)
- [Moment (mathematics), Wikipedia](https://en.wikipedia.org/wiki/Moment_(mathematics))
- [Probability distribution, Wikipedia](https://en.wikipedia.org/wiki/Probability_distribution)
- [List of probability distributions, Wikipedia](https://en.wikipedia.org/wiki/List_of_probability_distributions)

---

