# 4. Probability Models for a Discrete Random Variable
<hr>

1. Bernoulli random variable
2. Binomial random variable
3. Poisson random variable
4. Geometric random variable
5. Negative binomial random variable
6. Hypergeometric random variable

## 4.1 Bernoulli Random Variable
<hr>

A Bernoulli random variable represents an experiment that results in two outcomes, which can be classified as either a success or a failure.

**Example:** Consider an experiment of tossing a fair coin. We can define a Bernoulli random variable $X$ to represent the outcome of this experiment, where $X$ takes the value of either 0 or 1.
- Let $X=0$ represent the outcome being a failure (e.g., tails).
- Let $X=1$ represent the outcome being a success (e.g., heads).

A Bernoulli distribution is characterized by a single parameter $p$, which is the probability of success.

$$X \sim \text{Bernoulli}(p)$$

### 4.1.1 PMF of Bernoulli
The PMF of a Bernoulli random variable defines the probability of each of its possible outcomes. It is given by:

\begin{align}
P(X=1) &= p && \text{(probability of success)} \\
P(X=0) &= 1 - p && \text{(probability of failure)}
\end{align}

Here, $p$ is the probability of the event being a success (e.g., getting heads in a coin toss), and $1−p$ is the probability of it being a failure.

### 4.1.2 Expected Value and Variance of Bernoulli

Because $X$ only takes the values 0 and 1, $X^2 = X$ and therefore $E(X^2)=p$:

$$E(X)=1(p)+0(1-p)=p \quad \quad \text{Var}(X)=E(X^2)-[E(X)]^2=p-p^2=p(1-p)$$
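The mean and variance above can be checked numerically by summing over the two outcomes. The following is a minimal sketch; the function name `bernoulli_pmf` and the choice $p = 0.5$ (the fair coin from the example) are illustrative assumptions.

```python
def bernoulli_pmf(x, p):
    """Return P(X = x) for X ~ Bernoulli(p); any x other than 0 or 1 has probability 0."""
    if x not in (0, 1):
        return 0.0
    return p if x == 1 else 1 - p

p = 0.5  # fair coin, as in the example above

# E(X) and E(X^2) by direct summation over the outcomes 0 and 1
mean = sum(x * bernoulli_pmf(x, p) for x in (0, 1))
second_moment = sum(x**2 * bernoulli_pmf(x, p) for x in (0, 1))
variance = second_moment - mean**2

print(mean, variance)  # 0.5 0.25
```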

## 4.2 Binomial Random Variable
<hr>

A Binomial Random Variable can be thought of as a generalization of the Bernoulli Random Variable. It describes the number of successes in a fixed number of independent Bernoulli trials.

- **Bernoulli Trial:** A single experiment with two outcomes: success (probability $p$) and failure (probability $1−p$).
- **Binomial Setting:** Consists of $n$ independent Bernoulli trials, each with the same probability of success $p$.

If $X$ is defined as the number of successes in $n$ trials, and each trial has a success probability $p$, then $X$ is a binomial random variable, denoted as:

$$X \sim \text{Binomial}(n, p)$$

In the binomial setting, when $n=1$, it reduces to a Bernoulli distribution.

**Example:** Consider an experiment of tossing a coin 3 times. Assume $P(H)=1/3$ and we are interested in the total number of heads. Let $X$ be the total number of heads in 3 tosses. Then $X$ follows a binomial distribution:

$$X \sim \text{Binomial} \left(3, \frac{1}{3} \right)$$

### 4.2.1 PMF of Binomial
The PMF of a binomial random variable $X$ is given by:

$$P(X=x) = \binom{n}{x} p^x (1-p)^{n-x}, \quad x=0,1,\cdots,n$$

where $\binom{n}{x}$ is the binomial coefficient representing the number of ways to choose $x$ successes out of $n$ trials.

For our coin toss example, the sample space $S$ and the corresponding probabilities $P(X=x)$ are:

\begin{align}
S &= \{\text{TTT, HHH, HHT, HTH, THH, HTT, THT, TTH}\} \\
X &= 0 : P(X=0)=\binom{3}{0} \left(\frac{2}{3} \right)^3 = \frac{8}{27} \\
X &= 1 : P(X=1)=\binom{3}{1} \left(\frac{1}{3}\right) \left(\frac{2}{3}\right)^2 = \frac{12}{27} \\
X &= 2 : P(X=2)=\binom{3}{2} \left(\frac{1}{3}\right)^2 \left(\frac{2}{3}\right) = \frac{6}{27} \\
X &= 3 : P(X=3)=\binom{3}{3} \left(\frac{1}{3}\right)^3 = \frac{1}{27}
\end{align}

These four probabilities sum to 1, as they must.
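The coin-toss probabilities can be reproduced exactly with rational arithmetic. This is a small sketch using only the Python standard library; `binomial_pmf` is an illustrative helper name, not part of any library. Fractions are printed in lowest terms.

```python
from fractions import Fraction
from math import comb

def binomial_pmf(x, n, p):
    """Return P(X = x) for X ~ Binomial(n, p)."""
    if x < 0 or x > n:
        return 0
    return comb(n, x) * p**x * (1 - p)**(n - x)

n, p = 3, Fraction(1, 3)  # 3 tosses, P(H) = 1/3
pmf = {x: binomial_pmf(x, n, p) for x in range(n + 1)}

for x, prob in pmf.items():
    print(x, prob)
# 0 8/27
# 1 4/9
# 2 2/9
# 3 1/27
```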

### 4.2.2 Expected Value and Variance of Binomial
The expected value (mean) and variance of a binomial random variable are:

$$E(X) = np \quad \quad \text{Var}(X) = np(1-p)$$
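As a sanity check, the closed forms $np$ and $np(1-p)$ can be compared with the moments computed directly from the PMF. The sketch below reuses the coin example ($n=3$, $p=1/3$); the variable names are illustrative.

```python
from fractions import Fraction
from math import comb

n, p = 3, Fraction(1, 3)

# PMF values for x = 0, 1, ..., n
pmf = [comb(n, x) * p**x * (1 - p)**(n - x) for x in range(n + 1)]

# Moments by direct summation
mean = sum(x * px for x, px in enumerate(pmf))
variance = sum(x**2 * px for x, px in enumerate(pmf)) - mean**2

print(mean, n * p)                # 1 1
print(variance, n * p * (1 - p))  # 2/3 2/3
```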

## 4.3 Poisson Random Variable
<hr>

The Poisson Random Variable is used to model the number of events occurring within a fixed interval of time or space, under the assumption that these events happen with a known constant rate and independently of the time since the last event.

- $X$ represents the number of occurrences of events in a given time interval
- $\lambda$ is the average number of events in that interval

$$X \sim \text{Poisson}(\lambda)$$

### 4.3.1 PMF of Poisson
The PMF of a Poisson random variable $X$ is given by:

$$P(X=x)= e^{-\lambda} \frac{\lambda^x}{x!}, \quad x=0,1,2,\cdots$$

### 4.3.2 Expected Value and Variance of Poisson

$$E(X)=\lambda \quad \text{Var}(X)=\lambda$$

**Example:** Consider a phone operator who handles 5 calls every 3 minutes on average. What is the probability that there will be no calls in the next minute? What is the probability that there will be at least 3 calls within the next 2 minutes?

The rate $\lambda$ for one minute is $\frac{5}{3}$ calls. Let $X$ be the number of calls in one minute. Then,

$$X \sim \text{Poisson} \left(\lambda=\frac{5}{3}\right)$$

The probability of no calls is given as:

$$P(X=0)=e^{\frac{-5}{3}} \frac{\left(\frac{5}{3}\right)^0}{0!} = e^{\left(\frac{-5}{3}\right)}$$

The rate $\lambda$ for two minutes is $2 \times \frac{5}{3} = \frac{10}{3}$ calls. Let $Y$ be the number of calls in two minutes. Then, 

$$Y \sim \text{Poisson}\left(\frac{10}{3}\right)$$

The probability of at least 3 calls is given as:

$$P(Y \geq 3) = \sum_{i=3}^\infty e^{\frac{-10}{3}} \frac{\left(\frac{10}{3}\right)^i}{i!}$$

Equivalently, using the complement rule:

$$P(Y \geq 3) = 1-P(Y=0)-P(Y=1)-P(Y=2) = 1- e^{\frac{-10}{3}} - e^{\frac{-10}{3}} \frac{10}{3} - \frac{e^{\frac{-10}{3}} \left(\frac{10}{3}\right)^2}{2} = 64.72\%$$
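Both answers in the phone-operator example can be evaluated numerically. The following is a minimal sketch; `poisson_pmf` is an illustrative helper written for this example, not a library function.

```python
from math import exp, factorial

def poisson_pmf(x, lam):
    """Return P(X = x) for X ~ Poisson(lam), x = 0, 1, 2, ..."""
    return exp(-lam) * lam**x / factorial(x)

rate_per_minute = 5 / 3  # 5 calls per 3 minutes

# No calls in the next minute
p_no_calls = poisson_pmf(0, rate_per_minute)

# At least 3 calls in the next 2 minutes, via the complement
p_at_least_3 = 1 - sum(poisson_pmf(i, 2 * rate_per_minute) for i in range(3))

print(round(p_no_calls, 4))    # 0.1889
print(round(p_at_least_3, 4))  # 0.6472
```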

### 4.3.3 Poisson Approximation to the Binomial

Let,

$$X \sim \text{Binomial}(n, p)$$

If $n$ is large and $p$ is small (a common rule of thumb):

$$
\begin{cases}
n \geq 30 \\
p \leq 0.05
\end{cases}
$$

then instead of the Binomial we can use the Poisson approximation:

$$X \sim \text{Poisson}(\lambda = n\times p)$$

**Example:** 97% of electronic messages are transmitted with no errors. What is the probability that, out of 200 messages, at least 195 will be transmitted correctly?

Let $X= \text{the number of messages transmitted correctly out of 200}$

\begin{align}
X \sim \text{Binomial}(200, 0.97) \\
P(X \geq 195) = \sum_{i=195}^{200} \binom{200}{i} (0.97)^i (0.03)^{200-i}
\end{align}

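Here $p=0.97$ is not small, so the approximation is applied to the errors instead. Let $Y$ be the number of messages transmitted with errors out of 200, so that $Y = 200 - X$ and $Y \sim \text{Binomial}(200, 0.03)$. Then, approximately:

\begin{align}
Y \sim \text{Poisson}(\lambda=200 \times 0.03 = 6) \\
P(X \geq 195) = P(Y \leq 5) \approx \sum_{i=0}^5 e^{-6} \frac{6^i}{i!} = 44.57\%
\end{align}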
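To see how close the approximation is, the exact binomial sum and the Poisson sum can be computed side by side. This is a sketch under the example's assumptions (200 messages, 3% error rate); the variable names are illustrative, and the Poisson variable counts messages with errors.

```python
from math import comb, exp, factorial

n, p_correct = 200, 0.97
p_error = 1 - p_correct

# Exact: P(X >= 195) with X ~ Binomial(200, 0.97)
exact = sum(
    comb(n, i) * p_correct**i * p_error**(n - i)
    for i in range(195, n + 1)
)

# Approximation: P(Y <= 5) with Y ~ Poisson(n * p_error), Y = number of errors
lam = n * p_error
approx = sum(exp(-lam) * lam**i / factorial(i) for i in range(6))

print(f"exact binomial:  {exact:.4f}")
print(f"Poisson approx.: {approx:.4f}")
```

The two values should agree to roughly two decimal places, which is typical when $n$ is large and the event probability is small.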

## 4.4 Geometric Random Variable
<hr>

The Geometric Distribution models the number of trials needed to achieve the first success in a sequence of independent and identically distributed (iid) Bernoulli trials.

Here $X$ represents the number of trials up to and including the first success, and $p$ is the probability of success on any individual trial.

$$X \sim \text{Geometric}(p)$$

**Example:** Consider a coin-tossing experiment where we toss a coin until a head (H) appears, with $P(H)=2/3$.

- $X$ is the number of tosses until the first head appears.
- The sequence of tosses and the corresponding probabilities $P(X=x)$ are:
    - 1 Toss (H): $P(H)=2/3$
    - 2 Tosses (TH): $P(TH)=1/3 \times 2/3$
    - 3 Tosses (TTH): $P(TTH)=(1/3)^2 \times 2/3$
    - 4 Tosses (TTTH): $P(TTTH)=(1/3)^3 \times 2/3$
    - and so on...
    
*Note: The success trial is always positioned at the end of the sequence.*
    
### 4.4.1 PMF of Geometric

$$P(X=k)=(1-p)^{k-1} \times p, \quad k=1, 2, \cdots$$

For $X=k$, there are $k-1$ failures, each with probability $1-p$, followed by one success with probability $p$.
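The geometric PMF can be checked against the coin-tossing sequences listed earlier ($P(H)=2/3$). This is a minimal sketch with exact fractions; `geometric_pmf` is an illustrative helper name.

```python
from fractions import Fraction

def geometric_pmf(k, p):
    """Return P(X = k) for X ~ Geometric(p): first success on trial k = 1, 2, ..."""
    if k < 1:
        return 0
    return (1 - p)**(k - 1) * p

p = Fraction(2, 3)  # P(H) from the example
for k in range(1, 5):
    print(k, geometric_pmf(k, p))
# 1 2/3
# 2 2/9
# 3 2/27
# 4 2/81
```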

### 4.4.2 Expected Value and Variance of Geometric

$$E(X)=\frac{1}{p} \quad\quad \text{Var}(X)=\frac{1-p}{p^2}$$
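Because the geometric distribution has infinite support, its moments can only be checked numerically by truncating the sum. The sketch below assumes $p = 2/3$ (the coin example) and a cut-off of 200 terms, an arbitrary choice beyond which the tail is negligible for this $p$.

```python
p = 2 / 3
K = 200  # truncation point (assumption): terms beyond this are negligibly small

# PMF values for k = 1, ..., K
pmf = [(1 - p)**(k - 1) * p for k in range(1, K + 1)]

mean = sum(k * px for k, px in enumerate(pmf, start=1))
variance = sum(k**2 * px for k, px in enumerate(pmf, start=1)) - mean**2

print(round(mean, 6), round(variance, 6))  # 1.5 0.75
```

These match $1/p = 1.5$ and $(1-p)/p^2 = 0.75$.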

## 4.5 Negative Binomial Random Variable
<hr>

The Negative Binomial Distribution can be viewed as an extension of the Geometric Distribution. It models the number of trials required to achieve a specified number of successes (rather than just the first success) in a sequence of independent and identically distributed (iid) Bernoulli trials.

- $X$ denotes the number of trials needed to achieve $r$ successes.
- $r$ is the target number of successes.
- $p$ is the probability of success on any individual trial.

$$X \sim \text{NegBinomial}(r, p)$$

**Example:** Consider a coin-tossing experiment where we are interested in achieving $r=2$ heads (successes), with the probability of a head $P(H)=\frac{2}{3}$.

$X$ is the number of tosses until 2 heads are observed. Example sequences and probabilities:

- 2 Tosses (HH): $P=\left( \frac{2}{3} \right) \left(\frac{2}{3}\right)$
- 3 Tosses (HTH, THH): $P=2 \times \left[ \frac{1}{3} \left( \frac{2}{3}\right)^2 \right]$
- 4 Tosses (HTTH, THTH, TTHH): $P = 3 \times \left[ \left(\frac{1}{3}\right)^2 \left(\frac{2}{3}\right)^2 \right]$
- and so on ...

*The distribution starts with the best-case scenario where $X=r$, indicating all trials are successful. The last success is always considered fixed in its position.*

### 4.5.1 PMF of Negative Binomial

$$P(X=k) = \binom{k-1}{r-1} (1-p)^{k-r} p^r, \quad k=r, r+1, \cdots $$

- $k$ is the total number of trials
- $r$ is the number of successes
- The binomial coefficient $\binom{k-1}{r-1}$ represents the number of ways to arrange $r-1$ successes in the first $k-1$ trials
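The formula can be checked against the $r=2$, $P(H)=2/3$ example sequences. This is a small sketch with exact fractions; `negative_binomial_pmf` is an illustrative helper name, and fractions are printed in lowest terms.

```python
from fractions import Fraction
from math import comb

def negative_binomial_pmf(k, r, p):
    """Return P(X = k): the r-th success occurs exactly on trial k."""
    if k < r:
        return 0
    # r - 1 successes among the first k - 1 trials, then a success on trial k
    return comb(k - 1, r - 1) * (1 - p)**(k - r) * p**r

r, p = 2, Fraction(2, 3)
for k in range(2, 5):
    print(k, negative_binomial_pmf(k, r, p))
# 2 4/9
# 3 8/27
# 4 4/27
```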

### 4.5.2 Expected Value and Variance of Negative Binomial

$$E(X)=\frac{r}{p} \quad \quad \text{Var}(X)= \frac{r(1-p)}{p^2}$$
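A truncated sum gives a quick numerical check of these two formulas. The sketch assumes $r=2$ and $p=2/3$ (the coin example) and an arbitrary cut-off of 300 trials, beyond which the tail is negligible for these values.

```python
from math import comb

r, p = 2, 2 / 3
K = 300  # truncation point (assumption)

# (k, P(X = k)) pairs for k = r, ..., K
terms = [
    (k, comb(k - 1, r - 1) * (1 - p)**(k - r) * p**r)
    for k in range(r, K + 1)
]

mean = sum(k * px for k, px in terms)
variance = sum(k**2 * px for k, px in terms) - mean**2

print(round(mean, 6), round(variance, 6))  # 3.0 1.5
```

These match $r/p = 3$ and $r(1-p)/p^2 = 1.5$.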

## 4.6 Hypergeometric Random Variable
<hr>

The Hypergeometric Distribution models the number of successes (e.g., red balls) obtained when drawing without replacement from a finite population. Because items are not replaced, the draws are not independent and the success probability changes from draw to draw, which distinguishes it from the Binomial Distribution.

- $X$: Represents the number of successes (e.g., red balls) in the sample.
- $N$: Total number of items in the population.
- $m$: Number of successes in the population.
- $k$: Number of draws (sample size).

$$X \sim \text{HyperGeo}(N, m, k)$$

**Example:** Consider a box containing $N$ balls, of which $m$ are red and $N−m$ are green. Suppose we draw $k$ balls without replacement.

- $X$: Number of red balls in our sample of size $k$.
- The possible values of $X$ start at $0$ and go up to at most $\min(k, m)$: we cannot draw more red balls than the $k$ balls sampled or the $m$ red balls available.

### 4.6.1 PMF of Hypergeometric

$$P(X=x) = \frac{\binom{m}{x} \binom{N-m}{k-x}}{\binom{N}{k}}$$

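$$
\begin{cases}
x=0,1, \cdots, k & \text{if } k \leq m \\
x=0,1, \cdots, m & \text{if } k \gt m
\end{cases}
$$

When $k > N-m$, at least $k-(N-m)$ red balls must be drawn, so the smallest values of $x$ have probability zero (with the convention $\binom{a}{b}=0$ for $b>a$).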

- $X$: Number of successes (red balls) in the sample.
- $\binom{m}{x}$: Ways to choose $x$ successes.
- $\binom{N-m}{k-x}$: Ways to choose the remaining $k-x$ failures (green balls).
- $\binom{N}{k}$: Total ways to choose $k$ balls from $N$.
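The counting argument above translates directly into code. The sketch below uses hypothetical values ($N=10$ balls, $m=4$ red, $k=3$ draws) that are not from the text; `hypergeometric_pmf` is an illustrative helper name.

```python
from fractions import Fraction
from math import comb

def hypergeometric_pmf(x, N, m, k):
    """Return P(X = x) red balls when drawing k of N balls, m of which are red."""
    if x < 0 or x > k:
        return Fraction(0)
    # math.comb returns 0 when the lower index exceeds the upper one,
    # so impossible counts (e.g. x > m) get probability 0 automatically.
    return Fraction(comb(m, x) * comb(N - m, k - x), comb(N, k))

N, m, k = 10, 4, 3  # hypothetical box: 10 balls, 4 red, draw 3
for x in range(k + 1):
    print(x, hypergeometric_pmf(x, N, m, k))
# 0 1/6
# 1 1/2
# 2 3/10
# 3 1/30
```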

### 4.6.2 Expected Value and Variance of Hypergeometric

$$E(X)=\frac{km}{N} \quad \quad \text{Var}(X) = \frac{km}{N} \left(\frac{N-m}{N}\right) \left(\frac{N-k}{N-1}\right)$$
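The closed forms can be compared with moments computed directly from the PMF. The sketch uses hypothetical values ($N=10$, $m=4$, $k=3$) chosen only for illustration.

```python
from fractions import Fraction
from math import comb

N, m, k = 10, 4, 3  # hypothetical values

# PMF values for x = 0, 1, ..., k
pmf = [Fraction(comb(m, x) * comb(N - m, k - x), comb(N, k)) for x in range(k + 1)]

mean = sum(x * px for x, px in enumerate(pmf))
variance = sum(x**2 * px for x, px in enumerate(pmf)) - mean**2

# Closed-form expressions from the formulas above
mean_formula = Fraction(k * m, N)
variance_formula = mean_formula * Fraction(N - m, N) * Fraction(N - k, N - 1)

print(mean, mean_formula)          # 6/5 6/5
print(variance, variance_formula)  # 14/25 14/25
```

The factor $\frac{N-k}{N-1}$ is what separates this variance from the binomial variance $kp(1-p)$ with $p = m/N$; it shrinks toward 0 as the sample size $k$ approaches the population size $N$.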