# Bernoulli probability distribution
The **Bernoulli probability distribution** is a discrete probability distribution for a random variable which takes the value 1 with probability $p$ and the value 0 with probability $1-p$. It models the outcome of a single experiment (trial) that has exactly two possible outcomes: "success" (1) and "failure" (0).

#### Parameters
$p$: Probability of success ($0 \leq p \leq 1$)

#### Support
The support of a discrete probability distribution is the set of values that the random variable takes with non-zero probability. For the Bernoulli distribution with $0 < p < 1$, the support is $\{0, 1\}$. In the degenerate cases $p = 0$ and $p = 1$, only one of the two values has non-zero probability.

The support should not be confused with the parameter range:

- **Support** is the set of values the random variable can take with non-zero probability (for Bernoulli, $\{0, 1\}$).
- **Parameter range** is the set of valid values for the distribution's parameters (for Bernoulli, $p \in [0, 1]$).
- The support describes the outcomes of the random process, while the parameter range describes the allowable settings of the distribution's defining parameters.

#### Probability Mass Function (PMF)
$$
P(X = x) = p^x (1-p)^{1-x}, \quad x \in \{0, 1\}
$$

This is equivalent to the piecewise form
$$
P(X = x) =
\begin{cases}
p & \text{if } x = 1 \\
1 - p & \text{if } x = 0
\end{cases}
$$
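As a quick check of the two equivalent forms above, the sketch below evaluates the closed-form expression $p^x (1-p)^{1-x}$ and compares it with `scipy.stats.bernoulli.pmf`. The helper name `bernoulli_pmf` and the value $p = 0.7$ are illustrative choices, not part of any library.

```python
from scipy.stats import bernoulli


def bernoulli_pmf(x, p):
    """Closed-form PMF p^x (1-p)^(1-x) on {0, 1}; zero everywhere else."""
    if x not in (0, 1):
        return 0.0
    return p**x * (1 - p) ** (1 - x)


p = 0.7  # illustrative probability of success
for x in (0, 1):
    # Both columns should agree up to floating-point rounding.
    print(x, bernoulli_pmf(x, p), bernoulli.pmf(x, p))
```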

#### Mean (Expected Value)
$$
\mathbb{E}[X] = p
$$

#### Variance
$$
\mathrm{Var}(X) = p(1-p)
$$

#### Standard Deviation
$$
\sigma = \sqrt{p(1-p)}
$$
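The mean, variance, and standard deviation follow directly from the PMF by summing over the two outcomes. The sketch below does that summation explicitly and compares the results with the closed forms and with SciPy's `mean`, `var`, and `std` methods. The value $p = 0.7$ is an arbitrary example.

```python
import math

from scipy.stats import bernoulli

p = 0.7  # illustrative probability of success
pmf = {0: 1 - p, 1: p}

# E[X] = sum over x of x * P(X = x)
mean = sum(x * mass for x, mass in pmf.items())
# Var(X) = sum over x of (x - E[X])^2 * P(X = x)
var = sum((x - mean) ** 2 * mass for x, mass in pmf.items())
std = math.sqrt(var)

print("mean:", mean, "closed form:", p, "scipy:", bernoulli.mean(p))
print("var: ", var, "closed form:", p * (1 - p), "scipy:", bernoulli.var(p))
print("std: ", std, "scipy:", bernoulli.std(p))
```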

#### Median
- If $p < 0.5$, the median is $0$
- If $p > 0.5$, the median is $1$
- If $p = 0.5$, the median is not unique: every value in $[0, 1]$ is a median, including both $0$ and $1$

#### Mode
- If $p < 0.5$, mode is $0$
- If $p > 0.5$, mode is $1$
- If $p = 0.5$, both $0$ and $1$ are modes
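The rules for the median and mode can be checked numerically. In the sketch below, `bernoulli_modes` is a hypothetical helper that returns the value or values with the largest probability mass, and the median comes from SciPy's `median` method, which evaluates the quantile function at 0.5. Only the non-tied cases $p = 0.3$ and $p = 0.7$ are printed.

```python
from scipy.stats import bernoulli


def bernoulli_modes(p):
    """Return the value(s) in {0, 1} with the largest probability mass."""
    pmf = {0: 1 - p, 1: p}
    top = max(pmf.values())
    return [x for x, mass in pmf.items() if mass == top]


for p in (0.3, 0.7):
    print("p =", p, "modes:", bernoulli_modes(p), "median:", bernoulli.median(p))
```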

#### Cumulative Distribution Function (CDF)
$$
F(x) = 
\begin{cases}
0 & \text{if } x < 0 \\
1 - p & \text{if } 0 \leq x < 1 \\
1 & \text{if } x \geq 1
\end{cases}
$$
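The piecewise CDF translates directly into code. The sketch below implements it as a plain function (the name `bernoulli_cdf` is illustrative) and compares it with `scipy.stats.bernoulli.cdf` at a few points, including non-integer ones.

```python
from scipy.stats import bernoulli


def bernoulli_cdf(x, p):
    """Piecewise CDF: 0 below 0, 1 - p on [0, 1), and 1 from 1 onwards."""
    if x < 0:
        return 0.0
    if x < 1:
        return 1.0 - p
    return 1.0


p = 0.7  # illustrative probability of success
for x in (-0.5, 0, 0.5, 1, 2):
    # Both columns should agree up to floating-point rounding.
    print(x, bernoulli_cdf(x, p), bernoulli.cdf(x, p))
```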

#### Mean Absolute Deviation (MAD)
The mean absolute deviation about the mean is
$$
\mathrm{MAD} = \mathbb{E}\,|X - p| = p(1-p) + (1-p)p = 2p(1-p)
$$
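The value $2p(1-p)$ is the mean absolute deviation about the mean, $\mathbb{E}\,|X - p|$. A minimal numerical check, summing over the two outcomes with an arbitrary $p = 0.7$:

```python
import math

p = 0.7  # illustrative probability of success
pmf = {0: 1 - p, 1: p}
mean = p

# E|X - mean| = |0 - p| * (1 - p) + |1 - p| * p
mean_abs_dev = sum(abs(x - mean) * mass for x, mass in pmf.items())
closed_form = 2 * p * (1 - p)

print(mean_abs_dev, closed_form, math.isclose(mean_abs_dev, closed_form))
```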

#### Skewness
With $q = 1 - p$,
$$
\gamma_1 = \frac{1 - 2p}{\sqrt{p(1-p)}} = \frac{q - p}{\sqrt{pq}}
$$
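The skewness is the third central moment divided by $\sigma^3$. The sketch below computes it from the PMF, from the closed form, and from SciPy's `stats` method with `moments="s"`. The value $p = 0.7$ is an arbitrary example; the result is negative because most of the mass sits on 1 when $p > 0.5$.

```python
import math

from scipy.stats import bernoulli

p = 0.7  # illustrative probability of success
q = 1 - p
mean = p
std = math.sqrt(p * q)

# Third central moment: E[(X - mean)^3], summed over the outcomes 0 and 1.
third_central = sum((x - mean) ** 3 * mass for x, mass in ((0, q), (1, p)))
skew_from_pmf = third_central / std**3

skew_closed_form = (1 - 2 * p) / math.sqrt(p * q)
skew_scipy = float(bernoulli.stats(p, moments="s"))

print(skew_from_pmf, skew_closed_form, skew_scipy)
```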

## Use Cases
The Bernoulli distribution is used to model random experiments with two possible outcomes, such as flipping a coin (heads or tails), success or failure in a trial, or yes/no survey responses. The following example uses Python's `scipy.stats` module to draw samples from a Bernoulli random variable:

```python
from scipy.stats import bernoulli
```

Sample from a Bernoulli distribution with a probability of success $p = 0.7$, drawing 10 samples:

```python
p = 0.7 # probability of success
n = 10  # number of trials
# Generate a sample of size n
sample = bernoulli.rvs(p, size=n)
print("Sample from Bernoulli distribution with p=0.7:", sample)
```

Example output (the values vary from run to run):

```
Sample from Bernoulli distribution with p=0.7: [0 0 1 1 1 1 1 1 1 1]
```
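With a larger sample, the empirical mean and variance should land close to $p$ and $p(1-p)$. The sketch below assumes a fixed seed (0, chosen arbitrarily) passed through the `random_state` argument so the draw is reproducible; the sample size of 100,000 is also an arbitrary choice.

```python
import numpy as np
from scipy.stats import bernoulli

p = 0.7      # probability of success
n = 100_000  # sample size (arbitrary, large enough for a stable estimate)

rng = np.random.default_rng(0)  # fixed seed, assumed here for reproducibility
sample = bernoulli.rvs(p, size=n, random_state=rng)

sample_mean = sample.mean()  # estimates p
sample_var = sample.var()    # estimates p * (1 - p)

print("sample mean:", sample_mean, "expected:", p)
print("sample variance:", sample_var, "expected:", p * (1 - p))
```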


## Relationship to other probability distributions

### Binomial distribution
The Bernoulli distribution is the special case of the binomial distribution with a single trial ($n = 1$): $\operatorname{Bernoulli}(p) = \operatorname{Bin}(1, p)$.

If $X_1, X_2, \dots, X_n$ are $n$ independent and identically distributed random variables with $X_i \sim \operatorname{Bernoulli}(p)$, then the random variable $X_1 + X_2 + \cdots + X_n$ follows a binomial distribution with parameters $n$ and $p$, that is,

$$
\sum_{i=1}^{n} X_i \sim \operatorname{Bin}(n, p)
$$
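This relationship can be verified exactly, without sampling: the PMF of a sum of independent random variables is the convolution of their PMFs. The sketch below convolves the Bernoulli PMF with itself $n$ times using `numpy.convolve` and compares the result with `scipy.stats.binom.pmf`. The values $p = 0.3$ and $n = 5$ are arbitrary.

```python
import numpy as np
from scipy.stats import binom

p, n = 0.3, 5  # illustrative parameters

bern_pmf = np.array([1 - p, p])  # P(X = 0), P(X = 1)

# Start from the PMF of the constant 0 and add one Bernoulli trial at a time.
sum_pmf = np.array([1.0])
for _ in range(n):
    sum_pmf = np.convolve(sum_pmf, bern_pmf)

binom_pmf = binom.pmf(np.arange(n + 1), n, p)

print(np.allclose(sum_pmf, binom_pmf))
```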

### Categorical distribution
The categorical distribution generalizes the Bernoulli distribution to random variables that take any fixed, finite number of discrete values. The Bernoulli distribution is the special case with two categories.

### Beta distribution
The Beta distribution is the conjugate prior of the Bernoulli distribution.
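Conjugacy means that a Beta prior on $p$ combined with Bernoulli observations yields a Beta posterior: a $\operatorname{Beta}(a, b)$ prior becomes $\operatorname{Beta}(a + s, b + f)$ after observing $s$ successes and $f$ failures. The sketch below illustrates the update; the helper `update_beta`, the uniform $\operatorname{Beta}(1, 1)$ prior, and the observation list are all made up for illustration.

```python
from scipy.stats import beta


def update_beta(a, b, observations):
    """Return posterior Beta parameters after observing 0/1 outcomes."""
    successes = sum(observations)
    failures = len(observations) - successes
    return a + successes, b + failures


a_prior, b_prior = 1.0, 1.0      # uniform prior on p (an assumption)
data = [1, 0, 1, 1, 0, 1, 1, 1]  # made-up observations: 6 successes, 2 failures

a_post, b_post = update_beta(a_prior, b_prior, data)
posterior_mean = beta.mean(a_post, b_post)  # equals a_post / (a_post + b_post)

print("posterior parameters:", a_post, b_post)
print("posterior mean of p:", posterior_mean)
```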

### Geometric distribution
The geometric distribution models the number of independent and identical Bernoulli trials needed to get the first success, counting the successful trial itself.
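Under this definition, the first success occurs on trial $k$ when $k - 1$ failures are followed by one success, so $P(K = k) = (1-p)^{k-1} p$ for $k = 1, 2, \dots$. The sketch below compares that expression with `scipy.stats.geom`, which uses the same trial-counting convention; $p = 0.3$ is an arbitrary example.

```python
import numpy as np
from scipy.stats import geom

p = 0.3              # illustrative probability of success per trial
k = np.arange(1, 6)  # trial numbers 1..5

# k - 1 failures followed by one success
closed_form = (1 - p) ** (k - 1) * p
scipy_pmf = geom.pmf(k, p)

print(np.allclose(closed_form, scipy_pmf))
print("expected number of trials:", geom.mean(p), "vs 1/p =", 1 / p)
```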


