## $$Statistics \ Cheat \ Sheet \ - \ Distributions$$

#### 1. Normal Distribution (Gaussian Distribution) of Continuous RV, $X \sim N(\mu, \sigma^2)$
This distribution is commonly used to model measurement error, intelligence/ability scores, heights, and averages of large amounts of data.

**PDF**
$$f(x; \mu, \sigma) = \frac{1}{\sqrt{2 \pi \sigma^2}}e^{-(x-\mu)^2/(2 \sigma^2)}$$
where $ -\infty \lt x \lt \infty, -\infty \lt \mu \lt \infty, 0 \lt \sigma$ <BR>
For n independent RVs,
$$f(x_1,...,x_n;\mu,\sigma^2) = \frac{1}{\sqrt{2 \pi \sigma^2}}e^{-(x_1 -\mu)^2/(2 \sigma^2)} \cdot ... \cdot \frac{1}{\sqrt{2 \pi \sigma^2}}e^{-(x_n -\mu)^2/(2 \sigma^2)} = \biggl(\frac{1}{2 \pi \sigma^2}\biggl)^{n/2}e^{-\sum (x_i-\mu)^2/(2 \sigma^2)}$$
    
**CDF**
$$F(x) = P(X \leq x) = \frac{1}{\sigma \sqrt{2 \pi}} \int_{-\infty}^x e^{-(v-\mu)^2/(2 \sigma^2)} dv$$
$$P(a \leq X \leq b) = \int_a^b\frac{1}{\sqrt{2 \pi \sigma^2}}e^{-(x-\mu)^2/(2 \sigma^2)}dx$$
    
**Properties**
* 68% of the area is within 1 standard deviation of the mean, $P(-1 \leq Z \leq 1) \approx .68$
* 95% is within 2 standard deviations, $P(-2 \leq Z \leq 2) \approx .95$; more precisely $P(-1.96 \leq Z \leq 1.96) \approx .95$
* 99% is within 2.58 standard deviations, $P(-2.58 \leq Z \leq 2.58) \approx .99$
* 99.7% is within 3 standard deviations, $P(-3 \leq Z \leq 3) \approx .997$; 99.9% is within 3.29 standard deviations, $P(-3.29 \leq Z \leq 3.29) \approx .999$

Also,
* $P(Z \lt 1) \approx 0.84$
* $P(Z \lt 2) \approx 0.977$
* $P(Z \lt 3) \approx 0.999$
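
The areas listed above can be reproduced numerically. The following is a minimal sketch that assumes Python's standard-library `statistics.NormalDist` as the source of the standard normal CDF; the helper name `central_area` is illustrative only.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal: mean 0, standard deviation 1

def central_area(c):
    """Return P(-c <= Z <= c) for the standard normal distribution."""
    return Z.cdf(c) - Z.cdf(-c)

# Central areas for the cut-offs quoted in the list above.
areas = {c: central_area(c) for c in (1, 1.96, 2, 2.58, 3, 3.29)}
for c, area in areas.items():
    print(c, round(area, 4))

# One-sided areas P(Z < c).
left_tail = {c: Z.cdf(c) for c in (1, 2, 3)}
```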

**Percentiles of an Arbitrary Normal Distribution**<BR>
$(100p)$th percentile for normal $(\mu, \sigma)$ = $\mu$ + \[ $(100p)$th percentile for standard normal \] $\cdot \ \sigma$
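
As a small illustration of the percentile relation, the sketch below shifts and scales a standard normal percentile. The function name `normal_percentile` and the example values (mean 170, standard deviation 10) are hypothetical.

```python
from statistics import NormalDist

def normal_percentile(p, mu, sigma):
    """(100p)th percentile of N(mu, sigma^2), computed as mu + z_p * sigma."""
    z_p = NormalDist().inv_cdf(p)  # (100p)th percentile of the standard normal
    return mu + z_p * sigma

# Hypothetical example: 95th percentile when mu = 170 and sigma = 10.
p95 = normal_percentile(0.95, mu=170, sigma=10)
print(round(p95, 2))  # about 186.45
```

The result agrees with asking the shifted distribution directly, i.e. `NormalDist(170, 10).inv_cdf(0.95)`.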
    
++ If the **skewness** is less than -1 or greater than 1, the distribution is considered to be substantially skewed

#### 2. Standard Normal Distribution ( z-Distribution) of Continuous RV, $Z \sim N(\mu=0, \sigma = 1)$

**PDF**
$$f(z; 0, 1) = \frac{1}{\sqrt{2 \pi}}e^{-z^2/2}$$
where $ -\infty \lt z \lt \infty$

**CDF**

The CDF $\phi(z)$ is obtained as the area under the standard normal pdf $f(y;0,1)$ to the left of $z$.

$$\phi(z) = P(Z \leq z) = \int_{-\infty}^z f(y;0,1)dy$$
$$=\frac{1}{\sqrt{2 \pi}} \int_{-\infty}^z e^{-u^2/2}du$$
$$=\frac{1}{2} + \frac{1}{\sqrt{2 \pi}} \int_0^z e^{-u^2/2}du$$
where the area under the standard normal curve to the left of 0 (between $-\infty$ and 0) is 1/2.

For any $c \gt 0$,
$$P(|Z| \gt c) = P(Z \gt c) + P(Z \lt -c) = 2P(Z \gt c) = 2[1 - \phi(c)]$$

**Critical Values**<BR>
$z_\alpha$ is the $100(1 - \alpha)$th percentile of the standard normal distribution
    
* 95% of the area of standard normal distribution is within $\pm$1.96
* 99% of the area of standard normal distribution is within $\pm$2.58

++ Use qnorm to find the z critical values, e.g. $z_{0.05} = qnorm(0.95, 0, 1) \approx 1.645$. <BR>
++ A standard normal curve table can also be used: find the row and column of the table entry closest to the percentile (.95 in the above example); the corresponding z value is approximately 1.645. <BR>
++ Use pnorm with a z value as parameter to get the corresponding percentile, and use qnorm with the percentile to get the corresponding z value
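
For readers working outside R, the same lookups can be sketched in Python. This assumes the standard-library `statistics.NormalDist`, whose `inv_cdf` and `cdf` methods play the roles of qnorm and pnorm for the standard normal; the helper name `z_critical` is illustrative.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal

def z_critical(alpha):
    """z_alpha, the 100(1 - alpha)th percentile (counterpart of qnorm(1 - alpha, 0, 1))."""
    return Z.inv_cdf(1 - alpha)

z_05 = z_critical(0.05)    # about 1.645
z_025 = z_critical(0.025)  # about 1.96, the two-sided 95% cut-off
z_005 = z_critical(0.005)  # about 2.58, the two-sided 99% cut-off

# Going the other way (counterpart of pnorm): recover the percentile from z.
percentile = Z.cdf(z_05)
```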

#### 3. Standardized Variable
If $X$ has a normal distribution with mean $\mu$ and standard deviation $\sigma$, then
$$Z = \frac{X - \mu}{\sigma}$$
has a standard normal distribution; $Z$ is called the standardized variable. Thus
$$P(a \leq X \leq b) = P(\frac{a - \mu}{\sigma} \leq Z \leq \frac{b - \mu}{\sigma})$$
$$=\phi(\frac{b - \mu}{\sigma}) - \phi(\frac{a - \mu}{\sigma})$$
$$P(X \leq a) = \phi(\frac{a - \mu}{\sigma})$$
$$P(X \geq b) = 1 - \phi(\frac{b - \mu}{\sigma})$$

$$P(Z \leq z) = P(X \leq \sigma z + \mu) = \int_{-\infty}^{\sigma z + \mu}f(x;\mu, \sigma)dx$$

++ Use table for Standard normal curve or the R function **pnorm** to find out $\phi$ values
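
A short sketch of the standardization step, assuming Python's `statistics.NormalDist` in place of a table or pnorm. The function name `prob_between` and the example parameters are hypothetical.

```python
from statistics import NormalDist

def prob_between(a, b, mu, sigma):
    """P(a <= X <= b) for X ~ N(mu, sigma^2), computed by standardizing the endpoints."""
    z_a = (a - mu) / sigma
    z_b = (b - mu) / sigma
    std = NormalDist()  # standard normal
    return std.cdf(z_b) - std.cdf(z_a)

# Hypothetical example: X ~ N(100, 15^2); the endpoints standardize to z = -1 and z = 2.
p = prob_between(85, 130, mu=100, sigma=15)
print(round(p, 4))
```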

#### 4. Binomial RV, X ~ Bin(n,p)
**Bernoulli Distribution**

Suppose we repeat an experiment having only two outcomes $ \{\text{success},\text{failure}\}$, $N$ times in such a way that the outcomes are independent, where $p = P(\text{success})$. Then
$$ P(\text{ k successes in N trials }) = {{N}\choose{k}}p^k(1-p)^{N-k} $$
If $X$ and $Y$ are independent and $X \sim Bin(n,p)$ and $Y \sim Bin(m,p)$, then $X + Y \sim Bin(n + m , p)$

**PMF**
$$b(x; n,p) = \begin{cases}
\binom nxp^x(1-p)^{n-x}, & \ x=0,1,2,3,...,n \\
0, & \ otherwise
\end{cases}$$

**CDF**
$$B(x; n,p) = P(X \leq x) = \sum_{y=0}^xb(y;n,p)  \ \  x=0,1,2,...,n$$

++ Use a cumulative binomial probability table to find these values<BR>

**Mean**
$$E(X)=np$$ where n is the number of trials and p is the probability of success in a single trial.<BR>
    
**Variance** 
$$V(X) = np(1-p) = npq$$ where $q=1-p$<BR>
    
**Standard Deviation** 
    $$\sigma_X = \sqrt{npq}$$
    
**Normal Approximation**<BR>
The standardized binomial variable approaches the standard normal distribution as $n \to \infty$:
$$\lim_{n \to \infty} P \biggl(a \leq \frac{X - np}{\sqrt{npq}} \leq b \biggr) = \frac{1}{\sqrt{2 \pi}} \int_a^b e^{-u^2/2}du$$
$$P(X \leq x) = B(x; n,p) \approx (area \ under \ normal \ curve \ to \ the \ left \ of \ x + .5)$$
$$=\phi(\frac{x + .5 - np}{\sqrt{npq}})$$
provided $np \geq 10$ and $nq \geq 10$
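
To see how close the approximation is, the sketch below compares the exact binomial CDF with the continuity-corrected normal value. The function names and the example $n = 50$, $p = 0.4$ (so $np = 20$, $nq = 30$) are illustrative assumptions.

```python
from math import comb, sqrt
from statistics import NormalDist

def binom_pmf(x, n, p):
    """b(x; n, p): probability of exactly x successes in n trials."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

def binom_cdf(x, n, p):
    """B(x; n, p) = P(X <= x), summed directly from the pmf."""
    return sum(binom_pmf(y, n, p) for y in range(x + 1))

def binom_cdf_normal_approx(x, n, p):
    """Normal approximation to B(x; n, p) with the +0.5 continuity correction."""
    q = 1 - p
    return NormalDist().cdf((x + 0.5 - n * p) / sqrt(n * p * q))

n, p = 50, 0.4  # hypothetical values satisfying np >= 10 and nq >= 10
exact = binom_cdf(20, n, p)
approx = binom_cdf_normal_approx(20, n, p)
print(round(exact, 4), round(approx, 4))
```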

#### 5. Hypergeometric Distribution
$$p(x) = P(X = x) = h(x; n, M, N) = \frac{\binom Mx \binom {N - M}{n - x}}{\binom Nn}$$
for $x$, an integer, satisfying $max(0, n - N + M) \leq  x \leq min(n,M)$. Here $n$ is the sample size, $N$ is the population size and $M$ is the number of successes in the population

**Mean**
$$E(X) = n \cdot \frac{M}{N} = np$$
where $p = M/N$ is the proportion of success in the population

**Variance**
$$V(X) = \biggl(\frac{N - n}{N - 1}\biggl) \cdot n \cdot \frac{M}{N} \cdot \biggl(1 - \frac{M}{N}\biggl)$$
$$=\biggl(\frac{N - n}{N - 1}\biggl) \cdot np \cdot (1-p)$$
where $(N - n)/(N - 1)$ is the **finite population correction factor**.<BR>
$(N - n)/(N - 1) = (1 - n/N)/(1 - 1/N) \approx 1$ when $n$ is small relative to $N$.
    
Also, when the population size $N$ is unknown, setting the observed $x$ equal to $E(X) = nM/N$ gives the estimate
$$\hat N = \frac{M \cdot n}{x}$$
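
The mean and variance formulas can be checked against the pmf directly. This is a minimal sketch using `math.comb`; the function name and the example values $n = 5$, $M = 12$, $N = 20$ are hypothetical.

```python
from math import comb

def hypergeom_pmf(x, n, M, N):
    """h(x; n, M, N): x successes in a sample of n from N items, M of which are successes."""
    if x < max(0, n - N + M) or x > min(n, M):
        return 0.0
    return comb(M, x) * comb(N - M, n - x) / comb(N, n)

n, M, N = 5, 12, 20  # hypothetical sample size, successes, population size
support = range(max(0, n - N + M), min(n, M) + 1)

# Moments summed directly from the pmf.
mean = sum(x * hypergeom_pmf(x, n, M, N) for x in support)
var = sum((x - mean) ** 2 * hypergeom_pmf(x, n, M, N) for x in support)

# Closed-form values, including the finite population correction factor.
p = M / N
mean_formula = n * p
var_formula = (N - n) / (N - 1) * n * p * (1 - p)
```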

#### 6. Negative Binomial Distribution
$$p(x) = P(X = x) = nb(x; r, p) = \binom {x + r - 1}{r - 1}p^r(1 - p)^x \ \ x=0,1,2...$$
where $X$ is the number of failures that precede the $r$th success, $r$ is the number of successes and $p=P(S)$ is the probability of success

$r = 1$ is the special case for the number of failures before the first success:
$$nb(x;1,p) = (1 - p)^xp \ \ x=0,1,2...$$

The expected number of failures until the first success is $1/p - 1 = (1-p)/p$

**Mean**
$$E(X) = \frac{r(1 - p)}{p}$$

**Variance**
$$V(X) = \frac{r(1 - p)}{p^2}$$
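
A quick numerical check of these formulas, sketched by summing the pmf over a truncated range. The cut-off of 400 terms and the values $r = 3$, $p = 0.4$ are assumptions chosen so that the neglected tail is negligible.

```python
from math import comb

def nb_pmf(x, r, p):
    """nb(x; r, p): probability of x failures before the r-th success."""
    return comb(x + r - 1, r - 1) * p**r * (1 - p) ** x

r, p = 3, 0.4    # hypothetical parameters
xs = range(400)  # truncated support; the remaining tail mass is negligible here

total = sum(nb_pmf(x, r, p) for x in xs)
mean = sum(x * nb_pmf(x, r, p) for x in xs)
var = sum((x - mean) ** 2 * nb_pmf(x, r, p) for x in xs)

mean_formula = r * (1 - p) / p      # 4.5 for these values
var_formula = r * (1 - p) / p**2    # 11.25 for these values
```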

#### 7. Poisson Distribution
$$p(x; \mu) = P(X = x) = \frac{\mu^x e^{- \mu}}{x!} \ \ x=0,1,2...$$

From **Maclaurin series expansion** of $e^\mu$:
$$e^\mu = 1 + \mu + \frac{\mu^2}{2!} + \frac{\mu^3}{3!} + ... = \sum_{x = 0}^{\infty} \frac{\mu^x}{x!}$$
$$\implies \sum p(x; \mu) = \sum_{x = 0}^{\infty} \frac{e^{-\mu} \cdot \mu^x}{x!} = 1$$


If $n \to \infty$ and $p \to 0$ in such a way that $np$ approaches a value $\mu \gt 0$, then $b(x;n,p) \to p(x; \mu)$. In other words, in any binomial experiment in which $n$ is large and $p$ is small, $b(x;n,p) \approx p(x; \mu)$ with $\mu = np$. <BR>
As a rule of thumb, this approximation can be safely applied when $n \gt 50$ and $np \lt 5$. <BR>
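
The quality of the Poisson approximation can be inspected term by term. The sketch below uses hypothetical values $n = 1000$ and $p = 0.003$ (so $\mu = 3$) and reports the largest pointwise gap between the two pmfs.

```python
from math import comb, exp, factorial

def binom_pmf(x, n, p):
    """b(x; n, p): binomial probability of exactly x successes."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

def poisson_pmf(x, mu):
    """p(x; mu): Poisson probability of exactly x events."""
    return mu**x * exp(-mu) / factorial(x)

n, p = 1000, 0.003  # hypothetical: n large, p small
mu = n * p          # Poisson parameter matched to the binomial mean

# Largest absolute difference between the two pmfs over x = 0..15.
max_gap = max(abs(binom_pmf(x, n, p) - poisson_pmf(x, mu)) for x in range(16))
print(max_gap)
```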
    
**CDF**
$$F_X(x) = \frac{\Gamma(x + 1, \mu)}{x!}$$
where $\Gamma$ is the **upper incomplete gamma function**, a special function that is normally defined in terms of an integral
$$\Gamma(s,x) = \int_x^\infty t^{s - 1}e^{-t} dt$$
    
**Mean and Variance**
$$E(X) = V(X) = \mu$$
    
The standardized Poisson variable approaches the standard normal distribution as $\mu \to \infty$:
$$\lim_{\mu \to \infty} P \biggl(a \leq \frac{X - \mu}{\sqrt{\mu}} \leq b \biggr) = \frac{1}{\sqrt{2 \pi}} \int_a^b e^{-u^2/2}du$$
where $\frac{X - \mu}{\sqrt\mu}$ is the standardized random variable

**Poisson Process**<BR>
Probability that $k$ events will be observed during any particular time interval of length $t$
$$P_k(t) = \frac{e^{-\alpha t} \cdot (\alpha t)^k}{k!}$$
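where $\mu = \alpha t$ and $\alpha$ is the rate of the event process, the expected number of events occurring in unit time.

If $X_1$ is the time until the first event, then $X_1 \gt t$ exactly when no events occur in an interval of length $t$, so
$$P(X_1 \leq t) = 1 - P(X_1 \gt t) = 1 - P_0(t) = 1 - e^{-\alpha t}$$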
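
The link between the event counts and the waiting time for the first event can be sketched as follows. The function names and the rate $\alpha = 2$ with $t = 1.5$ are hypothetical.

```python
from math import exp, factorial

def poisson_process_prob(k, alpha, t):
    """P_k(t): probability of k events in an interval of length t at rate alpha."""
    mu = alpha * t
    return exp(-mu) * mu**k / factorial(k)

def first_event_cdf(t, alpha):
    """P(X_1 <= t): probability that the first event occurs by time t."""
    return 1 - exp(-alpha * t)

alpha, t = 2.0, 1.5                           # hypothetical rate and interval length
p_none = poisson_process_prob(0, alpha, t)    # no events in the interval, e^{-3}
p_first = first_event_cdf(t, alpha)           # first event has occurred by time t
```

The two probabilities are complementary, which is the statement $P(X_1 \gt t) = P_0(t)$.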

#### 8. Exponential Distribution
This distribution is used to model the waiting time for a continuous process to change state.
$$f(x; \lambda) = \begin{cases}
\lambda e^{-\lambda x} & \ x \geq 0 \\
0 & \ otherwise
\end{cases} $$

This is also expressed as
$$f(x; \lambda) = \begin{cases}
\frac{1}{\beta} e^{\frac{-x}{\beta}} & \ x \geq 0 \\
0 & \ otherwise
\end{cases} $$
where $\beta = 1/\lambda$

For $n$ independent RVs, 
$$f(x_1,...,x_n;\lambda) = (\lambda e^{-\lambda x_1})...(\lambda e^{-\lambda x_n}) = \lambda^ne^{-\lambda \sum x_i}$$

**CDF**
$$F(x; \lambda) = P(X \leq x) = \begin{cases}
0 & \ x \lt 0 \\
1 - e^{-\lambda x} & \ x \geq 0
\end{cases} $$
$$\text{Right tail distribution, } P(X \geq t) = 1 - P(X \leq t) = 1 - (1 - e^{-\lambda t}) = e^{-\lambda t}$$
$$P(a \leq X \leq b) = F(b) - F(a)$$

**Expected Value**
$$E(X) = \int_0^\infty x \lambda e^{- \lambda x} \ dx \implies \mu = \frac{1}{\lambda}$$

**Variance**
$$\sigma^2 = \frac{1}{\lambda^2}$$

**Median**<BR>
The median of $X$ is the value $x$ for which $P(X \leq x) = 0.5$
$$\implies 1 - e^{-\lambda x} = 0.5 \text{(for median)}$$
$$\implies Median, \ x = \frac{ln(2)}{\lambda}$$

**Memorylessness**
$$P(X \gt s + t | X \gt s) = P(X \gt t)$$
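
The median formula and the memoryless property can both be checked numerically. This is a minimal sketch; the rate $\lambda = 0.5$ and the times $s = 1$, $t = 2$ are hypothetical.

```python
from math import exp, log

def exp_cdf(x, lam):
    """F(x; lambda) = P(X <= x) for the exponential distribution."""
    return 1 - exp(-lam * x) if x >= 0 else 0.0

def exp_survival(x, lam):
    """Right tail P(X > x) = 1 - F(x; lambda)."""
    return 1 - exp_cdf(x, lam)

lam = 0.5               # hypothetical rate
median = log(2) / lam   # value x with F(x) = 0.5

# Memorylessness: P(X > s + t | X > s) should equal P(X > t).
s, t = 1.0, 2.0
conditional = exp_survival(s + t, lam) / exp_survival(s, lam)
unconditional = exp_survival(t, lam)
```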

#### 9. Gamma Distribution

**Gamma Function**
$$\Gamma(\alpha) = \int_0^\infty x^{\alpha - 1}e^{-x} \ dx$$

**Properties of Gamma Function**
* For any $\alpha \gt 1, \Gamma(\alpha) = (\alpha - 1) \cdot \Gamma(\alpha - 1)$
* For any positive integer $n, \Gamma(n) = (n - 1)!$
* $\Gamma(\frac{1}{2}) = \sqrt \pi$
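
These three properties can be spot-checked with `math.gamma` from Python's standard library. The test value $\alpha = 3.7$ is arbitrary.

```python
from math import factorial, gamma, isclose, pi, sqrt

alpha = 3.7  # arbitrary value greater than 1

# Recurrence: Gamma(alpha) = (alpha - 1) * Gamma(alpha - 1)
recurrence_ok = isclose(gamma(alpha), (alpha - 1) * gamma(alpha - 1))

# Factorial link: Gamma(n) = (n - 1)! for positive integers n
factorial_ok = all(isclose(gamma(n), factorial(n - 1)) for n in range(1, 11))

# Special value: Gamma(1/2) = sqrt(pi)
half_ok = isclose(gamma(0.5), sqrt(pi))

print(recurrence_ok, factorial_ok, half_ok)
```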

**PDF**
$$f(x;\alpha, \beta) = \begin{cases}
\frac{1}{\beta^\alpha \Gamma(\alpha)} x^{\alpha - 1}e^{-x/\beta} & \ x \geq 0 \\
0 & \ otherwise
\end{cases}$$
where $\alpha \gt 0, \beta \gt 0$.

**Standard Gamma Distribution**<BR>
For a standard gamma distribution, $\beta = 1$. Hence the pdf is given by
$$f(x;\alpha) = \begin{cases}
\frac{x^{\alpha - 1}e^{-x}}{\Gamma(\alpha)}  & \ x \geq 0 \\
0 & \ otherwise
\end{cases}$$
    
Setting $\alpha=1$ and $\beta = 1/\lambda$ in the gamma distribution, we get the exponential distribution.

**CDF of Standard Gamma Distribution (incomplete gamma function)**
$$F(x;\alpha) = \int_0^x \frac{y^{\alpha - 1}e^{-y}}{\Gamma(\alpha)} dy \ \ x \gt 0$$
++ Calculate using tables

**CDF of Gamma Distribution**
$$P(X \leq x) = F(x;\alpha,\beta) = F\biggl(\frac{x}{\beta}; \alpha \biggr)$$
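
Where tables are not at hand, the incomplete gamma function can be approximated numerically. The sketch below uses a simple midpoint Riemann sum (a swapped-in numerical technique, not a method from the text) and assumes $\alpha \geq 1$ so that the integrand stays bounded near 0; the function names and step count are illustrative.

```python
from math import exp, gamma

def std_gamma_cdf(x, alpha, steps=20000):
    """F(x; alpha) for the standard gamma distribution via a midpoint Riemann sum."""
    if x <= 0:
        return 0.0
    h = x / steps
    total = 0.0
    for i in range(steps):
        y = (i + 0.5) * h  # midpoint of the i-th subinterval
        total += y ** (alpha - 1) * exp(-y)
    return total * h / gamma(alpha)

def gamma_cdf(x, alpha, beta, steps=20000):
    """F(x; alpha, beta) obtained by rescaling: F(x / beta; alpha)."""
    return std_gamma_cdf(x / beta, alpha, steps)

# With alpha = 1 the gamma distribution is exponential, so F(3; 1, 2) = 1 - e^{-1.5}.
check_exp = gamma_cdf(3.0, alpha=1, beta=2.0)
exact_exp = 1 - exp(-1.5)
```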

**Mean**
$$E(X) = \mu = \alpha \beta$$
and
$$E(X^2) = \beta^2(\alpha + 1) \alpha$$

**Variance**
$$V(X) = \sigma^2 = \alpha \beta^2$$

#### 10. Chi-Squared Distribution
The pdf is given by the pdf of the gamma density with $\alpha=\nu / 2$ and $\beta=2$
$$f(x;\nu) = \begin{cases}
\frac{1}{2^{\nu/2}\Gamma(\nu/2)}x^{(\nu/2) - 1}e^{-x/2} & \ x \geq 0 \\
0 & \ x \lt 0
\end{cases}$$
where $\nu$ is a positive integer and is called the **number of degrees of freedom** $(df)$ of $X$.<BR>
Chi-squared is often represented by $\chi^2$

#### 11. Weibull Distribution
**PDF**
$$f(x; \alpha, \beta) = \begin{cases}
\frac{\alpha}{\beta^\alpha}x^{\alpha - 1}e^{-(x/\beta)^\alpha} & \ x \geq 0 \\
0 & \ x \lt 0
\end{cases}$$
where $\alpha \gt 0, \beta \gt 0$ <BR>
Setting $\alpha = 1$ and $\lambda = 1/\beta$ in the above gives the exponential distribution
    
**CDF**
$$F(x;\alpha,\beta) = \begin{cases}
0 & \ x \lt 0 \\
1 - e^{-(x/\beta)^\alpha} & \ x \geq 0
\end{cases}$$
where $\alpha$ is the **shape parameter** and $\beta$ is the **scale parameter**

For a **three parameter Weibull distribution**
$$F(x;\alpha,\beta,\gamma) = 1 - e^{-[(x - \gamma)/\beta]^\alpha} \ \ x \geq \gamma$$
where $\gamma$ is the **location (threshold) parameter**
    
**Mean**
$$\mu = \beta\Gamma\biggl(1 + \frac{1}{\alpha}\biggl)$$
**Variance**
$$\sigma^2 = \beta^2\biggl\{\Gamma\biggl(1 + \frac{2}{\alpha}\biggl) - \biggl[\Gamma\biggl(1 + \frac{1}{\alpha}\biggl)\biggl]^2\biggl\}$$
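
The Weibull mean and variance are easy to evaluate with `math.gamma`. In the sketch below the function names are illustrative, and the case $\alpha = 1$ serves as a check because it reduces to the exponential distribution with mean $\beta$ and variance $\beta^2$.

```python
from math import exp, gamma

def weibull_cdf(x, alpha, beta):
    """F(x; alpha, beta) for the two-parameter Weibull distribution."""
    return 1 - exp(-((x / beta) ** alpha)) if x >= 0 else 0.0

def weibull_mean(alpha, beta):
    """mu = beta * Gamma(1 + 1/alpha)."""
    return beta * gamma(1 + 1 / alpha)

def weibull_var(alpha, beta):
    """sigma^2 = beta^2 * {Gamma(1 + 2/alpha) - [Gamma(1 + 1/alpha)]^2}."""
    return beta**2 * (gamma(1 + 2 / alpha) - gamma(1 + 1 / alpha) ** 2)

# alpha = 1 reduces to the exponential case: mean beta, variance beta^2.
mean_exp_case = weibull_mean(1, 2.0)
var_exp_case = weibull_var(1, 2.0)
```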

#### 12. Lognormal Distribution
A nonnegative rv $X$ is said to have a lognormal distribution if the rv $Y = ln(X)$ has a normal distribution.

**PDF**
$$f(x;\mu, \sigma) = \begin{cases}
\frac{1}{\sqrt{2 \pi} \sigma x}e^{-[ln(x) - \mu]^2/(2\sigma^2)} & \ x \gt 0 \\
0 & \ x \leq 0
\end{cases}$$
where $\mu$ and $\sigma$ are the mean and standard deviation of $ln(X)$

**CDF**
$$F(x;\mu, \sigma) = P(X \leq x) = P[ln(X) \leq ln(x)]$$
$$=P\biggl(Z \leq \frac{ln(x) - \mu}{\sigma} \biggl) = \phi \biggl(\frac{ln(x) - \mu}{\sigma} \biggl) \ \ x \geq 0$$

**Mean**
$$E(X) = e^{\mu + \sigma^2/2}$$

**Variance**
$$V(X) = e^{2\mu + \sigma^2} \cdot (e^{\sigma^2} - 1)$$
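
Because the lognormal CDF reduces to a standard normal CDF, it can be sketched with `statistics.NormalDist`. The function names and the parameters $\mu = 1$, $\sigma = 0.5$ are hypothetical.

```python
from math import exp, log
from statistics import NormalDist

def lognormal_cdf(x, mu, sigma):
    """P(X <= x) where ln(X) ~ N(mu, sigma^2)."""
    if x <= 0:
        return 0.0
    return NormalDist().cdf((log(x) - mu) / sigma)

def lognormal_mean(mu, sigma):
    """E(X) = exp(mu + sigma^2 / 2)."""
    return exp(mu + sigma**2 / 2)

def lognormal_var(mu, sigma):
    """V(X) = exp(2 mu + sigma^2) * (exp(sigma^2) - 1)."""
    return exp(2 * mu + sigma**2) * (exp(sigma**2) - 1)

mu, sigma = 1.0, 0.5                              # hypothetical parameters of ln(X)
at_median = lognormal_cdf(exp(mu), mu, sigma)     # e^mu is the median, so this is 0.5
mean = lognormal_mean(mu, sigma)
var = lognormal_var(mu, sigma)
```

Note that $\mu$ and $\sigma$ describe $ln(X)$, not $X$ itself, which is why the mean of $X$ exceeds $e^\mu$.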

#### 13. Beta Distribution
**PDF**
$$f(x;\alpha, \beta, A, B) = \begin{cases}
\frac{1}{B - A} \cdot \frac{\Gamma(\alpha + \beta)}{\Gamma(\alpha) \cdot \Gamma(\beta)} \biggl(\frac{x - A}{B - A} \biggl)^{\alpha - 1} \biggl(\frac{B - x}{B - A}\biggl)^{\beta - 1} & \ A \leq x \leq B \\
0 & \ otherwise
\end{cases}$$
$A = 0, B = 1$ gives the **standard beta distribution**

**Mean**
$$\mu = A + (B - A) \cdot \frac{\alpha}{\alpha + \beta}$$

**Variance**
$$\sigma^2 = \frac{(B - A) ^2 \alpha \beta}{(\alpha + \beta)^2 (\alpha + \beta + 1)}$$
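
The closed-form mean and variance can be compared with moments obtained by integrating the pdf numerically. The sketch below uses a midpoint Riemann sum (an illustrative numerical choice) with hypothetical parameters $\alpha = 2$, $\beta = 3$, $A = 10$, $B = 20$.

```python
from math import gamma

def beta_pdf(x, alpha, beta, A=0.0, B=1.0):
    """General beta pdf on [A, B]; zero outside the interval."""
    if x < A or x > B:
        return 0.0
    u = (x - A) / (B - A)  # rescale x to [0, 1]
    const = gamma(alpha + beta) / (gamma(alpha) * gamma(beta))
    return const * u ** (alpha - 1) * (1 - u) ** (beta - 1) / (B - A)

def beta_mean(alpha, beta, A=0.0, B=1.0):
    return A + (B - A) * alpha / (alpha + beta)

def beta_var(alpha, beta, A=0.0, B=1.0):
    return (B - A) ** 2 * alpha * beta / ((alpha + beta) ** 2 * (alpha + beta + 1))

def midpoint_moment(k, alpha, beta, A, B, steps=20000):
    """Approximate E(X^k) with a midpoint Riemann sum over [A, B]."""
    h = (B - A) / steps
    mids = (A + (i + 0.5) * h for i in range(steps))
    return sum(x**k * beta_pdf(x, alpha, beta, A, B) for x in mids) * h

alpha, beta, A, B = 2.0, 3.0, 10.0, 20.0  # hypothetical parameters
mean_numeric = midpoint_moment(1, alpha, beta, A, B)
var_numeric = midpoint_moment(2, alpha, beta, A, B) - mean_numeric**2
```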

#### 14. Extreme Value Distribution
$$F(x;\theta_1, \theta_2) = 1 - e^{-e^{(x - \theta_1)/\theta_2}}$$
where $-\infty \lt x \lt \infty$

#### 15. Cauchy Distribution
$$f(x) = \frac{1}{\pi [1 + (x - \mu)^2]} \ \ \ -\infty \lt x \lt \infty$$