# Introduction to Statistics
## Chapter 1.6 - Continuous Random Variables
Equations and properties for continuous random variables closely mirror those for discrete random variables. As a rule of thumb, discrete random variables use summations in their equations while continuous random variables use integrals.


## Cumulative Distribution Function

The cumulative distribution function (cdf) of a continuous random variable is defined the same way as in the discrete case, $F(x) = P(X \le x)$, and has exactly the same properties. It can also be read as the quantile level of $x$: $F(x)$ is the fraction of the distribution lying at or below $x$. The main difference is that the cdf of a continuous random variable is a continuous function, while the cdf of a discrete random variable is a step function.

## Probability Density Function
### Definition
The probability distribution of a continuous random variable is described by a probability density function (pdf), defined as follows:
- (1) $f(x)=\frac{dF(x)}{dx}$ wherever the derivative exists

Since a cdf is a nondecreasing function, we also know that $f(x)\ge0$.  

Based on the definition of a pdf, the cdf must be equal to the following:
- (2) $F(x)=\int_{-\infty}^{x}f(t)dt$

This also means we can find the probability that $X$ lies in the interval $[x_1, x_2]$ using the pdf:
- (3) $P(x_1 \le X \le x_2) = \int_{x_1}^{x_2}f(t)dt$
    - $P(x_1 \le X \le x_2) = F(x_2) - F(x_1) = \int_{-\infty}^{x_2}f(t)dt - \int_{-\infty}^{x_1}f(t)dt = \int_{x_1}^{x_2}f(t)dt$

This means that if $x_1 = x_2$, the probability is 0; that is, $P(X = x) = 0$ for every $x$. Using this we can derive the following property:
- $P(X\le x) = P(X<x)$.
    - $P(X<x) = P(X\le x) - P(X=x) = P(X\le x) - 0 = P(X\le x)$.
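
Equation (3) can be checked numerically. The sketch below is only an illustration: it assumes a standard normal distribution as the example (via Python's `statistics.NormalDist`), and `integrate` is a hypothetical midpoint-rule helper, not a library function.

```python
from statistics import NormalDist


def integrate(f, lo, hi, n=10_000):
    """Midpoint-rule approximation of the integral of f over [lo, hi]."""
    width = (hi - lo) / n
    return sum(f(lo + (i + 0.5) * width) for i in range(n)) * width


X = NormalDist(mu=0.0, sigma=1.0)  # assumed example distribution
x1, x2 = -1.0, 2.0                 # assumed interval endpoints

area = integrate(X.pdf, x1, x2)          # right-hand side of equation (3)
cdf_difference = X.cdf(x2) - X.cdf(x1)   # F(x2) - F(x1)
point_mass = integrate(X.pdf, x1, x1)    # zero-width interval, so zero area

print(area, cdf_difference, point_mass)
```

The first two printed values should agree up to the integration error, and the third is exactly zero because the interval has no width.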

### Conceptual
A continuous random variable's pdf and a discrete random variable's pmf both model probability, but they do so in different ways. A pmf gives probability directly, while a pdf gives it indirectly: the pdf models the probability density (hence the name), and the probability is the area under the curve.

Probability density is analogous to density in physics. In physics, density is how much "stuff" is in a given unit volume. Objects with higher density have a higher mass, assuming they have the same volume. Probability density is how much probability is in a given unit interval. Values with a higher probability density are relatively more "likely" to occur, even though the absolute probability of any single value is $0$.

The value of the pdf at a particular point is not constrained to $[0,1]$. Suppose the mass of an object is constrained to be between $0$ g and $1$ g. The density of the object is not constrained to be between $0$ and $1$: the object could have a high density (like $2$ g/cm$^3$) but a small volume (like $0.1$ cm$^3$), giving a mass of only $0.2$ g. Similarly, probability density can be greater than $1$ even though probability is constrained to $[0,1]$.
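
As a small illustration of a density exceeding $1$, the sketch below assumes a normal distribution with a small standard deviation (the value $0.1$ is an arbitrary choice for the example) and compares the density at the peak with the probability of a narrow interval around it.

```python
from statistics import NormalDist

# Assumed example: a narrow normal distribution centred at 0.
narrow = NormalDist(mu=0.0, sigma=0.1)

# Density at the peak: 1 / (sigma * sqrt(2 * pi)), which exceeds 1 here.
peak_density = narrow.pdf(0.0)

# Probability of the interval [-0.05, 0.05]; this is an area, so it stays in [0, 1].
prob_near_peak = narrow.cdf(0.05) - narrow.cdf(-0.05)

print(peak_density, prob_near_peak)
```

The density is large because the probability is packed into a short interval, just as a small, heavy object has a high physical density.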

## Expected Value
The definition of expected value in the continuous case is the following:
- (4) $E(X)=\int_{-\infty}^{\infty}xf(x)dx$

It also shares all of the expected value properties from the discrete case:
- $E(g(X)) = \int_{-\infty}^{\infty}g(x)f(x)dx$
    - This property is slightly modified: the summation is replaced with integration.
- $E(c) = c$
- $E(cX) = cE(X)$
- $E(X + Y) = E(X) + E(Y)$
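
These properties can be tried out numerically. The sketch below assumes an illustrative density $f(x) = 2x$ on $[0, 1]$ (not taken from the text above) and uses a hypothetical midpoint-rule helper `integrate`; for this density, $E(X) = 2/3$ and $E(X^2) = 1/2$ follow from direct integration.

```python
def integrate(f, lo, hi, n=10_000):
    """Midpoint-rule approximation of the integral of f over [lo, hi]."""
    width = (hi - lo) / n
    return sum(f(lo + (i + 0.5) * width) for i in range(n)) * width


def pdf(x):
    """Assumed example density: f(x) = 2x on [0, 1], zero elsewhere."""
    return 2.0 * x if 0.0 <= x <= 1.0 else 0.0


def expectation(g):
    """E(g(X)) as the integral of g(x) * f(x) over the support [0, 1]."""
    return integrate(lambda x: g(x) * pdf(x), 0.0, 1.0)


mean = expectation(lambda x: x)               # E(X), equation (4)
second_moment = expectation(lambda x: x * x)  # E(g(X)) with g(x) = x^2
linear = expectation(lambda x: 3 * x + 1)     # E(3X + 1)

print(mean, second_moment, linear, 3 * mean + 1)
```

The last two printed values should match, which reflects $E(cX + d) = cE(X) + d$.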

## Variance
The definition of variance in the continuous case remains the same and shares the same properties:
- $Var(X) = E[(X - \mu)^2]$
- $Var(c) = 0$
- $Var(X + c) = Var(X)$
- $Var(aX) = a^2Var(X)$
- $Var(X + Y) = Var(X) + Var(Y)$, if $X$ and $Y$ are independent
- $Var(aX + bY) = a^2Var(X) + b^2Var(Y) + 2abCov(X, Y)$
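
The shift, scale, and linear-combination rules also hold exactly for the empirical variance and covariance of any data set, which gives a quick way to check them, including the factor of $2$ on the covariance term. The sketch below is an illustration only: the simulated data, the seed, and the constants `a`, `b`, `c` are all assumed values.

```python
import random
from statistics import fmean


def variance(values):
    """Population-style variance: the mean squared deviation from the mean."""
    m = fmean(values)
    return fmean([(v - m) ** 2 for v in values])


def covariance(u, v):
    """Population-style covariance of two equal-length samples."""
    mu_u, mu_v = fmean(u), fmean(v)
    return fmean([(p - mu_u) * (q - mu_v) for p, q in zip(u, v)])


rng = random.Random(0)  # fixed seed so the run is repeatable
xs = [rng.gauss(0.0, 1.0) for _ in range(1000)]
ys = [0.5 * x + rng.gauss(0.0, 1.0) for x in xs]  # deliberately correlated with xs

a, b, c = 2.0, -3.0, 10.0  # assumed constants

shifted = variance([x + c for x in xs])   # Var(X + c)
scaled = variance([a * x for x in xs])    # Var(aX)
combined = variance([a * x + b * y for x, y in zip(xs, ys)])  # Var(aX + bY)
formula = a**2 * variance(xs) + b**2 * variance(ys) + 2 * a * b * covariance(xs, ys)

print(shifted, variance(xs))
print(scaled, a**2 * variance(xs))
print(combined, formula)
```

Each printed pair should agree up to floating-point rounding.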

## Uniform Random Variable
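A uniform random variable describes an experiment whose result is equally likely to fall anywhere between a minimum value $a$ and a maximum value $b$. Because any single point has probability $0$, it does not matter whether the endpoints are included.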

The pdf of the uniform is the following:
- (5) $f(x) = \begin{cases} \frac{1}{b - a}, \quad \text{if } a\le x \le b \\ 0, \quad \text{o.w.} \end{cases}$

### Notation
- $X \sim Unif(a, b)$
- $X \sim U(a, b)$

### Expected Value and Variance
- (6) $E(X) = \frac{a+b}{2}$
- (7) $Var(X) = \frac{(b-a)^2}{12}$
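
Equations (6) and (7) can be checked by integrating against the uniform density $\frac{1}{b-a}$ on $[a, b]$. The sketch below assumes example bounds $a = 2$ and $b = 5$ and a hypothetical midpoint-rule helper `integrate`.

```python
def integrate(f, lo, hi, n=10_000):
    """Midpoint-rule approximation of the integral of f over [lo, hi]."""
    width = (hi - lo) / n
    return sum(f(lo + (i + 0.5) * width) for i in range(n)) * width


a, b = 2.0, 5.0  # assumed example bounds


def uniform_pdf(x):
    """Density of Unif(a, b): constant 1 / (b - a) on [a, b], zero elsewhere."""
    return 1.0 / (b - a) if a <= x <= b else 0.0


total = integrate(uniform_pdf, a, b)                   # total area under the pdf
mean = integrate(lambda x: x * uniform_pdf(x), a, b)   # E(X)
variance = integrate(lambda x: (x - mean) ** 2 * uniform_pdf(x), a, b)  # Var(X)

print(total)
print(mean, (a + b) / 2)
print(variance, (b - a) ** 2 / 12)
```

The total area should come out as $1$, and the numerical mean and variance should match the closed forms up to integration error.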


## Normal (Gaussian) Random Variable
A normal random variable describes an experiment whose result has a high probability of being near the mean. The probability density decreases rapidly the more the result deviates from the mean. The pdf is symmetric around the mean and forms a bell-shaped curve over $(-\infty, \infty)$.

The pdf of a normal distribution is the following:
- (8) $f(x) = \frac{1}{\sigma\sqrt{2\pi}}\exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$
    - $\exp(x) = e^x$

### Notation
- $X \sim N(\mu, \sigma^2)$

### Mean and Variance
- (9) $E(X) = \mu$
- (10) $Var(X) = \sigma^2$

### Standard Normal
A special instance of the normal distribution is the standard normal, $N(0,1)$. All normal distributions share the same shape up to location and scale, so any normal random variable can be converted into a standard normal using the following formula:
- $Z = \frac{X-\mu}{\sigma}$  
    - $X \sim N(\mu, \sigma^2)$  
    - $Z \sim N(0, 1)$  
    
The cdf of the standard normal is often denoted $\Phi(z)$.
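
Standardizing lets any normal probability be read off the standard normal cdf: $P(X \le x) = \Phi\left(\frac{x-\mu}{\sigma}\right)$. The sketch below assumes example values $\mu = 100$, $\sigma = 15$, and $x = 130$. Note that `statistics.NormalDist` takes the standard deviation $\sigma$ as its second argument, not the variance $\sigma^2$ used in the $N(\mu, \sigma^2)$ notation.

```python
from statistics import NormalDist

mu, sigma = 100.0, 15.0  # assumed example parameters
x = 130.0                # assumed value of interest

X = NormalDist(mu, sigma)  # X ~ N(mu, sigma^2)
Z = NormalDist()           # standard normal, N(0, 1)

z = (x - mu) / sigma       # standardized value
direct = X.cdf(x)          # P(X <= x) computed on X itself
via_standard = Z.cdf(z)    # Phi(z), computed on the standard normal

print(z, direct, via_standard)
```

The two probabilities should coincide, since $X \le x$ and $Z \le z$ describe the same event.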

### Properties
- A linear combination of independent normal random variables is itself normally distributed.
- Binomial $B(n,p)$ is approximately normal $N(np, np(1-p))$ if $n$ is large and $p$ is not too close to 0 or 1
    - $B(n,p) \stackrel{.}{\sim} N(np, np(1-p))$
- Poisson $Pois(\lambda)$ is approximately normal $N(\lambda, \lambda)$ for large values of $\lambda$
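
To get a feel for the binomial approximation, the sketch below compares an exact binomial cdf with the cdf of $N(np, np(1-p))$. The values $n = 400$, $p = 0.5$, and $k = 210$ are assumed for illustration only.

```python
from math import comb, sqrt
from statistics import NormalDist

n, p, k = 400, 0.5, 210  # assumed example values

# Exact P(X <= k) for X ~ B(n, p), summing the binomial pmf.
exact = sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))

# Normal approximation with mean np and variance np(1 - p).
# NormalDist expects the standard deviation, hence the square root.
approx = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p))).cdf(k)

gap = abs(exact - approx)
print(exact, approx, gap)
```

The two values should be close but not identical, since the normal distribution only approximates the binomial.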


## Equations  
General  
1) $f(x)=\frac{dF(x)}{dx}$ wherever the derivative exists  
2) $F(x)=\int_{-\infty}^{x}f(t)dt$  
3) $P(x_1 \le X \le x_2) = \int_{x_1}^{x_2}f(t)dt$   
4) $E(X)=\int_{-\infty}^{\infty}xf(x)dx$  

Uniform  
5) $f(x) = \begin{cases} \frac{1}{b - a}, \quad \text{if } a\le x \le b \\ 0, \quad \text{o.w.} \end{cases}$  
6) $E(X) = \frac{a+b}{2}$  
7) $Var(X) = \frac{(b-a)^2}{12}$  

Normal   
8) $f(x) = \frac{1}{\sigma\sqrt{2\pi}}\exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$  
9) $E(X) = \mu$  
10) $Var(X) = \sigma^2$
