# Continuous Probability Distributions 
 ---
## Continuous random variables 
Continuous random variables can take on an infinite number of possible values, corresponding to every value in an interval. 

For example, we could have a situation where a random variable might take on any value in the interval:
$$(3,4)$$
The random variable might take on the value 3.1, or 3.4, or 3.9695, or what have you. Any of the infinitely many values between 3 and 4 is fair game. 

Let's dive a bit deeper by looking at the distribution below: 

![cont%20dist%20height.png](attachment:cont%20dist%20height.png)

This shows the distribution of the height of Canadian males. Height is a continuous random variable, so it has a continuous probability distribution. The curve looks like a smooth version of a histogram. Loosely speaking, values of $x$ where the curve is high are more likely to occur than values where the curve is low. 

It is important to note that **we cannot model continuous random variables with the same methods we use for discrete random variables.**

## Probability Density Function
We model a continuous random variable with a curve $f(x)$, called a *probability density function (pdf)*. Below is another example of a continuous probability distribution. 

![cont%20dist%202.png](attachment:cont%20dist%202.png)

This is the distribution of time to failure, in thousands of hours, for a type of lightbulb. The values that $x$ can take on are given on the x-axis, and the probability density function, $f(x)$, gives the height of the curve at each of those values. 

$f(x)$ represents the height of the curve at the point $x$. For continuous random variables, probabilities are the **areas under the curve**. 

Below is a continuous probability distribution for a random variable $x$. 

![continuous%20dist%20area%20under%20curve.png](attachment:continuous%20dist%20area%20under%20curve.png)

The height of the curve is represented by $f(x)$. The probability that a random variable x falls between the values a and b, is simply the area under the curve between a and b (the grey shaded region). 

One important notion here is that the probability that the random variable x is *exactly* equal to one specific value, is 0. We could say that the probability that the random variable x is equal to the value a is 0 for any a. Mathematically that looks like:
$$P(X=a) =0$$
We can think of $a$ as an infinitesimally small point with an infinitesimally small area above it, so we call that area zero. 

So from a practical point of view, it only makes sense to talk about a random variable x falling in an interval of values. 
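To make the "area under the curve" idea concrete, here is a minimal numerical sketch. The density used is an assumption chosen only for illustration (an exponential curve, $f(x)=e^{-x}$ for $x \geq 0$), and `area` is a hypothetical helper that approximates the integral with the midpoint rule.

```python
import math

def pdf(x):
    """Assumed example density: f(x) = e^(-x) for x >= 0, and 0 otherwise."""
    return math.exp(-x) if x >= 0 else 0.0

def area(f, a, b, steps=10_000):
    """Approximate the area under f between a and b with the midpoint rule."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

# P(1 <= X <= 2) is the area under the curve between 1 and 2.
p_interval = area(pdf, 1.0, 2.0)

# Shrinking the interval that starts at a = 1 drives the area toward 0,
# which is why P(X = a) = 0 for any single point a.
shrinking = [area(pdf, 1.0, 1.0 + w) for w in (0.1, 0.01, 0.001)]

print(p_interval)
print(shrinking)
```

The narrower the interval, the smaller the area, so the probability attached to a single point vanishes in the limit.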

### Features of continuous probability distribution
* $f(x) \geq 0$ for all $x$
* The area under the entire curve is equal to one

There are a number of common continuous probability distributions that come up frequently in theory and practice. The first is the normal distribution:

![normal%20.png](attachment:normal%20.png)

We also have the continuous uniform distribution:

![uniform.png](attachment:uniform.png)

And the exponential distribution (think exponential decay):

![exponential.png](attachment:exponential.png)

And there are many others!

## Recap
* Probabilities and percentiles are found by integrating the probability density function
* Probabilities are areas under the curve, and areas under the curve are found using integration
* Deriving the mean and variance of the pdf also requires integration

---

# Finding Probabilities and Percentiles for a Continuous Probability Distribution
Suppose for a random variable X:
$$f(x)=\begin{cases}cx^3 & \text{for } 2 \leq x \leq 4\\ 0 & \text{otherwise}\end{cases}$$
where c is just some constant. 

The question we want to answer is: **what value of c makes this a legitimate probability distribution?**

Okay, let's think: what needs to be satisfied in order for $f(x)$ to be a legitimate probability distribution?
* First, it can *never* take on negative values. Since $x^3$ is positive everywhere on $[2,4]$, c cannot be negative (and it must be strictly positive for the area to equal 1).  
* Second, the area under the entire curve must equal 1. Mathematically that looks like:
$$\int_{-\infty}^{\infty}f(x)dx=1$$
And in our example it specifically looks like:
$$\int_{2}^{4}cx^3dx=1$$
Let's carry out this integration quickly as a refresher. The left-hand side is:
$$c\int_{2}^{4}x^3dx = c\Big[\frac{x^4}{4}\Big]_2^4$$
$$= c\Big[\frac{4^4}{4}-\frac{2^4}{4}\Big]$$
$$= c\,(64-4) = 60c$$
and we know that the entire area under the curve must equal one, so in this case:
$$60c = 1$$
$$c = \frac{1}{60}$$
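The value of c can be double-checked numerically. This is only a sketch: `area` is a hypothetical helper that uses the midpoint rule, not a library routine, and the step count is an arbitrary choice.

```python
def area(f, a, b, steps=100_000):
    """Approximate the area under f between a and b with the midpoint rule."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

# Area under x^3 on [2, 4] before scaling by c (analytically 60).
unscaled_area = area(lambda x: x ** 3, 2, 4)

# c must rescale that area to exactly 1.
c = 1 / unscaled_area

def pdf(x):
    """f(x) = c * x^3 on [2, 4], and 0 otherwise."""
    return c * x ** 3 if 2 <= x <= 4 else 0.0

total_area = area(pdf, 2, 4)

print(c, 1 / 60)      # the two values should agree closely
print(total_area)     # should be very close to 1
```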
Here is what the pdf looks like plotted out:

![pdf%20ex.png](attachment:pdf%20ex.png)

## Find a probability
Suppose we wanted to find the probability that a random variable X takes on a value greater than 3:
$$P(X>3)$$
For continuous probability distributions, probability is simply the area under the curve, so in our case it is the area under the curve to the right of 3. We can find that value using integration. Let's walk through that now.

$$P(X>3)=\int_{3}^{4}\frac{1}{60}x^3dx$$
$$=\frac{1}{60}\Big[\frac{x^4}{4}\Big]_3^4$$
$$=\frac{1}{60}\Big[\frac{4^4}{4}-\frac{3^4}{4}\Big]$$
$$=\frac{175}{240} \approx 0.729$$
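The same integral also gives a cumulative distribution function, which can be inverted to find percentiles. The sketch below assumes the pdf $f(x)=\frac{1}{60}x^3$ on $[2,4]$ from above; the function names `cdf` and `percentile` are made up for this illustration, and the closed forms come from integrating the pdf from 2 to $x$, giving $F(x)=\frac{x^4-16}{240}$.

```python
def cdf(x):
    """P(X <= x) for f(x) = x^3 / 60 on [2, 4]."""
    if x <= 2:
        return 0.0
    if x >= 4:
        return 1.0
    return (x ** 4 - 2 ** 4) / 240

def percentile(p):
    """Value x with P(X <= x) = p, found by solving (x^4 - 16) / 240 = p."""
    if not 0 <= p <= 1:
        raise ValueError("p must be between 0 and 1")
    return (240 * p + 16) ** 0.25

# P(X > 3) is the complement of P(X <= 3).
p_greater_than_3 = 1 - cdf(3)

# The 50th percentile (median) of X.
median = percentile(0.5)

print(p_greater_than_3)   # matches 175/240 from the integration above
print(median)
```

Because most of the area sits near the right end of the interval, the median comes out well above the midpoint of $[2,4]$.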

# Normal Distribution 
The normal distribution is an extremely important continuous probability distribution. 

The equation for the probability density function of the normal distribution is: 

![pdf%20func.png](attachment:pdf%20func.png)

where $\mu$ is the mean, and $\sigma$ is the standard deviation. The distribution is symmetric about the mean!
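As a rough sketch, the density can be written out in code. This assumes the formula in the image is the standard normal density, $f(x)=\frac{1}{\sigma\sqrt{2\pi}}e^{-\frac{(x-\mu)^2}{2\sigma^2}}$; the values of `mu` and `sigma` below are hypothetical and chosen only for illustration.

```python
import math
from statistics import NormalDist

def normal_pdf(x, mu, sigma):
    """Height of the normal curve at x for mean mu and standard deviation sigma."""
    coefficient = 1 / (sigma * math.sqrt(2 * math.pi))
    exponent = -((x - mu) ** 2) / (2 * sigma ** 2)
    return coefficient * math.exp(exponent)

mu, sigma = 175.0, 7.0   # hypothetical mean and standard deviation

# Symmetry about the mean: equal distances either side give equal heights.
left = normal_pdf(mu - 3.0, mu, sigma)
right = normal_pdf(mu + 3.0, mu, sigma)

# Cross-check against the standard library's implementation.
library_value = NormalDist(mu, sigma).pdf(180.0)

print(left, right)
print(normal_pdf(180.0, mu, sigma), library_value)
```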

![normal%20dist.png](attachment:normal%20dist.png)

What does the standard deviation represent? 

![normal%20dist%20sigma.png](attachment:normal%20dist%20sigma.png)

This shows that about 68% of the area lies within 1 standard deviation of the mean! 

![normal%20dist%20sigma%202.png](attachment:normal%20dist%20sigma%202.png)
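The areas within a given number of standard deviations can be checked with Python's standard library. This is a small sketch using `statistics.NormalDist` (available from Python 3.8); the helper name `area_within` is made up for this example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0.0, sigma=1.0)

def area_within(k):
    """Area under the standard normal curve between -k and +k standard deviations."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

areas = {k: area_within(k) for k in (1, 2, 3)}

for k, value in areas.items():
    print(f"within {k} standard deviation(s): {value:.4f}")
```

The results are roughly 0.68, 0.95, and 0.997. Since any normal distribution can be rescaled to the standard normal, the same proportions hold for every choice of $\mu$ and $\sigma$.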

Note, if X is a random variable that has a normal distribution with mean $\mu$ and variance $\sigma^2$, we write this as: 
$$X \sim N(\mu,\sigma^2)$$

What makes the normal distribution so important? Its role in the central limit theorem...

# Central limit theorem
The gist of the **central limit theorem** is as follows:
* The sample mean will be approximately normally distributed for large sample sizes, regardless of the distribution from which we are sampling. 

Let's recall a few characteristics of the sampling distribution!

![PopSamples.GIF](attachment:PopSamples.GIF)

Suppose we are sampling from a population with mean $\mu$ and standard deviation $\sigma$. Let $\bar{X}$ be a random variable representing the sample mean of $n$ independently drawn observations. Then:
* The mean of the sampling distribution of the sample mean is equal to the population mean: $\mu_{\bar{X}}=\mu$
* The standard deviation of the sampling distribution of $\bar{X}$ is equal to $\sigma_{\bar{X}}=\frac{\sigma}{\sqrt{n}}$
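These two properties can be illustrated with a short simulation. This is a sketch under assumptions: the population is taken to be uniform on $[0,1]$ (so $\mu=0.5$ and $\sigma=\sqrt{1/12}$), and the sample size, number of samples, and seed are arbitrary choices.

```python
import random
import statistics

random.seed(0)

# Assumed population: uniform on [0, 1].
mu = 0.5
sigma = (1 / 12) ** 0.5

n = 25                 # observations per sample
num_samples = 20_000   # number of samples drawn

# Draw many samples and record the mean of each one.
sample_means = [
    statistics.fmean([random.random() for _ in range(n)])
    for _ in range(num_samples)
]

mean_of_means = statistics.fmean(sample_means)
sd_of_means = statistics.pstdev(sample_means)
theoretical_sd = sigma / n ** 0.5

print(mean_of_means, mu)             # should be close to each other
print(sd_of_means, theoretical_sd)   # should be close to each other
```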

So if the population is normally distributed, then $\bar{X}$ is also normally distributed. 

But what if the population is *not* normal? The central limit theorem addresses this question!
* The distribution of the sample mean tends towards the normal distribution as the sample size increases, regardless of the distribution from which we are sampling (provided that distribution has a finite variance). 
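A simulation can make this tendency visible. The sketch below assumes a strongly skewed population (exponential with rate 1, so $\mu=1$ and $\sigma=1$) and measures how often the sample mean lands within one standard error, $\sigma/\sqrt{n}$, of $\mu$. For a normal distribution that share is about 68%, so the measured share should drift toward 0.68 as $n$ grows. The function name, sample sizes, and seed are all arbitrary choices for this example.

```python
import random
import statistics

random.seed(1)

def fraction_within_one_se(n, num_samples=10_000):
    """Share of sample means that land within one standard error of mu.

    Assumed population: exponential with rate 1 (mu = 1, sigma = 1),
    which is right-skewed and clearly not normal.
    """
    mu, sigma = 1.0, 1.0
    standard_error = sigma / n ** 0.5
    hits = 0
    for _ in range(num_samples):
        sample = [random.expovariate(1.0) for _ in range(n)]
        if abs(statistics.fmean(sample) - mu) <= standard_error:
            hits += 1
    return hits / num_samples

fractions = {n: fraction_within_one_se(n) for n in (1, 5, 50)}

for n, share in fractions.items():
    print(f"n = {n:2d}: share within one standard error = {share:.3f}")
```

With $n=1$ the share is well above 0.68 because a single exponential draw is far from normal; by $n=50$ it should sit close to the normal value.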

Watch this video for a great intro! https://www.youtube.com/watch?v=Pujol1yC1_A

## Why is this important?
Many statistics have distributions that are approximately normal for large sample sizes, even when we are sampling from a distribution that is not normal!

This means that we can often use well-developed statistical inference procedures that are based on a normal distribution, even if we are sampling from a population that is not normal, provided we have a large sample size! 