### Central Limit Theorem:
The central limit theorem states that if you take sufficiently large samples from a population, the samples’ means will be normally distributed, even if the population isn’t normally distributed.

In probability theory, the central limit theorem (CLT) states that the distribution of the sample mean approximates a normal distribution (i.e., a “bell curve”) as the sample size becomes larger, assuming that all samples are the same size, and regardless of the shape of the population's distribution.

 - Central limit theorem formula:
 
Fortunately, you don’t need to actually repeatedly sample a population to know the shape of the sampling distribution. The parameters of the sampling distribution of the mean are determined by the parameters of the population:

The mean of the sampling distribution is the mean of the population.

![image-2.png](attachment:image-2.png)

The standard deviation of the sampling distribution is the standard deviation of the population divided by the square root of the sample size.

![image-3.png](attachment:image-3.png)

We can describe the sampling distribution of the mean using this notation:

![image-4.png](attachment:image-4.png)

Where:

 - X̄ is the sampling distribution of the sample means
 - ~ means “follows the distribution”
 - N is the normal distribution
 - µ is the mean of the population
 - σ is the standard deviation of the population
 - n is the sample size
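
For reference, the relationships described above can be written out as follows. This is a sketch: the subscripted symbols $\mu_{\bar{X}}$ and $\sigma_{\bar{X}}$ are notation assumed here for the mean and standard deviation of the sampling distribution, and the second argument of $N$ is taken to be a standard deviation rather than a variance.

$$
\mu_{\bar{X}} = \mu, \qquad \sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}, \qquad \bar{X} \sim N\!\left(\mu, \frac{\sigma}{\sqrt{n}}\right)
$$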

<b>Example:</b>

A population follows a Poisson distribution (left image). If we take 10,000 samples from the population, each with a sample size of 50, the sample means follow a normal distribution, as predicted by the central limit theorem (right image).

![image.png](attachment:image.png)
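
The experiment in this example can be reproduced with a short simulation. The sketch below assumes NumPy is available and uses a Poisson rate of 3, which is an illustrative choice because the text does not state the rate. It checks the mean and standard deviation of the sample means against the values the formula predicts.

```python
import numpy as np

rng = np.random.default_rng(seed=0)  # fixed seed so the run is repeatable

lam = 3.0           # assumed Poisson rate (not given in the text)
n_samples = 10_000  # number of samples drawn from the population
n = 50              # size of each sample

# Each row is one sample of size n; averaging across a row gives one sample mean.
samples = rng.poisson(lam=lam, size=(n_samples, n))
sample_means = samples.mean(axis=1)

# For a Poisson population, mean = lam and standard deviation = sqrt(lam).
predicted_mean = lam
predicted_sd = np.sqrt(lam) / np.sqrt(n)

print(f"mean of sample means: {sample_means.mean():.3f} (predicted {predicted_mean:.3f})")
print(f"sd of sample means:   {sample_means.std(ddof=1):.3f} (predicted {predicted_sd:.3f})")
```

Plotting a histogram of `sample_means` should give a bell-shaped curve similar to the right-hand image.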

The central limit theorem is often used in conjunction with the law of large numbers, which states that the sample mean comes closer to the population mean as the sample size grows. Together, the two results are extremely useful for accurately predicting the characteristics of populations.
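
The law of large numbers can be illustrated in a similar way. This minimal sketch again assumes a Poisson population with a rate of 3 (an illustrative value) and prints how far the sample mean is from the population mean as the sample size grows.

```python
import numpy as np

rng = np.random.default_rng(seed=1)

lam = 3.0  # assumed Poisson rate; the population mean is therefore 3.0
draws = rng.poisson(lam=lam, size=1_000_000)

# Compare the sample mean with the population mean at growing sample sizes.
errors = {}
for n in (10, 100, 10_000, 1_000_000):
    sample_mean = draws[:n].mean()
    errors[n] = abs(sample_mean - lam)
    print(f"n = {n:>9,}: sample mean = {sample_mean:.4f}, |error| = {errors[n]:.4f}")
```

The error does not have to shrink at every single step, but it tends towards zero as n becomes very large.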

 - Why Is the Central Limit Theorem Useful?

The central limit theorem is useful when analyzing large data sets because it allows one to assume that the sampling distribution of the mean will be normally distributed in most cases. This allows for easier statistical analysis and inference. For example, investors can use the central limit theorem to aggregate individual security performance data and generate a distribution of sample means that represents a larger population distribution for security returns over a period of time.

 - Why Is the Central Limit Theorem's Minimum Sample Size 30?

A sample size of 30 is a widely used rule of thumb rather than a strict requirement. For many populations, a sample of about 30 is large enough for the sampling distribution of the mean to be approximately normal, although strongly skewed populations can require more. The larger your sample size, the more likely the sample will be representative of your population.

#### Conditions of the central limit theorem
The central limit theorem states that the sampling distribution of the mean will be approximately normal under the following conditions:

 - The sample size is sufficiently large. This condition is usually met if the sample size is n ≥ 30.
 - The samples are independent and identically distributed (i.i.d.) random variables. This condition is usually met if the sampling is random.
 - The population’s distribution has finite variance. The central limit theorem doesn’t apply to distributions with infinite variance, such as the Cauchy distribution. Most distributions have finite variance.
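
The finite-variance condition can be checked empirically. The sketch below (assuming NumPy) compares sample means drawn from a standard normal population with sample means drawn from a standard Cauchy population. Spread is measured with the interquartile range (IQR) rather than the standard deviation, because the standard deviation of Cauchy data is itself unstable. The helper name `iqr_of_sample_means` is introduced here for illustration only.

```python
import numpy as np

rng = np.random.default_rng(seed=2)
n_samples = 2_000  # number of sample means per sample size

def iqr_of_sample_means(draw, n):
    """Interquartile range of n_samples sample means, each from a sample of size n."""
    means = draw(size=(n_samples, n)).mean(axis=1)
    q25, q75 = np.percentile(means, [25, 75])
    return q75 - q25

results = {}
for n in (10, 1_000):
    normal_iqr = iqr_of_sample_means(rng.standard_normal, n)
    cauchy_iqr = iqr_of_sample_means(rng.standard_cauchy, n)
    results[n] = (normal_iqr, cauchy_iqr)
    print(f"n = {n:>5}: normal IQR = {normal_iqr:.3f}, Cauchy IQR = {cauchy_iqr:.3f}")
```

For the normal population the spread of the sample means shrinks as n grows. For the Cauchy population it stays at roughly the same value, because the mean of n standard Cauchy variables is itself standard Cauchy.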

### Central limit theorem examples
Applying the central limit theorem to real distributions may help you to better understand how it works.

#### Continuous distribution
Suppose that you’re interested in the age that people retire in the United States. The population is all retired Americans, and the distribution of the population might look something like this:

![image.png](attachment:image.png)

Age at retirement follows a left-skewed distribution. Most people retire within about five years of the mean retirement age of 65 years. However, there’s a “long tail” of people who retire much younger, such as at 50 or even 40 years old. The population has a standard deviation of 6 years.

Imagine that you take a small sample of the population. You randomly select five retirees and ask them what age they retired.

Example: Central limit theorem; sample of n = 5

68	73	70	62	63

The mean of the sample is an estimate of the population mean. It might not be a very precise estimate, since the sample size is only 5.

mean = (68 + 73 + 70 + 62 + 63) / 5

mean = 67.2 years

Suppose that you repeat this procedure 10 times, taking samples of five retirees, and calculating the mean of each sample. This is a sampling distribution of the mean.

60.8	57.8	62.2	68.6	67.4	67.8	68.3	65.6	66.5	62.1
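
The arithmetic so far can be reproduced with Python's standard library. This is a small sketch using only the values listed above; the variable names are illustrative.

```python
from statistics import mean

first_sample = [68, 73, 70, 62, 63]  # ages from the sample of n = 5
print(mean(first_sample))

# The ten sample means listed above; their average is itself an estimate of
# the population mean.
sample_means = [60.8, 57.8, 62.2, 68.6, 67.4, 67.8, 68.3, 65.6, 66.5, 62.1]
print(round(mean(sample_means), 2))
```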

If you repeat the procedure many more times, a histogram of the sample means will look something like this:

![image-2.png](attachment:image-2.png)

Although this sampling distribution is more normally distributed than the population, it still has a bit of a left skew.

Notice also that the spread of the sampling distribution is less than the spread of the population.

The central limit theorem says that the sampling distribution of the mean will always follow a normal distribution when the sample size is sufficiently large. This sampling distribution of the mean isn’t normally distributed because its sample size isn’t sufficiently large.

Now, imagine that you take a large sample of the population. You randomly select 50 retirees and ask them what age they retired.

![image-3.png](attachment:image-3.png)

The mean of the sample is an estimate of the population mean. It’s a precise estimate, because the sample size is large.

mean = 64.8 years

Again, you can repeat this procedure many more times, taking samples of fifty retirees, and calculating the mean of each sample:

![image-4.png](attachment:image-4.png)

In the histogram, you can see that this sampling distribution is normally distributed, as predicted by the central limit theorem.

The standard deviation of this sampling distribution is 0.85 years, which is less than the spread of the small sample sampling distribution, and much less than the spread of the population. If you were to increase the sample size further, the spread would decrease even more.
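
The effect of sample size on spread and skew can be explored with a simulation. The true distribution of retirement ages is not specified, so the sketch below substitutes a hypothetical left-skewed population, 77 minus a gamma-distributed variable, chosen only because it has a mean of 65 and a standard deviation of 6. It assumes NumPy; the function names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(seed=3)

def simulate_sample_means(n, n_samples=20_000):
    """Draw n_samples samples of size n and return the mean of each one."""
    # Hypothetical left-skewed population: 77 - Gamma(shape=4, scale=3)
    # has mean 77 - 12 = 65 and standard deviation sqrt(4) * 3 = 6.
    ages = 77.0 - rng.gamma(shape=4.0, scale=3.0, size=(n_samples, n))
    return ages.mean(axis=1)

def skewness(x):
    """Sample skewness: the mean cubed z-score (0 for a symmetric distribution)."""
    z = (x - x.mean()) / x.std()
    return float((z ** 3).mean())

results = {}
for n in (5, 50):
    means = simulate_sample_means(n)
    results[n] = (means.std(ddof=1), skewness(means))
    print(f"n = {n:>2}: sd = {results[n][0]:.2f} (6/sqrt(n) = {6 / np.sqrt(n):.2f}), "
          f"skewness = {results[n][1]:.2f}")
```

Under this assumed population, the sample means for n = 50 are both less spread out and closer to symmetric (skewness nearer zero) than those for n = 5.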

We can use the central limit theorem formula to describe the sampling distribution:

![image-5.png](attachment:image-5.png)

µ = 65

σ = 6

n = 50

![image-6.png](attachment:image-6.png)
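
The same calculation can be done in code. This sketch uses `statistics.NormalDist` from the Python standard library to build the sampling distribution from µ, σ, and n, and then asks where the central 95% of sample means would fall. The 95% range is an added illustration, not part of the original example.

```python
from math import sqrt
from statistics import NormalDist

mu = 65     # population mean (years)
sigma = 6   # population standard deviation (years)
n = 50      # sample size

# Standard deviation of the sampling distribution (often called the standard error).
standard_error = sigma / sqrt(n)
sampling_dist = NormalDist(mu=mu, sigma=standard_error)

# Central 95% of the sampling distribution of the mean.
lower = sampling_dist.inv_cdf(0.025)
upper = sampling_dist.inv_cdf(0.975)

print(f"standard error = {standard_error:.2f} years")
print(f"about 95% of sample means fall between {lower:.1f} and {upper:.1f} years")
```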

#### Discrete distribution
Approximately 10% of people are left-handed. If we assign a value of 1 to left-handedness and a value of 0 to right-handedness, the probability distribution of left-handedness for the population of all humans looks like this:

![image.png](attachment:image.png)

The population mean is the proportion of people who are left-handed (0.1). The population standard deviation is 0.3.
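
The standard deviation quoted above follows from the standard formula for a 0/1 (Bernoulli) variable. The symbol $p$ is introduced here for the proportion of left-handed people:

$$
\mu = p = 0.1, \qquad \sigma = \sqrt{p(1-p)} = \sqrt{0.1 \times 0.9} = \sqrt{0.09} = 0.3
$$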

Imagine that you take a random sample of five people and ask them whether they’re left-handed.

0	0	0	1	0

The mean of the sample is an estimate of the population mean. It might not be a very precise estimate, since the sample size is only 5.

mean = (0 + 0 + 0 + 1 + 0) / 5

mean = 0.2

Imagine you repeat this process several times, randomly sampling five people and calculating the mean of the sample. This is a sampling distribution of the mean.

0	0	0.4	0.2	0.2	0	0.4	0

If you repeat this process many more times, the distribution will look something like this:

![image-2.png](attachment:image-2.png)

The sampling distribution isn’t normally distributed because the sample size isn’t sufficiently large for the central limit theorem to apply.

As the sample size increases, the sampling distribution looks increasingly similar to a normal distribution, and the spread decreases:

![image-3.png](attachment:image-3.png)

![image-4.png](attachment:image-4.png)

![image-5.png](attachment:image-5.png)

The sampling distribution of the mean for samples with n = 30 approaches normality. When the sample size is increased further to n = 100, the sampling distribution follows a normal distribution.

![image-6.png](attachment:image-6.png)
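
The trend towards normality can also be checked numerically. The sketch below (assuming NumPy) simulates sample means for n = 5, 30, and 100 and reports their standard deviation and skewness. The number of simulated samples is an arbitrary choice, and skewness is used here as a rough stand-in for "how far from a symmetric bell curve".

```python
import numpy as np

rng = np.random.default_rng(seed=4)

p = 0.1             # proportion of left-handed people
sigma = 0.3         # population standard deviation, sqrt(p * (1 - p))
n_samples = 20_000  # number of samples per sample size (an arbitrary choice)

results = {}
for n in (5, 30, 100):
    # The number of left-handers in a sample of size n is Binomial(n, p);
    # dividing by n gives the sample mean (the sample proportion).
    means = rng.binomial(n=n, p=p, size=n_samples) / n
    z = (means - means.mean()) / means.std()
    results[n] = (means.std(ddof=1), float((z ** 3).mean()))
    print(f"n = {n:>3}: sd = {results[n][0]:.3f} "
          f"(sigma/sqrt(n) = {sigma / np.sqrt(n):.3f}), skewness = {results[n][1]:.2f}")
```

Both the spread and the skewness shrink as n increases, which matches the histograms above.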

We can use the central limit theorem formula to describe the sampling distribution for n = 100.

![image-7.png](attachment:image-7.png)

µ = 0.1

σ = 0.3

n = 100

![image-8.png](attachment:image-8.png)
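
As a rough check on how well the normal description fits, the sketch below computes the standard deviation of the sampling distribution for n = 100. It then uses the exact binomial distribution to find the probability that a sample mean lands within two of those standard deviations of µ. The two-standard-deviation window is an illustrative choice, and only the Python standard library is used.

```python
from math import comb, sqrt

p = 0.1   # population mean: proportion of left-handed people
sigma = 0.3
n = 100

standard_error = sigma / sqrt(n)  # 0.3 / 10

# Exact probability that the number of left-handers k in a sample of 100
# satisfies |k/n - p| <= 2 standard errors, from the binomial distribution.
# Working in counts: |k - n*p| <= 2 * standard_error * n, i.e. 4 <= k <= 16.
half_width = round(2 * standard_error * n)   # 6 people
centre = round(n * p)                        # 10 people
exact = sum(
    comb(n, k) * p**k * (1 - p) ** (n - k)
    for k in range(centre - half_width, centre + half_width + 1)
)

print(f"standard error = {standard_error:.3f}")
print(f"exact P(within 2 standard errors) = {exact:.3f}  (normal rule of thumb: about 0.95)")
```

The exact figure will not match the normal rule of thumb perfectly, because the sample mean can only take values in steps of 0.01, but it should be close.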