Deriving the Normal Distribution from the Binomial Distribution

# Purpose

The purpose of this page is to derive the Normal Distribution from the Binomial Distribution, as a reference for the Galton Board project.

# Binomial Distribution

## Why is it Important?

The Binomial Distribution gives the probability of ending at a particular endpoint, k, among the $2^{n}$ possible paths through a tree with n levels. As we saw in the Galton Board, there may be several ways to traverse the board to reach a bucket that lies between the boundaries. For example, with a board that is 4 levels deep, there are 5 ending points (or buckets). To end up in either the first or the last bucket, only one path can be traversed. For the second and the fourth buckets, there are 4 possible paths, and for the middle bucket there are 6 possible paths.

|Bucket Number|Possible Paths|Number of Paths|
| :-: | :-: | :-: |
|1|LLLL|1|
|2|RLLL, LRLL, LLRL, LLLR|4|
|3|RRLL, RLRL, RLLR, LRRL, LRLR, LLRR|6|
|4|LRRR, RLRR, RRLR, RRRL|4|
|5|RRRR|1|

The total number of possible paths is 1 + 4 + 6 + 4 + 1 = 16 = $2^4$. Because the ball is equally likely to go left or right at each level, every path is equally likely, so the probability of reaching any bucket is the number of possible paths to that bucket divided by the total number of possible paths. The following table adds the probability of ending in each bucket.

|Bucket Number|Possible Paths|Number of Paths|Probability|
| :-: | :-: | :-: | :-:|
|1|LLLL|1|1/16 = 0.0625|
|2|RLLL, LRLL, LLRL, LLLR|4|4/16 = 0.25|
|3|RRLL, RLRL, RLLR, LRRL, LRLR, LLRR|6|6/16 = 0.375|
|4|LRRR, RLRR, RRLR, RRRL|4|4/16 = 0.25|
|5|RRRR|1|1/16 = 0.0625|

Note that the sum of the probabilities has to be 1, otherwise there are paths that have not been accounted for.  In this example, the sum of the probabilities: 0.0625 + 0.25 + 0.375 + 0.25 + 0.0625 = 1. 
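
The manual analysis above can be reproduced with a short program. The following is a minimal sketch, not part of the Galton Board project itself: the helper name `enumerate_paths` is hypothetical, and it assumes that the bucket a path ends in is determined by how many right moves the path contains.

```python
from itertools import product


def enumerate_paths(depth):
    """Group every left/right path through a board by its final bucket.

    The bucket index (0-based) is the number of right moves in the path.
    """
    buckets = {k: [] for k in range(depth + 1)}
    for steps in product("LR", repeat=depth):
        path = "".join(steps)
        buckets[path.count("R")].append(path)
    return buckets


buckets = enumerate_paths(4)
total = sum(len(paths) for paths in buckets.values())
for k, paths in buckets.items():
    # The tables above number buckets from 1, hence k + 1.
    print(k + 1, ", ".join(paths), len(paths), len(paths) / total)
```

Brute-force enumeration like this visits all $2^n$ paths, so it is only practical for shallow boards.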

Picture a Galton Board configuration in which the depth is 100. Listing every possible path to a single bucket by hand, as we did above, would take a very long time: there are a total of $2^{100} \approx 1.267 \cdot 10^{30}$ possibilities. Good luck with that without writing a program!

So how can we determine the number of possible paths to get to a bucket (endpoint) for any given board of depth n? The answer lies in the Binomial Distribution.

Let's start by asking how many ways we can get to a bucket. That is answered using combinatorics. Specifically, if we have n steps, the number of possible ways to reach the k-th bucket (counting from 0, so that k is the number of steps taken to the right) is

\begin{align}
\frac{n!}{k! \cdot (n - k)!}
\end{align}

Referring to the example above for a board of depth 4, we can calculate the number of paths to each bucket. Note that the buckets are now numbered from 0, so $0 \leq k \leq n$.

|Bucket Number (0 index)|Possible Paths|Number of Paths|
| :-: | :-: | :-: |
|0|LLLL|$\frac{4!}{0! \cdot (4 - 0)!} = \frac{4!}{4!}$ = 1|
|1|RLLL, LRLL, LLRL, LLLR|$\frac{4!}{1! \cdot (4 - 1)!} = \frac{4!}{3!}$ = 4|
|2|RRLL, RLRL, RLLR, LRRL, LRLR, LLRR|$\frac{4!}{2! \cdot (4 - 2)!} = \frac{4!}{2! \cdot 2!}$ = 6|
|3|LRRR, RLRR, RRLR, RRRL|$\frac{4!}{3! \cdot (4 - 3)!} = \frac{4!}{3!}$ = 4|
|4|RRRR|$\frac{4!}{4! \cdot (4 - 4)!} = \frac{4!}{4!}$ = 1|
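
The formula can be evaluated directly instead of listing paths. The sketch below assumes Python 3.8 or later, where the standard library provides `math.comb` for $\frac{n!}{k! \cdot (n - k)!}$; the helper name `path_counts` is hypothetical.

```python
import math


def path_counts(n):
    """Return the number of paths to each bucket k = 0..n of a depth-n board."""
    return [math.comb(n, k) for k in range(n + 1)]


print(path_counts(4))  # [1, 4, 6, 4, 1]

# The counts over all buckets add up to the 2**n possible paths,
# even for a depth-100 board that could never be listed by hand.
print(sum(path_counts(100)) == 2**100)  # True
```

Python integers have arbitrary precision, so the depth-100 counts are exact rather than floating-point approximations.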

Now that we know how to calculate the number of ways to reach an endpoint k, given a path of n steps, the next step is to determine how to calculate the probability of reaching endpoint k.

This is where the Binomial Distribution takes center stage. It states that if each step goes right with probability p (and left with probability 1 - p), the probability of reaching endpoint k after n steps is

\begin{align}
P = \binom{n}{k} \cdot p^k \cdot (1 - p)^{n-k}
\end{align}

For the Galton board, p = 1/2 because there is an equal chance of the ball going left or right.  Therefore,
\begin{align}
P &= \binom{n}{k} \cdot p^k \cdot (1 - p)^{n-k} \\
&= \binom{n}{k} \cdot (\frac{1}{2} )^k \cdot (1 - \frac{1}{2} )^{n-k} \\
&= \binom{n}{k} \cdot (\frac{1}{2} )^k \cdot (\frac{1}{2} )^{n-k} \\
&= \binom{n}{k} \cdot (\frac{1}{2} )^n \\
&= \frac{n!}{k! \cdot (n - k)!} \cdot (\frac{1}{2} )^n 
\end{align}
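
As a quick consistency check on this result (a standard identity, added here as an aside rather than as part of the derivation), summing over all endpoints and applying the binomial theorem, $\sum_{k=0}^{n} \binom{n}{k} = (1 + 1)^n = 2^n$, gives a total probability of 1:

\begin{align}
\sum_{k=0}^{n} \binom{n}{k} \cdot (\frac{1}{2})^n &= (\frac{1}{2})^n \cdot \sum_{k=0}^{n} \binom{n}{k} \\
&= (\frac{1}{2})^n \cdot 2^n \\
&= 1
\end{align}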

Now the probabilities can be added to the table:

|Bucket Number (0 index)|Possible Paths|Number of Paths|Probability (Number of Paths $\cdot (\frac{1}{2})^n$)|
| :-: | :-: | :-: | :-: |
|0|LLLL|$\frac{4!}{0! \cdot (4 - 0)!} = \frac{4!}{4!}$ = 1| $1 \cdot (\frac{1}{2})^4 $ = 1/16 = 0.0625|
|1|RLLL, LRLL, LLRL, LLLR|$\frac{4!}{1! \cdot (4 - 1)!} = \frac{4!}{3!}$ = 4| $4 \cdot (\frac{1}{2})^4 $ = 4/16 = 0.25|
|2|RRLL, RLRL, RLLR, LRRL, LRLR, LLRR|$\frac{4!}{2! \cdot (4 - 2)!} = \frac{4!}{2! \cdot 2!}$ = 6| $6 \cdot (\frac{1}{2})^4 $ = 6/16 = 0.375|
|3|LRRR, RLRR, RRLR, RRRL|$\frac{4!}{3! \cdot (4 - 3)!} = \frac{4!}{3!}$ = 4| $4 \cdot (\frac{1}{2})^4 $ = 4/16 = 0.25|
|4|RRRR|$\frac{4!}{4! \cdot (4 - 4)!} = \frac{4!}{4!}$ = 1| $1 \cdot (\frac{1}{2})^4 $ = 1/16 = 0.0625|
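
The probability column can also be computed programmatically. The following is a minimal sketch under a few assumptions: the function name `binomial_pmf` is hypothetical, `math.comb` requires Python 3.8 or later, and `fractions.Fraction` is used so that the results are exact rather than rounded floats.

```python
from fractions import Fraction
import math


def binomial_pmf(n, k, p=Fraction(1, 2)):
    """Probability of ending in bucket k after n steps, going right with probability p."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


n = 4
probabilities = [binomial_pmf(n, k) for k in range(n + 1)]
for k, prob in enumerate(probabilities):
    # Print the bucket index, the exact fraction, and its decimal value.
    print(k, prob, float(prob))

# The probabilities over all buckets must sum to exactly 1.
print(sum(probabilities))
```

Leaving `p` as a parameter keeps the general form of the formula, while the default of 1/2 matches the Galton board.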





## Derive the Mean and the Variance

## Sample Calculation