>The Background section for this notebook  is a copy of [Alon Amit's Quora post on the mathematical  constant $e$](https://www.quora.com/How-was-the-mathematical-constant-e-calculated-Why-is-it-important),
which is really about the $\exp$ (or exponential) function.

## Background

The function

$$
(1)\,\,  f(x) =1+ x +\frac{x^2}{2!}+ \frac{x^3}{3!} + \frac{x^4}{4!} + \dots
$$

has been called by Rudin "the most important function in mathematics". It satisfies the simplest of differential equations:

$$
f' = f
$$
 
from which all kinds of wondrous things follow, as I will attempt to show below.

The function usually goes by the name  $\exp$, "the exponential function".   The value

$$
\exp\,(1) = e
$$

is, indeed, one of the most important constants in math. The $\exp$
function can also be written as an actual exponent with base $e$:

$$
\exp\,(x)= e^{x}
$$

But honestly, it's not so much the number $e$ that's so important. It's the function.

## Harmonic Oscillators

In physics, the harmonic oscillator is the most fundamental model of a vibrating object, and since everything is made out of little vibrating objects you can imagine that the harmonic oscillator is pretty important. The equation of motion of a harmonic oscillator is given by

$$
x''= - ax
$$

where $x$ is the position as a function of time, $x''$
is the second derivative of the position with respect to time, aka the acceleration, and
$a$ is some positive number (it is the ratio between the "spring constant" and the mass). 
To make matters simple we can just take $a=1$
for now. It's easy to change units later and fit the actual value. So,

$$
x''= - x.
$$

How do we solve such an equation? Well, we already have a magical function $f$ which satisfies

$$
f' = f
$$

so if we modify it just a bit,


$$
g(x) = f(cx)
$$
 
we get

$$
g' = cg
$$

and

$$
g'' = c^{2}g
$$

So all we need to do is make sure that $c^{2}=-1$
and we've solved our harmonic motion equation. There are two such numbers $c$, namely $i$
and $-i$, so the functions we need are $e^{ix}$ and $e^{-ix}$.
The entire solution space of the equation of motion is just the collection of linear combinations of these two guys, and we can fit a solution to whatever initial conditions we have 
(initial position and initial velocity - two constraints for a 2-dimensional solution space.
Perfect).

Since the initial position and initial velocity happen to be real numbers, the entire solution is a real function of time, so we usually switch from the two functions $e^{ix}$ and $e^{-ix}$
to the two familiar linear combinations $\sin(x)$ and $\cos(x)$, but that's just a simple change of basis.

$$
\sin(x)=\frac{1}{2i}(e^{ix} - e^{-ix}), \quad \cos(x)=\frac{1}{2}(e^{ix} + e^{-ix}).
$$

The function that truly underlies the solutions of fundamental differential equations such as the equation of the harmonic oscillator is the exponential function. The trigonometric functions are simple combinations of exponentials.
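The change of basis between exponentials and trigonometric functions can be checked numerically. The following is a minimal sketch using only the standard library; the helper names `sin_from_exp` and `cos_from_exp` are made up for this illustration.

```python
import cmath
import math

def sin_from_exp(x):
    """sin(x) written as (e^{ix} - e^{-ix}) / (2i)."""
    return (cmath.exp(1j * x) - cmath.exp(-1j * x)) / 2j

def cos_from_exp(x):
    """cos(x) written as (e^{ix} + e^{-ix}) / 2."""
    return (cmath.exp(1j * x) + cmath.exp(-1j * x)) / 2

for x in (0.0, 0.5, 1.0, 2.0):
    s, c = sin_from_exp(x), cos_from_exp(x)
    # The imaginary parts vanish (up to rounding), leaving the real trig values.
    print(f"{x:4.1f}  {s.real: .6f} {math.sin(x): .6f}   {c.real: .6f} {math.cos(x): .6f}")
```

Each row should show the exponential form agreeing with `math.sin` and `math.cos` to the printed precision.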

## Normal Distributions

One of the most fundamental laws of the universe is the Central Limit Theorem, which says that when you add up lots of things that have some uncertainty or randomness built into them, you get something that looks like a bell curve.

This is called "the normal distribution" for a reason. Most everything in nature is distributed like that, because so many things are sums or averages of lots of similar contributions. And of course, the function that describes this all-important curve is an exponential:

$$
f(x) = e^{-x^{2}/2}
$$

(I'm ignoring constants and standard deviations and such to show the essence of the function;
[JMG: i.e., $\mu=0$ and $\sigma=1$]). Again, the fundamental reason why this is the case is the defining differential property of the exponential function; in this case, the normal curve is the solution of

$$
f' + xf = 0
$$
 
which turns out to be the thing that makes this distribution so universal. Up to scaling factors, it is one of the few functions that is its own convolution with itself. And it is the only function that is its own Fourier Transform.
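As a sanity check on the differential equation $f' + xf = 0$, here is a small sketch that approximates $f'$ with a central difference and confirms that the residual is close to zero. The function name `gauss`, the step `h`, and the sample grid are arbitrary choices for this illustration.

```python
import numpy as np

def gauss(x):
    """Unnormalized standard normal curve, f(x) = exp(-x**2 / 2)."""
    return np.exp(-x**2 / 2)

h = 1e-5                                            # finite-difference step (arbitrary)
xs = np.linspace(-3, 3, 13)                         # sample points
deriv = (gauss(xs + h) - gauss(xs - h)) / (2 * h)   # numerical f'(x)
residual = deriv + xs * gauss(xs)                   # f' + x f, should be near 0
print(np.max(np.abs(residual)))
```

The printed maximum residual should be tiny, limited only by the finite-difference approximation and rounding.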

The Fourier transform is a fundamental tool in engineering, physics and mathematics. It's hard to overstate its importance. Most people usually associate Fourier series and transforms with trigonometric functions, but we've already seen that these functions are really just exponentials. In fact, the definition of the general Fourier transform is

$$
F(f)(\omega) = \int^{\infty}_{-\infty} f(x)\, e^{-2\pi i \omega x} \, dx
$$



so here's our favorite $e$ again, or rather the exponential function, at the center of an idea that permeates science and engineering. Signal processing, as an example, cannot get off the ground without Fourier analysis -- and that applies to both the analog and digital domains.
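The self-transform property can be seen numerically. The sketch below approximates the integral of $f(x)\,e^{-2\pi i \omega x}$ by a Riemann sum. One caveat: with this particular convention (the factor $2\pi$ in the exponent), the Gaussian that maps to itself is $e^{-\pi x^{2}}$; the curve $e^{-x^{2}/2}$ plays the same role under the convention that uses $e^{-i\omega x}$ with a $1/\sqrt{2\pi}$ factor. The names `fourier_transform` and `gauss_pi`, the integration window, and the grid size are assumptions of this sketch.

```python
import numpy as np

def fourier_transform(f, omega, half_width=10.0, n=20001):
    """Riemann-sum approximation of the integral of f(x) e^{-2 pi i omega x} dx."""
    x = np.linspace(-half_width, half_width, n)
    dx = x[1] - x[0]
    return np.sum(f(x) * np.exp(-2j * np.pi * omega * x)) * dx

def gauss_pi(x):
    """Gaussian that is its own transform under the 2*pi convention."""
    return np.exp(-np.pi * x**2)

for omega in (0.0, 0.5, 1.0):
    approx = fourier_transform(gauss_pi, omega)
    # Compare the transform at omega with the original function at omega.
    print(f"{omega:.1f}  {approx.real:.8f}  {gauss_pi(omega):.8f}")
```

The two numeric columns should agree to the printed precision.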

These are just some of the reasons why the exponential function, and the number that accompanies it, are so important.

Here ends the extended excerpt from Alon Amit.

The rest of this notebook relates the properties of $f$ just discussed to other
famous properties.

## Estimating $e$:

We have our definition of this important function $f$ (as an infinite series), which satisfies
$f = f'$ (differentiate the series term by term), and we have our definition of $e$ as $f(1)$.

So let's use these two facts to start estimating $e$ computationally.


In [26]:
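import math
import numpy as np

def estimate_exp(n, x=1):
    """
    Estimate `exp(x)` by summing the first n+1 terms of the exp series.

    Leaving it to math.factorial to enforce the requirement that n be a non-negative integer.
    (np.math is no longer available in recent NumPy releases, so the standard library is used.)
    """
    if n == 0:
        return 1
    else:
        # Term n of the series plus the estimate built from terms 0..n-1.
        return (x**n) / math.factorial(n) + estimate_exp(n - 1, x=x)


print("\nSuccessively better estimates of e\n")
N = 5
for n in range(N + 1):
    # The label is the loop index; the estimate on that row uses terms 0..n+1.
    print(f"{n=}: {estimate_exp(n+1):.3f}")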


Successively better estimates of e

n=0: 2.000
n=1: 2.500
n=2: 2.667
n=3: 2.708
n=4: 2.717
n=5: 2.718


Looking up $e$ in our mathematical directory:

In [3]:
np.e

2.718281828459045

Estimating $e^{2}$:

In [16]:
n=8
for i in range(n+1):   
    print(f"{i}: {estimate_exp(i+1,x=2):.3f}")

0: 3.000
1: 5.000
2: 6.333
3: 7.000
4: 7.267
5: 7.356
6: 7.381
7: 7.387
8: 7.389


In [9]:
np.e**2

7.3890560989306495
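The recursive version above recomputes a factorial and a power for every term. As an alternative sketch (not the method used in the cells above), the same partial sums can be built iteratively by turning each term into the next one. The name `exp_series` and the default of 20 terms are choices made for this illustration.

```python
import math

def exp_series(x, n_terms=20):
    """Sum the first n_terms terms of 1 + x + x**2/2! + ... iteratively."""
    total = 0.0
    term = 1.0                 # the k = 0 term, x**0 / 0!
    for k in range(n_terms):
        total += term
        term *= x / (k + 1)    # turn x**k / k! into x**(k+1) / (k+1)!
    return total

print(exp_series(1.0), math.e)
print(exp_series(2.0), math.e**2)
```

Because each term is obtained from the previous one by a single multiplication, no large factorials are ever formed.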

## Proving that exp is -- well, exponential 

OK, this part, especially remembering to test for absolute convergence, comes from [a Flammable Maths video](https://www.youtube.com/watch?v=owdvXpe_-Zc).

Prove that $\exp$ is exponential.  That is, prove it satisfies the following basic relation:

$$
\exp\,(x + y) = \exp\,(x)\,\exp\,(y)
$$

with $\exp\,(x)$ defined as in (1), repeated here:


$$
\exp(x) = 1+ x +\frac{x^2}{2!}+ \frac{x^3}{3!} + \frac{x^4}{4!} + \dots
$$


The proof uses **the Cauchy product of two infinite series** (the identity holds only
under a convergence condition, which I'm skipping for now; see the note
below):

$$
\left ( \sum_{i=0}^{\infty} a_{i} \right ) \left ( \sum_{j=0}^{\infty} b_{j} \right ) =
 \sum_{k=0}^{\infty} c_{k} \, \,
\mbox{ where } \,\, c_{k} =  \sum_{l=0}^{k} a_{l}b_{k-l} 
$$

This somewhat resembles what we once learned about polynomial multiplication
but is different in important ways.  And since infinite sums are always a little unsettling,
it's nice to know that someone with impeccable math (and engineering) credentials
proved this in the 19th century.  Applying this to 

$$
\exp(x)\,\exp(y) = \left (\sum_{i=0}^{\infty} \frac{x^{i}}{i!} \right )\left (\sum_{j=0}^{\infty} \frac{y^{j}}{j!} \right ) 
$$

we get:

$$
(1) \, \,\exp\,(x)\exp\,(y) =  \sum_{k=0}^{\infty}  \left (\sum_{l=0}^{k} \frac{x^{l}}{l!} \frac{y^{k-l}}{(k-l)!} \right ) 
$$

Now let's take a look at where we're headed.  We'd like the double sum on the right of
the equal sign in (1) to equal:

$$
\exp\,(x + y) = 1 + \frac{(x + y)^{1}}{1!} +\frac{(x+y)^2}{2!}+ \frac{(x+y)^3}{3!} + \dots
= \sum_{k=0}^{\infty} \frac{(x+y)^{k}}{k!}
$$

So if life were kind,  the inner sum in (1) would equal a term in the $\exp$ sum:

$$
(2) \,\,  \sum_{l=0}^{k} \frac{x^{l}}{l!} \frac{y^{k-l}}{(k-l)!} \stackrel{?}{=} \frac{(x + y)^{k}}{k!}
$$

To see whether that's true we use the Binomial Theorem, which tells us:


$$
(x + y)^{k}  = \sum_{l=0}^{k} { k \choose l } \, x^{l}\,y^{k-l} = \sum_{l=0}^{k} \frac{k!}{l!\,(k-l)!} \, x^{l}\,y^{k-l}
$$

which means

$$
\frac{1}{k!}(x + y)^{k}  = \sum_{l=0}^{k} \frac{1}{k!} \frac{k!}{l!\,(k-l)!} \, x^{l}\,y^{k-l} = \sum_{l=0}^{k}  \frac{x^{l}\,y^{k-l}}{l!\,(k-l)!},
$$

which is exactly the inner sum in (1).  This essentially completes the proof, but let's tidy up.

Now that we know exactly what the proof needs, we can appear especially clever by backtracking to (1) and
factoring $\frac{1}{k!}$ out of the inner sum.  We do that by multiplying the terms in the inner sum 
by $\frac{k!}{k!}$

$$
(1') \, \,\exp\,(x)\exp\,(y) = \sum_{k=0}^{\infty}\left (   \sum_{l=0}^{k} \frac{x^{l}}{l!} \frac{y^{k-l}}{(k-l)!} \frac{k!}{k!} \right )
= \sum_{k=0}^{\infty}  \frac{1}{k!} \left (\sum_{l=0}^{k} x^{l} y^{k-l} \frac{k!}{l!\,(k-l)!} \right )
= \sum_{k=0}^{\infty} \frac{(x + y)^{k}}{k!} 
= \exp\,(x + y)
$$

And voilà!  A one-line proof (more or less).  We might want to add a note of explanation for the mere mortals:
the substitution of $(x + y)^{k}$ for the second inner sum is justified by the Binomial Theorem.
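The key step, that the inner sum of the Cauchy product equals $(x+y)^{k}/k!$, can be checked numerically. The sketch below does this for a pair of arbitrary sample values; `cauchy_term` is a helper name introduced only for this illustration.

```python
import math

def cauchy_term(x, y, k):
    """Inner sum c_k = sum over l = 0..k of x**l / l! * y**(k-l) / (k-l)!."""
    return sum(x**l / math.factorial(l) * y**(k - l) / math.factorial(k - l)
               for l in range(k + 1))

x, y = 0.7, 1.9               # arbitrary sample values
for k in range(6):
    lhs = cauchy_term(x, y, k)
    rhs = (x + y)**k / math.factorial(k)
    print(f"{k}  {lhs:.10f}  {rhs:.10f}")

# Summing the c_k should reproduce exp(x + y) = exp(x) * exp(y).
total = sum(cauchy_term(x, y, k) for k in range(30))
print(total, math.exp(x) * math.exp(y))
```

The two columns should match row by row, and the final line should show the truncated double sum agreeing with the product of the two exponentials.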



========================================================================================================

**Note**:  The fine print on the Cauchy product says that we need the two series we are multiplying to **converge absolutely**, which we can establish with something called the ratio test. That is, for both series (which, fortunately, have exactly the same form), the limit of the ratio of the $(n+1)$-th term to the $n$-th
term has to be less than 1:

$$
\lim_{n\rightarrow \infty} \left \vert \frac{a_{n+1}}{a_{n}} \right \vert \lt 1.
$$

So we check  the ratio of 

$$
\left \vert \frac{x^{n+1}}{(n + 1)!} \right \vert \,\, \mbox{ to } \,\, \left \vert \frac{x^{n}}{n!} \right \vert,
$$

which is

$$
\left \vert \frac{x^{n+1}} {(n + 1)!}  \right \vert \cdot \left \vert \frac{n!}{x^{n}}  \right \vert  = \left \vert \frac{x}{n+1} \right \vert ,
$$

since the product of  the absolute values of two real numbers is the
absolute value of the product. And so we can confidently assert

$$
\lim_{n\rightarrow \infty}  \left \vert \frac{x}{n+1} \right \vert \lt 1,
$$

since the expression on the left dwindles to 0 as $n$ grows, whatever the value of $x$.
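To see the ratio test at work, the sketch below prints the ratio for a deliberately large sample value of $x$. The ratio starts above 1 but eventually drops below it and keeps shrinking, which is all the test needs. The helper name `term_ratio` and the sample values are arbitrary choices.

```python
def term_ratio(x, n):
    """|a_{n+1} / a_n| for a_n = x**n / n!, which simplifies to |x / (n + 1)|."""
    return abs(x / (n + 1))

x = 50.0                      # a deliberately large sample value
for n in (1, 10, 49, 100, 1000, 100000):
    print(f"{n:>6}  {term_ratio(x, n):.6f}")
```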


## Take 2: A somewhat more satisfying proof

From [one of Dr. Peyam's videos.](https://www.youtube.com/watch?v=TgrT2nTl6IM)

No Cauchy product, in fact no reference to the infinite series at all.  Just make three assumptions:

1.  $f = f'$
2.  $f(0) = 1$
3.  $f > 0$.

Fix a real number $a$ and define

$$
g(x) = \frac{f(x + a)}{f(x)}
$$

The proof that follows uses $g$ and needs assumption 3, which guarantees that $g$ is nowhere undefined,
licensing the shenanigans below. We of course could derive assumption 3
if we were allowed to peek at the infinite series, but we're not.

$$
g'(x) = \frac{f'(x + a)f(x) - f'(x)f(x+a)}{f(x)^2} = \frac{f(x + a)f(x) - f(x)f(x+a)}{f(x)^2} = 0
$$

Hence we have shown $g$ is constant:

$$
C = \frac{f(x + a)}{f(x)}
$$

To find C we look at $x=0$ and use assumption 2.

$$
C = \frac{f(0 + a)}{f(0)} = \frac{f(a)}{1} = f(a)
$$

So now we are pretty much done. We have:

$$
\frac{f(x + a)}{f(x)} =  f(a) \Rightarrow f(x + a) =  f(a) f(x) 
$$

You might say, well the proof via Cauchy product goes through with no assumptions, so isn't that better?
But of course that proof starts by defining $f$ 
as a very special infinite series, which entails all three assumptions above, 
so in fact it assumes at least as much as this proof.

Both proofs have their place. This one shows us that it doesn't take much more than $f' = f$ to give
us exponentiality, and it does so in an insightful way.
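The central claim of this proof, that the ratio $f(x+a)/f(x)$ does not depend on $x$, is easy to observe numerically. In the sketch below `math.exp` stands in for $f$ (an assumption of the illustration, since the proof itself never names the function), and the shift `a` and sample points are arbitrary.

```python
import math

def g(x, a, f=math.exp):
    """Ratio f(x + a) / f(x); f defaults to math.exp as a stand-in for f."""
    return f(x + a) / f(x)

a = 1.3                       # arbitrary shift
values = [g(x, a) for x in (-2.0, 0.0, 0.5, 3.0)]
print(values)                 # all entries should be (nearly) equal
print(math.exp(a))            # and equal to f(a)
```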

## Another proof of exponentiality, another path to e

Why is $e$ the **natural** logarithm base? We will try to motivate that idea here.  In the process we will
discover yet another proof that $\exp$ is exponential.

Choose a small number $\epsilon$, add it to 1, and raise that to the
$1/\epsilon$ power.  Here's an example:

In [30]:
eps = 1e-6
1 + eps

1.000001

Next we raise $1 + \epsilon$ to the power $1/\epsilon$. Observe what happens:

In [12]:
eps = 1e-6
(1+eps)**(1/eps)

2.7182804690957534

Which is a pretty good approximation of $e$:

In [5]:
import numpy as np
np.exp(1)

2.718281828459045

Shrink the base, the number whose power is being taken, ever closer to $1$ for 
ever better approximations of $e$:

In [16]:
eps0 = 1   # previous error; starts at 1 so the first ratio is just the first error
for i in range(1, 21):
    bs = 2**(-i)             # the small number added to 1
    val = (1 + bs)**(1/bs)   # estimate of e
    eps = np.e - val         # error of the estimate
    print(f"{i:>2} {val:.12f}  {eps: .2e}  {eps/eps0:.3f}")
    eps0 = eps

 1 2.250000000000   4.68e-01  0.468
 2 2.441406250000   2.77e-01  0.591
 3 2.565784513950   1.52e-01  0.551
 4 2.637928497367   8.04e-02  0.527
 5 2.676990129378   4.13e-02  0.514
 6 2.697344952565   2.09e-02  0.507
 7 2.707739019688   1.05e-02  0.504
 8 2.712991624253   5.29e-03  0.502
 9 2.715632000169   2.65e-03  0.501
10 2.716955729466   1.33e-03  0.500
11 2.717618482337   6.63e-04  0.500
12 2.717950081190   3.32e-04  0.500
13 2.718115936266   1.66e-04  0.500
14 2.718198877722   8.30e-05  0.500
15 2.718240351930   4.15e-05  0.500
16 2.718261089905   2.07e-05  0.500
17 2.718271459109   1.04e-05  0.500
18 2.718276643766   5.18e-06  0.500
19 2.718279236108   2.59e-06  0.500
20 2.718280532282   1.30e-06  0.500


In [35]:
np.e

2.718281828459045

Notice that `eps`, the difference between $e$ and `val` (our estimate), is converging on 0, with the error roughly halved on each iteration.

What we are suggesting (but not yet proving) is:

$$
(1A)\;\; e = \lim_{n\rightarrow \infty} \; (1+1/n)^{n} 
$$

This limit, which is an alternative definition of $e$ (we used $\exp(1)$ above) also turns out to lead to an alternative definition of $\exp$.

Interest in this limit (discovered by Bernoulli) arose first in the computation of compound interest.
He was exploring what happens to compound interest as the compounding period grows smaller and
smaller. Suppose the annual interest on your principal is 4% and you compound every day. Then
at the end of the first day your principal will be multiplied
by $(1 + .04/365)$, and since that interest is added to the principal for the second day (compounding),
the multiplier after two days will be $(1 + .04/365)^2$.  Now suppose we compound every half day,
or every minute.   The multiplier for the whole year grows each time the compounding frequency is increased, but the rate at which it grows slows. Now suppose we make our period infinitely small.  Does the multiplier grow without bound?  No, indeed. Here's what happens:

For any given compounding frequency $r$ (the number of compounding periods per year), the multiplier after one year is:

$$
(1 + .04/r)^r.
$$

Since

$$
\lim_{r\rightarrow \infty} (1 + .04/r)^{r/.04}  = e,
$$

we have

$$
\lim_{r\rightarrow \infty} (1 + .04/r)^{r} = \lim_{r\rightarrow \infty} ((1 + .04/r)^{r/.04})^{.04} = e ^{.04}.
$$

Demonstrating, we compute the multipliers from daily compounding to compounding 20 times daily,
and compare with $e^{.04}$.

In [28]:
import numpy as np

lim = np.e**(.04)
def multiplier (r):
    return (1 + (.04/r))**(r)

print(f"e^{.04} {lim:8}",end="\n\n")
col_header= "            difference from e^{.04}"
print (col_header)
print("-" * len(col_header))

for (i,r) in enumerate(range(365, 21*365, 365)):
    print(f"{i+1:>4} daily:        {lim - multiplier(r):.4e}")

e^0.04 1.0408107741923882

            difference from e^{.04}
-----------------------------------
   1 daily:        2.2811e-06
   2 daily:        1.1406e-06
   3 daily:        7.6039e-07
   4 daily:        5.7030e-07
   5 daily:        4.5624e-07
   6 daily:        3.8020e-07
   7 daily:        3.2589e-07
   8 daily:        2.8515e-07
   9 daily:        2.5347e-07
  10 daily:        2.2812e-07
  11 daily:        2.0738e-07
  12 daily:        1.9010e-07
  13 daily:        1.7548e-07
  14 daily:        1.6294e-07
  15 daily:        1.5208e-07
  16 daily:        1.4258e-07
  17 daily:        1.3419e-07
  18 daily:        1.2673e-07
  19 daily:        1.2006e-07
  20 daily:        1.1406e-07


Thus, Bernoulli discovered that at the limit the compound interest multiplier is a power of $e$.

Equally interesting, $e$ came into focus because of building logarithm tables, which were at first
constructed simply as tables for making multiplication easier, without realizing that the values for an exponential function were being computed. The idea was to map multiplication problems to
addition problems, as in:

$$
\begin{array}[t]{ll}
0 & 1\\
1 & 2\\
2 & 4\\
3 & 8\\
4 & 16\\
5 & 32\\
  & \dots\\
9 &  512
\end{array}
$$

To multiply 16 by 32 you would look up their logs (base 2) which are 4 and 5, add them, and look up the "antilog" of 9, which is 512.
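The table lookup just described can be mimicked with two dictionaries. This is only an illustrative sketch; the names `antilog`, `log2_table`, and `multiply_with_table` are invented here.

```python
# Build a small base-2 table: logarithm -> number, and its inverse.
antilog = {k: 2**k for k in range(10)}          # 0 -> 1, 1 -> 2, ..., 9 -> 512
log2_table = {v: k for k, v in antilog.items()}

def multiply_with_table(a, b):
    """Multiply two table entries by adding their logs and looking up the antilog."""
    return antilog[log2_table[a] + log2_table[b]]

print(multiply_with_table(16, 32))   # 4 + 5 = 9, and the antilog of 9 is 512
```

A lookup fails with a `KeyError` for any number that is not a power of 2 in the table, which is exactly the limitation a finer-grained table is meant to address.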

This is great, but using 2 as your multiplier only allows you to multiply powers of 2. Suppose you want to multiply other integers, or real positive non-integers. What we need is a finer-grained way of slicing up the positive numbers.  But wait, so far we've only used integer powers of 2.  Suppose we
try fractional powers.   Suppose we use the $1/3$ power.

$$
\begin{array}[t]{ll}
0 & 1\\
1/3 & \sqrt[3]{2}\\
2/3 & {\sqrt[3]{2}}^{2}\\
1 & 2\\
4/3 & {\sqrt[3]{2}}^{4}\\
5/3 & {\sqrt[3]{2}}^{5}\\
2 & 4\\
\end{array}
$$

Indeed why stop there?  For any $n$

$$
\begin{array}[t]{ll}
0 & 1\\
1/n & \sqrt[n]{2}\\
2/n & {\sqrt[n]{2}}^{\,2}\\
\dots &  \dots \\
n/n & 2\\
(n+1)/n & {\sqrt[n]{2}}^{\,n+1}\\
(n+2)/n & {\sqrt[n]{2}}^{\,n+2}\\
\dots & \dots \\
2 & 4\\
\end{array}
$$

Now suppose we want to get still finer-grained entries in order to get greater precision.  We just make $n$ larger. Each time we increase $n$, we get a new value for $2^{1/n}$, the smallest number greater than 1 in our
logarithm table, which we'll call our **atom**.   And the bigger $n$ is, the closer our atom is to 1 and the more accurate our computations with the table get.  So now we are led to ask: Is there an end to this
process?  Well obviously

$$
\lim_{n\rightarrow\infty} 2^{1/n}  = 1,
$$

but that's not much use, since $1^{m} = 1$ for any $m$, so we can't have 1 be our atom. Also
our choice of 2 for our base was arbitrary.  Nothing hangs on that. In fact for any fixed $y>1$:

$$
\lim_{n\rightarrow\infty} y^{1/n}  = 1,
$$

Let's flip things around and ask if there's a natural logarithm atom/base
relationship, that is, any atom $z$ and base $y$ such that:

$$
\lim_{n\rightarrow\infty} z^{n}  = y.
$$

And obviously $z$ can't be some fixed number, since if it is less than 1, $y$ just gets
smaller and smaller, and if it is greater, $y$ gets bigger and bigger.  And we already
saw, $z=1$ is no help.  

But what if we let $z$ depend on $n$ and let it approach 1 as $n$ increases? This brings
us back to the familiar limit in (1A):

$$
\lim_{n\rightarrow\infty} (1+1/n)^{n}  = y.
$$

We've already suggested that this limit exists and is equal to $e$.  And if that's 
true, then there's a very precise sense in which $e$ is the ideal value for $y$.  For 
as $n$ increases, the $n$th root of $e$ grows closer and closer to $(1 + 1/n)$.

And now our table looks like this:

$$
\begin{array}[t]{ll}
0 & 1\\
1/n & 1+1/n\\
2/n & (1+1/n)^2\\
3/n & (1+1/n)^3\\
4/n & (1+1/n)^4\\
5/n & (1+1/n)^5\\
  & \dots\\
n/n &  (1+1/n)^n \approx e\\
\end{array}
$$

Here for completeness is a table comparing the nth root of $e$ with
$(1+1/n)$ as $n$ grows larger:

In [85]:
banner = f"{'n':>3}  {'1+1/n'}  {'(1+1/n)^n':>4} {'e^(1/n)':>4} "
print(banner)
print("="*(len(banner)+1),end="\n")
for n in range(1,140,6):
  print(f"{n:>3}  {1+1/n:.4f}  {(1+1/n)**n:.4f}  {np.e**(1/n):.4f}")

  n  1+1/n  (1+1/n)^n e^(1/n) 
  1  2.0000  2.0000  2.7183
  7  1.1429  2.5465  1.1536
 13  1.0769  2.6206  1.0800
 19  1.0526  2.6500  1.0540
 25  1.0400  2.6658  1.0408
 31  1.0323  2.6757  1.0328
 37  1.0270  2.6824  1.0274
 43  1.0233  2.6873  1.0235
 49  1.0204  2.6911  1.0206
 55  1.0182  2.6940  1.0183
 61  1.0164  2.6963  1.0165
 67  1.0149  2.6983  1.0150
 73  1.0137  2.6999  1.0138
 79  1.0127  2.7013  1.0127
 85  1.0118  2.7025  1.0118
 91  1.0110  2.7035  1.0110
 97  1.0103  2.7044  1.0104
103  1.0097  2.7052  1.0098
109  1.0092  2.7059  1.0092
115  1.0087  2.7066  1.0087
121  1.0083  2.7071  1.0083
127  1.0079  2.7077  1.0079
133  1.0075  2.7081  1.0075
139  1.0072  2.7086  1.0072


The next question is: can we prove (1A)?  To do that
we are going to relate the limit in (1A) (some particular number,
if the limit exists) to what we know
about the function $f$ (or $\exp$ as we have been calling it).

What we know by definition is this:

$$
(2)\,\,\exp\,(1) = e 
$$

More precisely we have convinced ourselves that the infinite sum in (1) has a limit (not proved here) and called the limit $e$.

What we'd like to show is that $e$ is also the limit of another infinite sequence:

$$
(2')\,\, e = \lim_{n\rightarrow \infty} \; (1+1/n)^{n}.
$$

That is, we want to simultaneously show that the limit in (1A) exists and that
its value is the limit of the infinite series we used to define $\exp(1)$.

Do we know an algebraic way of computing the limit in (2')?  Yes, we do, a way that's come up once before in the notebook, the good old undervalued Binomial Theorem (well, undervalued by algebra students, perhaps, not by mathematicians, or by people who compute compound interest).  According to the Binomial Theorem


$$
(x + y)^{n}  = \sum_{k=0}^{n} { n \choose k } \, x^{n-k}\,y^{k} = \sum_{k=0}^{n} \frac{n!}{k!\,(n-k)!} \, x^{n-k}\,y^{k}
$$

Therefore

$$
(3)\,\,\,  (1 + 1/n)^{n}  = \sum_{k=0}^{n} { n \choose k } \, 1^{n-k}\,(1/n)^{k} 
               =  \sum_{k=0}^{n} \frac{1}{k!} \frac{n (n-1) (n-2) \dots (n - k +1)}{n^k}
               =  \sum_{k=0}^{n} \frac{1}{k!} \prod^{k-1}_{m=0} \left ( 1 - \frac{m}{n}\right )
$$


We'd like to show this is equal to $\exp(1)=e$ as defined above.  Since,

$$
\exp(x) =1+ x +\frac{x^2}{2!}+ \frac{x^3}{3!} + \frac{x^4}{4!} + \dots,
$$

then

$$
\exp(1) = 1  +  1 +\frac{1}{2!}+ \frac{1}{3!} + \frac{1}{4!} + \dots
$$

In other words:
$$
e= \exp(1) =  \lim_{n\rightarrow \infty} \sum_{k=0}^{n} \frac{1}{k!} 
$$

And what we'd like to show is:

$$
(4)\,\,\, \lim_{n\rightarrow \infty} (1 + 1/n)^{n}  =  \lim_{n\rightarrow \infty} \sum_{k=0}^{n} \frac{1}{k!} 
$$


**Proof**:  Consider the inner product in the formulation on the right in (3):

$$
\prod^{k-1}_{m=0} \left ( 1 - \frac{m}{n}\right )
$$

If $k$ is fixed, this approaches  1 as $n$ grows large.

$$
\lim_{n\rightarrow \infty} \prod^{k-1}_{m=0} \left ( 1 - \frac{m}{n}\right ) = 1
$$


Therefore as $n$ grows large,

$$
\lim_{n\rightarrow \infty} \frac{1}{k!} \prod^{k-1}_{m=0} \left ( 1 - \frac{m}{n}\right )
= \frac{1}{k!}
$$

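essentially completing our proof. As $n$ grows large, each of the terms in the sum in (3) approaches
$\frac{1}{k!}$. (Passing from this termwise statement to the limit of the whole sum takes one more step, since the number of terms also grows with $n$; it is justified because every term is bounded by $\frac{1}{k!}$, and $\sum 1/k!$ converges.)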


$$
\lim_{n\rightarrow \infty}  (1 + 1/n)^{n} 
               =  \lim_{n\rightarrow \infty} \sum_{k=0}^{n} \frac{1}{k!} \left \lbrack \prod^{k-1}_{m=0} \left ( 1 - \frac{m}{n}\right ) \right \rbrack
               =  \lim_{n\rightarrow \infty} \sum_{k=0}^{n} \frac{1}{k!} 
$$
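The three quantities in this argument can be compared side by side: $(1+1/n)^{n}$ computed directly, the binomial form on the right of (3), and the partial sum of $1/k!$. The sketch below builds each term from the previous one to avoid forming large factorials; the function names `binomial_form` and `factorial_sum` are made up for this illustration.

```python
import math

def binomial_form(n):
    """Right-hand side of (3): sum over k of (1/k!) * prod_{m<k} (1 - m/n)."""
    total = 0.0
    term = 1.0                           # k = 0 term: 1/0! times an empty product
    for k in range(n + 1):
        total += term
        term *= (1 - k / n) / (k + 1)    # step from term k to term k + 1
    return total

def factorial_sum(n):
    """Partial sum of 1/k! for k = 0..n."""
    total = 0.0
    term = 1.0
    for k in range(n + 1):
        total += term
        term /= k + 1
    return total

for n in (1, 10, 100, 1000):
    print(f"{n:>5}  {(1 + 1/n)**n:.8f}  {binomial_form(n):.8f}  {factorial_sum(n):.8f}")
```

The first two numeric columns should agree on every row (they are the same number written two ways), while both creep toward the third as $n$ grows.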


It's easy enough to generalize this from a proof about $\exp(1)$ to a proof about
$\exp (x)$:


$$
(1 + x/n)^{n}  = \sum_{k=0}^{n} { n \choose k } \, 1^{n-k}\,(x/n)^{k} 
               =  \sum_{k=0}^{n}\frac{x^{k}}{k!}  \left \lbrack
                   \frac{n (n-1) (n-2) \dots (n - k +1)}{n^k}
                   \right \rbrack
               =  \sum_{k=0}^{n}\frac{x^{k}}{k!} \left \lbrack  \prod^{k-1}_{m=0} \left ( 1 - \frac{m}{n}\right ) \right \rbrack
$$

Hence:

$$
(3') \,\, \lim_{n\rightarrow \infty}  (1 + x/n)^{n} 
               =  \lim_{n\rightarrow \infty} \sum_{k=0}^{n} \frac{x^{k}}{k!} \left \lbrack \prod^{k-1}_{m=0} \left ( 1 - \frac{m}{n}\right ) \right \rbrack
               =  \lim_{n\rightarrow \infty} \sum_{k=0}^{n} \frac{x^{k}}{k!} 
$$

Where the sum on the right is (1), the sum we used to define $f(x)=\exp(x)$.  One final point remains. We haven't yet shown
that

$$
f(x) = e^{x}.
$$

To show that the left-hand side of (3') is the same as $e^{x}$, we just recall our trick with
compound interest rates and observe that (for $x \neq 0$; the case $x = 0$ is immediate)

$$
\lim_{n\rightarrow \infty} (1 + x/n)^n  = \lim_{n\rightarrow \infty} ((1 + x/n)^{n/x})^{x} = e^{x}.
$$
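A quick numerical look at this limit for a few sample values of $x$ (chosen arbitrarily, including a negative one) is sketched below; `limit_form` is a helper name introduced for the illustration.

```python
import math

def limit_form(x, n):
    """The n-th approximation (1 + x/n)**n to exp(x)."""
    return (1 + x / n)**n

for x in (-1.0, 0.5, 2.0):
    for n in (10, 1000, 100000):
        print(f"x={x:5.1f}  n={n:>6}  {limit_form(x, n):.8f}  exp(x)={math.exp(x):.8f}")
```

For each $x$ the approximation should move steadily toward `math.exp(x)` as $n$ increases.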

And now we've found yet another proof that $\exp$ is exponential.  In fact, we've done more than that.  We have shown that the base of the exponential function is $e$.

###  Back to basics:  The derivative 

We now have two definitions of $\exp$, the one we started with, the infinite sum in (1):

$$
\exp(x) =  \lim_{n\rightarrow \infty} \sum_{k=0}^{n} \frac{x^{k}}{k!} 
$$

and 

$$
\exp(x) = \lim_{n\rightarrow \infty} (1 + x/n)^n 
$$

Now it's trivial to show 

$$
\exp =  \exp\text{'}
$$

by the first definition (exercise).  But what about the second?  Let's rewrite the second:

$$
\exp(x) \begin{array}[t]{l} =  \lim_{n\rightarrow \infty} (1 + x/n)^n \\
                           =                   \lim_{n\rightarrow \infty} g^n  \text{ where}\\
                                       \;\;\;g= (1 + x/n)
              \end{array}
$$



We observe

$$
g' = 1/n
$$

and therefore:

$$
\begin{array}[t]{lcl}
\exp\text{'} & = &   \lim_{n\rightarrow \infty} n g^{n-1} g'\\
   & = &   \lim_{n\rightarrow \infty} n g^{n-1} 1/n\\
    & = &  \lim_{n\rightarrow \infty} g^{n-1} \\
\end{array}
$$

Hence

$$
\begin{array}[t]{lcl}
\exp\text{'} & = & \lim_{n\rightarrow \infty} g^{n-1}\\
\exp(x) &  = & \lim_{n\rightarrow \infty} g^{n}
\end{array}
$$

So the question of whether 

$$
\exp\text{'} = \exp
$$

amounts to whether

$$
\begin{array}[t]{lcl}
 \lim_{n\rightarrow \infty} g^{n-1} & = &  \lim_{n\rightarrow \infty} g^{n}
\end{array}
$$

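And it seems quite reasonable that it does: $g^{n} = g \cdot g^{n-1}$, and $g = 1 + x/n$ approaches 1 as $n$ grows, so the two sequences have the same limit.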
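A numerical look at the two sequences for one sample value of $x$ is sketched below. The helper name `g_powers` and the value `x = 1.5` are arbitrary choices for this illustration.

```python
def g_powers(x, n):
    """Return (g**(n-1), g**n) for g = 1 + x/n."""
    g = 1 + x / n
    return g**(n - 1), g**n

x = 1.5
for n in (10, 100, 10000, 1000000):
    lower, upper = g_powers(x, n)
    # The ratio of the two columns is g itself, which tends to 1.
    print(f"{n:>8}  {lower:.8f}  {upper:.8f}  ratio={upper / lower:.8f}")
```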

Now the same question for

$$
h(x) = \frac{x^{n}}{n}
$$

$$
h\text{'}(x) = \frac{x^{n-1}}{n} \cdot n = x^{n-1}
$$

Here

$$
h\text{'}(x) = n/x \; h(x)
$$

So is the following true?

$$
\lim_{n \rightarrow\infty} h(x) = \lim_{n \rightarrow\infty} h\text{'}(x) 
$$