We have been computing derivatives, but do we really know what they mean?

We have learned derivatives as slopes. But that's an interpretation of derivatives, not a definition.

And it's not a very robust interpretation either - think about a function with multiple inputs and multiple outputs. What does a derivative mean for that? E.g.
- A vector field
- A digital image
- Brownian noise

...for those functions, derivatives make sense, while slopes do not.

## 3 definitions of derivative

### Definition 1

The derivative of function $f$ at $x = a$ is defined as $$
\frac{d f}{d x}|_a = \lim_{x \to a} \frac{f(x) - f(a)}{x - a}
$$.

The benefit of this definition is that we can interpret the derivative as $$
\lim_{\text{input} \to a} \frac{\text{change in output}}{\text{change in input}}
$$, which is conceptually clear.

### Definition 2

The derivative of function $f$ at $x = a$ is defined as $$
\frac{d f}{d x} |_a = \lim_{h \to 0} \frac{f(a + h) - f(a)}{h}
$$

The benefit of this definition is that we can interpret the derivative as $$
\lim_{\text{change in input} \to 0} \frac{\text{change in output}}{\text{change in input}}
$$. This is really the same as Definition 1, rewritten with the change of variables $$
\begin{aligned}
h &= x - a \\
x &= a + h
\end{aligned}
$$, since $x \to a$ exactly when $h \to 0$.
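
To see the two definitions agree in practice, here is a minimal numerical sketch. The names `quotient_def1` and `quotient_def2`, the test function $x^3$, and the point $a = 2$ are illustrative choices, not part of the definitions. Both quotients should approach $3 a^2 = 12$ as $h$ shrinks.

```python
def quotient_def1(f, a, x):
    """Definition 1: (f(x) - f(a)) / (x - a), for x close to a."""
    return (f(x) - f(a)) / (x - a)


def quotient_def2(f, a, h):
    """Definition 2: (f(a + h) - f(a)) / h, for h close to 0."""
    return (f(a + h) - f(a)) / h


def cube(x):
    return x ** 3


a = 2.0
for h in (1e-1, 1e-2, 1e-3, 1e-4):
    x = a + h  # the change of variables x = a + h
    q1 = quotient_def1(cube, a, x)
    q2 = quotient_def2(cube, a, h)
    print(f"h={h:g}  def1={q1:.6f}  def2={q2:.6f}")
```

The two columns differ only by floating-point rounding, because they are the same quotient written in different variables.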

### Definition 3

This is a bit different...

The derivative of function $f$ at $x = a$ is defined as the constant $$
\frac{d f}{d x} |_a = C
$$ satisfying $$
f(a + h) = f(a) + C h + O(h^2)
$$, where $O(h^2)$ stands for a remainder that vanishes at least quadratically as $h \to 0$.

Some people call this a __strong__ derivative, because sometimes it does not exist even when the derivative defined by the limit does.
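
One case where this happens (an example assumed here for illustration, not taken from the definitions above) is $f(x) = |x|^{3/2}$ at $a = 0$. The limit definition gives a derivative of $0$, yet the remainder $|h|^{3/2}$ shrinks more slowly than $h^2$. A small numerical sketch:

```python
def f(x):
    """f(x) = |x|^(3/2): differentiable at 0 in the limit sense."""
    return abs(x) ** 1.5


a = 0.0
C = 0.0  # the limit-based derivative of f at 0

for h in (1e-1, 1e-2, 1e-3, 1e-4):
    quotient = (f(a + h) - f(a)) / h      # tends to 0 like sqrt(h)
    remainder = f(a + h) - f(a) - C * h   # equals |h|^(3/2)
    ratio = abs(remainder) / h ** 2       # would stay bounded if O(h^2)
    print(f"h={h:g}  quotient={quotient:.4g}  remainder/h^2={ratio:.4g}")
```

The quotient column heads to $0$, while the last column keeps growing, so no constant $C$ makes the remainder $O(h^2)$ here.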

Thinking in terms of a Taylor expansion, and using big $O$ to control the higher-order terms, turns out to be illuminating.

### Comparison of the definitions for $f(x) = x^n$ where $n \geq 1$

According to Definitions 1 and 2, we can write the derivative as $$
\begin{equation}
\begin{aligned}{}
\frac{d f}{d x}|_a &= \lim_{h \to 0} \frac{f(a + h) - f(a)}{h} \\
&= \lim_{h \to 0} \frac{(a + h)^n - a^n}{h} \\
&= \lim_{h \to 0} \frac{(a^n + n a^{n - 1} h + O(h^2)) - a^n}{h} \\
&= \lim_{h \to 0} \frac{n a^{n - 1} h + O(h^2)}{h} \\
&= \lim_{h \to 0} \left( n a^{n - 1} + O(h) \right) \\
&= n a^{n - 1}
\end{aligned}
\end{equation}
$$

This agrees with Definition 3, which reads the derivative off from how the output varies with $h$.

According to Definition 3, $$
\begin{equation}
\begin{aligned}{}
f(a + h) &= (a + h)^n \\
&= a^n + n a^{n - 1} h + O(h^2) \\
\end{aligned}
\end{equation}
$$ where the coefficient of the first-order term in $h$, namely $n a^{n - 1}$, is $\frac{d f}{d x}|_a$.
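
As a sanity check on the $O(h^2)$ claim, the sketch below divides the leftover $f(a + h) - f(a) - n a^{n - 1} h$ by $h^2$. The values $a = 2$, $n = 5$ and the helper name `remainder` are hypothetical choices for illustration. If the remainder really is quadratic, the ratio should stay bounded; by the binomial theorem it should approach $\binom{n}{2} a^{n - 2} = 80$.

```python
def remainder(a, n, h):
    """f(a + h) - f(a) - C*h for f(x) = x**n with C = n * a**(n - 1)."""
    C = n * a ** (n - 1)
    return (a + h) ** n - a ** n - C * h


a, n = 2.0, 5
for h in (1e-1, 1e-2, 1e-3):
    ratio = remainder(a, n, h) / h ** 2
    print(f"h={h:g}  remainder/h^2={ratio:.4f}")
```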

### More Examples

### $e^{x + h}$

$$
\begin{equation}
\begin{aligned}{}
e^{x + h} &= e^x e^h \\
&= e^x (1 + h + O(h^2)) & \text{Warning: circular reasoning} \\
&= e^x + e^x h + O(h^2)
\end{aligned}
\end{equation}
$$

### $\cos (x + h)$

$$
\begin{equation}
\begin{aligned}{}
\cos (x + h) &= \cos x \cos h - \sin x \sin h & \text{[1]} \\
&= \cos x (1 + O(h^2)) - \sin x (h + O(h^3)) \\
&= \cos x - (\sin x) h + O(h^2)
\end{aligned}
\end{equation}
$$

[1](http://trigonography.com/2015/09/28/angle-sum-and-difference-for-sine-and-cosine/)

### $\sqrt {x + h}$

$$
\begin{equation}
\begin{aligned}{}
\sqrt{x + h} &= \sqrt{x (1 + \frac{h}{x})} \\
&= \sqrt{x} (1 + \frac{h}{x})^{\frac{1}{2}} \\
&= \sqrt{x} (1 + \frac{1}{2}\frac{h}{x} + O(h^2)) & \text{Binomial series} \\
&= \sqrt{x} + \frac{1}{2} x^{- \frac{1}{2}} h + O(h^2) & \text{when $x > 0$}
\end{aligned}
\end{equation}
$$
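
The three expansions above can be checked numerically. The sketch below is only an illustration: the table `EXAMPLES`, the helper `remainder`, and the evaluation point $x = 1$ are assumptions made for the check. For each function it subtracts the claimed first-order term and tests whether what is left behaves like $h^2$, in which case halving $h$ should shrink it by a factor of about 4.

```python
import math

# (name, f, claimed first-order coefficient as a function of x)
EXAMPLES = [
    ("exp", math.exp, math.exp),
    ("cos", math.cos, lambda x: -math.sin(x)),
    ("sqrt", math.sqrt, lambda x: 0.5 / math.sqrt(x)),
]


def remainder(f, coeff, x, h):
    """Return f(x + h) - f(x) - C*h, where C = coeff(x)."""
    return f(x + h) - f(x) - coeff(x) * h


x = 1.0  # hypothetical evaluation point; must be positive for sqrt
for name, f, coeff in EXAMPLES:
    r1 = remainder(f, coeff, x, 1e-2)
    r2 = remainder(f, coeff, x, 5e-3)
    # For an O(h^2) remainder, halving h should shrink it by about 4.
    print(f"{name}: remainder ratio when h is halved = {r1 / r2:.3f}")
```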



### Notations of derivatives

The derivative of $y = f(x)$ can be denoted by
- $\frac{d f}{d x}$ or $\frac{d y}{d x}$ (the best ways)
- $f^{\prime}$, $\dot{y}$, or $d f$ (OK ways, but they don't record which variable is changing)

### Example applications of derivatives

The most common incarnations of derivatives are rates of change with respect to time.
- Velocity: $v = \frac{d x}{d t}$.
- Acceleration: $a = \frac{d v}{d t}$.

But there are other examples too.
- Current: $I = \frac{d Q}{d t}$, where $Q$ is the charge.
- Chemical reaction rates: $r_{p} = \frac{d[P]}{d t}$, where $[P]$ is the concentration of product $P$.

And there are examples of derivatives that don't involve time at all.
- Spring constant: $k = \frac{d(\text{force})}{d(\text{deflection})}$.
- Elastic modulus: $\lambda = \frac{d(\text{stress})}{d(\text{strain})}$.
- Viscosity/Shear stress: $\tau = \mu \frac{d(\text{velocity})}{d(\text{height})}$.
- Marginal rates in economics: $mr = \frac{d(\text{tax})}{d(\text{income})}$.
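
To make the time-based examples concrete, here is a rough sketch that estimates velocity and acceleration from sampled positions using central differences. The data are hypothetical (free fall with $x = 4.9 t^2$, sampled every $0.1$ seconds), and `central_difference` is a helper name invented for this illustration.

```python
def central_difference(values, dt):
    """Approximate d(values)/dt at the interior sample points."""
    return [
        (values[i + 1] - values[i - 1]) / (2 * dt)
        for i in range(1, len(values) - 1)
    ]


dt = 0.1                                    # hypothetical sampling interval, seconds
times = [i * dt for i in range(11)]         # 11 samples from t = 0 to t = 1
positions = [4.9 * t ** 2 for t in times]   # hypothetical free-fall data, meters

velocities = central_difference(positions, dt)       # v = dx/dt
accelerations = central_difference(velocities, dt)   # a = dv/dt

print(velocities[:3])
print(accelerations[:3])
```

Each pass drops the first and last sample, since a central difference needs a neighbor on both sides. For this quadratic data the estimated acceleration comes out at about $9.8$ at every remaining point.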