### Derivatives and Differentiation

$\large f: \mathbb{R} \rightarrow \mathbb{R}$

$\large f'(x) = \lim_{h \rightarrow 0} \frac{f(x+h) - f(x)}{h}$

Let $\large u = f(x) = 3x^2-4x$.

In [1]:
def f(x):
    return 3 * x ** 2 - 4 * x

By setting $x=1$ and letting $h$ approach $0$, the difference quotient $\large \frac{f(x+h) - f(x)}{h}$ approaches $2$, which agrees with the exact derivative $f'(x) = 6x - 4$ evaluated at $x=1$.

In [2]:
def numerical_lim(f, x, h):
    # Forward difference quotient of f at x with step size h
    return (f(x + h) - f(x)) / h

h = 0.1
for i in range(5):
    print(f'h={h:.5f}, numerical limit={numerical_lim(f, 1, h):.5f}')
    h *= 0.1

h=0.10000, numerical limit=2.30000
h=0.01000, numerical limit=2.03000
h=0.00100, numerical limit=2.00300
h=0.00010, numerical limit=2.00030
h=0.00001, numerical limit=2.00003
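
The printed values follow a simple pattern, which can be seen by expanding the difference quotient for this particular $f$ at $x=1$ (a short worked derivation, using $f(1) = -1$):

$\large \frac{f(1+h) - f(1)}{h} = \frac{3(1+h)^2 - 4(1+h) + 1}{h} = \frac{2h + 3h^2}{h} = 2 + 3h$

So the error of the difference quotient is exactly $3h$ for this function, which matches the output ($2.3$ at $h=0.1$, $2.03$ at $h=0.01$, and so on) and vanishes as $h \rightarrow 0$.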


### Different Notations of Derivatives

$\large f'(x) = y' = \frac{dy}{dx} = \frac{df}{dx} = \frac{d}{dx} f(x) = Df(x) = D_x f(x)$

### Derivatives of Common Functions

1. $DC = 0$ ($C$ is a constant)
2. $Dx^n = nx^{n-1}$ (the power rule; $n$ is any real number)
3. $De^x = e^x$
4. $D\ln(x) = 1/x$ (for $x > 0$)
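
The four formulas above can be sanity-checked numerically. The sketch below is only an illustration: the helper `central_diff`, the test point `x = 2.0`, and the sample constant and exponent are assumptions made for this example, not part of the original notes.

```python
import math

def central_diff(func, x, h=1e-6):
    # Symmetric difference quotient; approximates func'(x) for small h.
    return (func(x + h) - func(x - h)) / (2 * h)

x = 2.0  # arbitrary test point, chosen with x > 0 so that ln(x) is defined

# Each entry: (label, function, derivative predicted by the formula)
checks = [
    ('C = 5', lambda t: 5.0, 0.0),
    ('x^3', lambda t: t ** 3, 3 * x ** 2),
    ('e^x', math.exp, math.exp(x)),
    ('ln(x)', math.log, 1 / x),
]

for label, func, expected in checks:
    print(f'{label}: numerical={central_diff(func, x):.5f}, formula={expected:.5f}')
```

The two columns should agree to the printed precision; a mismatch would indicate a mistake in either the formula or the helper.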

### Differentiation Rules

1. $\large \frac{d}{dx} [Cf(x)] = C \frac{d}{dx} f(x)$, (constant multiple rule)
2. $\large \frac{d}{dx} [f(x) + g(x)] = \frac{d}{dx} f(x) + \frac{d}{dx} g(x)$, (sum rule)
3. $\large \frac{d}{dx} [f(x)g(x)] = f(x) \frac{d}{dx} [g(x)] + g(x) \frac{d}{dx} [f(x)]$, (product rule)
4. $\large \frac{d}{dx} \left[\frac{f(x)}{g(x)}\right] = \frac{g(x) \frac{d}{dx} [f(x)] - f(x) \frac{d}{dx} [g(x)]}{[g(x)]^2}$, (quotient rule)
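
As a rough check of the product and quotient rules, the following sketch compares both formulas against a numerical derivative. It reuses $f(x) = 3x^2 - 4x$ from earlier; the choice $g(x) = e^x$, the helper `central_diff`, and the test point are assumptions made for illustration.

```python
import math

def central_diff(func, x, h=1e-6):
    # Symmetric difference quotient; approximates func'(x) for small h.
    return (func(x + h) - func(x - h)) / (2 * h)

def f(x):
    return 3 * x ** 2 - 4 * x

def df(x):
    # Derivative of f via the power, constant multiple, and sum rules.
    return 6 * x - 4

g = math.exp   # g(x) = e^x (assumed example function)
dg = math.exp  # its derivative is again e^x

x = 1.0
# Right-hand sides of the product and quotient rules
product_rule = f(x) * dg(x) + g(x) * df(x)
quotient_rule = (g(x) * df(x) - f(x) * dg(x)) / g(x) ** 2

# Numerical derivatives of f*g and f/g for comparison
product_numeric = central_diff(lambda t: f(t) * g(t), x)
quotient_numeric = central_diff(lambda t: f(t) / g(t), x)

print(f'product:  rule={product_rule:.5f}, numerical={product_numeric:.5f}')
print(f'quotient: rule={quotient_rule:.5f}, numerical={quotient_numeric:.5f}')
```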

### Partial Derivatives

Let $y = f(x_1, x_2, \ldots, x_n)$ be a function of $n$ variables. The $\textit{partial derivative}$ of $y$ with respect to its $i^{th}$ variable $x_i$ is:

$\large \frac{\partial y}{\partial x_i} = \lim_{h \rightarrow 0} \frac{f(x_1, \ldots, x_{i-1}, x_i+h, x_{i+1}, \ldots, x_n) - f(x_1, \ldots, x_i, \ldots, x_n)}{h}$

To calculate $\large \frac{\partial y}{\partial x_i}$, we can simply treat $x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_n$ as constants and calculate the derivative of $y$ with respect to $x_i$.

Partial derivatives can be denoted by any of the following notations:
$\large \frac{\partial y}{\partial x_i} = \frac{\partial f}{\partial x_i} = f_{x_i} = f_i = D_i f = D_{x_i} f.$
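
The limit definition of the partial derivative translates almost directly into code: shift only the $i^{th}$ argument by $h$ and hold the rest fixed. This is a minimal sketch; the helper `partial` and the two-variable function `example` are hypothetical names introduced for illustration.

```python
def partial(func, point, i, h=1e-6):
    # Forward difference in coordinate i only; all other coordinates stay fixed.
    shifted = list(point)
    shifted[i] += h
    return (func(*shifted) - func(*point)) / h

def example(x1, x2):
    # Hypothetical function chosen for illustration: x1^2 * x2 + 3 * x2
    return x1 ** 2 * x2 + 3 * x2

point = (2.0, 3.0)
# Treating x2 as a constant gives 2 * x1 * x2;
# treating x1 as a constant gives x1 ** 2 + 3.
analytic = (2 * point[0] * point[1], point[0] ** 2 + 3)

for i in range(2):
    print(f'i={i + 1}: numerical={partial(example, point, i):.4f}, analytic={analytic[i]:.4f}')
```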

### Gradients

We can concatenate partial derivatives of a multivariate function with respect to all its variables to obtain the $\textit {gradient}$ vector of the function.

$\large \nabla_{\mathbf{x}} f(\mathbf{x}) = \bigg[\frac{\partial f(\mathbf{x})}{\partial x_1}, \frac{\partial f(\mathbf{x})}{\partial x_2}, \ldots, \frac{\partial f(\mathbf{x})}{\partial x_n}\bigg]$

$\large \nabla_{\mathbf{x}} f(\mathbf{x})$ is replaced by $\large \nabla f(\mathbf{x})$ when there is no ambiguity.
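
To make the gradient concrete, the sketch below collects one numerical partial derivative per coordinate into a list. The function names and the test function $f(\mathbf{x}) = x_1^2 + \ldots + x_n^2$, whose gradient is $2\mathbf{x}$, are assumptions chosen for this example.

```python
def numerical_gradient(func, x, h=1e-6):
    # One central difference per coordinate, collected into a list.
    grad = []
    for i in range(len(x)):
        up = list(x)
        down = list(x)
        up[i] += h
        down[i] -= h
        grad.append((func(up) - func(down)) / (2 * h))
    return grad

def squared_norm(x):
    # f(x) = x_1^2 + ... + x_n^2, whose gradient is 2x.
    return sum(xi ** 2 for xi in x)

x = [1.0, -2.0, 0.5]
grad = numerical_gradient(squared_norm, x)
print('numerical gradient:', [round(gi, 4) for gi in grad])
print('expected 2x:       ', [2 * xi for xi in x])
```

Each entry of the result approximates one partial derivative, so the cost of this approach grows with the number of variables $n$.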


### Chain Rule

Suppose that the functions $y=f(u)$ and $u=g(x)$ are both differentiable. Then the chain rule states that

$\large \frac{dy}{dx} = \frac{dy}{du} \frac{du}{dx}$
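
A small numerical check can illustrate the chain rule. The sketch below takes $y = f(u) = 3u^2 - 4u$ as the outer function and, as an assumption made purely for illustration, $u = g(x) = x^2 + 1$ as the inner function; `central_diff` is a hypothetical helper.

```python
def central_diff(func, x, h=1e-6):
    # Symmetric difference quotient; approximates func'(x) for small h.
    return (func(x + h) - func(x - h)) / (2 * h)

def f(u):
    return 3 * u ** 2 - 4 * u  # y = f(u)

def g(x):
    return x ** 2 + 1  # u = g(x), an assumed inner function

x = 1.0
u = g(x)
dy_du = 6 * u - 4  # derivative of f with respect to u
du_dx = 2 * x      # derivative of g with respect to x
chain_rule = dy_du * du_dx

# Differentiate the composition f(g(x)) numerically for comparison
numeric = central_diff(lambda t: f(g(t)), x)

print(f'chain rule={chain_rule:.5f}, numerical={numeric:.5f}')
```

Here $\frac{dy}{du}$ is evaluated at $u = g(x)$, not at $x$, which is an easy detail to overlook when applying the rule by hand.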