## Refresher on Derivatives of Vector-Valued Functions

A function $f: \mathbb{R}^n \rightarrow \mathbb{R}^m$ is differentiable at $a \in \mathbb{R}^n$ if there is an $m \times n$ matrix $A$ such that:

$$ \lim_{x \to a} \frac{|f(x) - f(a) - A \cdot (x-a) |} {|x-a|} = 0$$ 

If such a matrix exists, it is denoted by $Df(a)$ and is called the Jacobian of $f$ at $a$.

Note that $|x-a|$ is the Euclidean distance $\sqrt{(x_1 - a_1)^2 + (x_2 - a_2)^2 + \cdots + (x_n-a_n)^2}$ between $x$ and $a$, which is a real-valued scalar.
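The limit in the definition can be checked numerically. The sketch below is only an illustration: the map `f`, the point `a`, and the approach direction are assumptions chosen for the example, and `jacobian_f` is a hand-computed candidate for $A$. If $A$ really is $Df(a)$, the ratio should shrink toward zero as $x$ approaches $a$.

```python
import math

def f(x):
    """Example map f: R^2 -> R^2 (a hypothetical choice for illustration)."""
    x1, x2 = x
    return [x1 ** 2 * x2, x1 + math.sin(x2)]

def jacobian_f(a):
    """Hand-computed candidate matrix A = Df(a) for the example f above."""
    a1, a2 = a
    return [[2 * a1 * a2, a1 ** 2],
            [1.0, math.cos(a2)]]

def norm(v):
    """Euclidean norm |v|."""
    return math.sqrt(sum(c * c for c in v))

def remainder_ratio(a, h):
    """Return |f(a + h) - f(a) - A h| / |h|, where x = a + h."""
    A = jacobian_f(a)
    x = [ai + hi for ai, hi in zip(a, h)]
    fx, fa = f(x), f(a)
    # Matrix-vector product A (x - a).
    Ah = [sum(A[i][j] * h[j] for j in range(len(h))) for i in range(len(A))]
    remainder = [fx[i] - fa[i] - Ah[i] for i in range(len(A))]
    return norm(remainder) / norm(h)

a = [1.0, 2.0]
direction = [0.6, -0.8]  # unit vector, so |x - a| equals t below

ratios = []
for k in range(1, 6):
    t = 10.0 ** (-k)
    h = [t * d for d in direction]
    ratios.append(remainder_ratio(a, h))
    print(f"|x - a| = {t:.0e}  ratio = {ratios[-1]:.3e}")
```

Approaching along a single direction does not prove differentiability; it only gives a quick sanity check that a candidate matrix behaves the way the definition requires.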

This definition can be motivated by the single-variable definition of a derivative:

$$
f'(a) = \lim_{x\to a} \frac{f(x) - f(a)}{x-a}
$$
This holds if and only if:

$$
0= \lim_{x\to a} \frac{f(x) - f(a)}{x-a} - f'(a)
$$

which can be transformed to:

$$
0= \lim_{x\to a} \frac{f(x) - f(a) - f'(a)(x-a)}{x-a}
$$

Since a quantity tends to zero exactly when its absolute value does, this is equivalent to taking the distance from the origin of the numerator and of the denominator:

$$
0= \lim_{x\to a} \frac{|f(x) - f(a) - f'(a)(x-a)|}{|x-a|}
$$
(This form is the one that generalizes, since division by a vector is not defined.)

Here $f'(a)$ plays the role of our Jacobian matrix of shape $m \times n$. From now on I refer to this matrix as $Df(x)$, though in the example above
it is evaluated at the point $a$.


### Defining the Jacobian in terms of coordinates and Indices

#### Defining the Jacobian via the component functions of $f$
Let the function $f: \mathbb{R}^n \to \mathbb{R}^m$ be given by the $m$ differentiable functions $f_1(x_1,\dots,x_n),\dots, f_m(x_1,\dots, x_n)$ such that:

$$
f(x_1,\dots,x_n) = \begin{bmatrix}
f_1(x_1,\dots,x_n) \\
\vdots \\
f_m(x_1,\dots,x_n)
\end{bmatrix}
$$

Since $f$ can be represented as a family of functions $f_i$, indexed from 1 to $m$, we can take the derivative of each function $f_i$ for $i \in 1\dots m$:

$$
Df_i(x_1,\dots,x_n) = v_i \ \text{such that} \ v_i \in \mathbb{R}^n
$$
In this case, we know $v_i$ to be $n$-dimensional: each $f_i$ maps $\mathbb{R}^n$ to $\mathbb{R}$, so by our original definition its derivative is a $1 \times n$ matrix. To compute
$Df(a)$ (written $f'(a)$ above), we need the $i$-th row of $A$ to describe the tangent, that is the linear approximation, of $f_i$ at $a$. This
approximation takes the displacement vector $x-a$ as its input.

Recall the single-variable case from introductory calculus:
$$
y = f(a) + f'(a)(x-a)
$$ 
This is the tangent line at the point $a$ to the graph of a differentiable function $f$ (note that it is *linear* in the displacement $x-a$).

We want to build the same representation for a multivariable function $f_i(x_1,\dots,x_n)$. To do so, we rely on partial derivatives, each of which gives the slope of a *linear* tangent line
with respect to one individual input $x_j$, where $j \in 1 \dots n$. Thus, we can represent each row of $Df(x)$ as:
$$
Df_i(x_1,..., x_n) = \left (\frac{\partial f_i}{\partial x_1} ,\frac{\partial f_i}{\partial x_2}, \cdots, \frac{\partial f_i}{\partial x_n}\right)
$$
As a result, applying $Df_i(a)$ to the displacement $x-a$ gives a linear approximation of how $f_i$ changes under small wiggles of the input. The distance between this approximation and
the exact difference $f_i(x) - f_i(a)$ goes to zero faster than $|x-a|$ as $x \to a$.
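As a small worked example of one such row (the function and the points here are assumptions chosen for illustration), take $f_i(x_1, x_2) = x_1^2 x_2$ and $a = (1, 2)$:

$$
Df_i(x_1, x_2) = \left( 2 x_1 x_2, \ x_1^2 \right), \qquad Df_i(a) = (4, \ 1)
$$

For $x = (1.1, 2.1)$ the displacement is $x - a = (0.1, 0.1)$, so the linear approximation gives:

$$
f_i(a) + Df_i(a) \cdot (x - a) = 2 + 4(0.1) + 1(0.1) = 2.5
$$

while the exact value is $f_i(x) = 1.1^2 \cdot 2.1 = 2.541$. The gap of $0.041$ is small compared to the size of the displacement, and it shrinks further as $x$ moves closer to $a$.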

Stacking these rows for each component function $f_1, \dots, f_m$ of $f$, our Jacobian $Df(x_1,\dots,x_n)$ is:



$$
Df(x) = \begin{bmatrix}
\frac{\partial f_1}{\partial x_1} & \frac{\partial f_1}{\partial x_2} & \cdots & \frac{\partial f_1}{\partial x_n} \\
\frac{\partial f_2}{\partial x_1} & \frac{\partial f_2}{\partial x_2} & \cdots & \frac{\partial f_2}{\partial x_n} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial f_m}{\partial x_1} & \frac{\partial f_m}{\partial x_2} & \cdots & \frac{\partial f_m}{\partial x_n}
\end{bmatrix}
$$
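One way to sanity-check a hand-computed Jacobian is to compare it against finite differences. The following is a minimal sketch, not a definitive implementation: the example map `f` from $\mathbb{R}^3$ to $\mathbb{R}^2$, the helper names, and the step size `eps` are all assumptions made for illustration. Entry `[i][j]` of the result approximates $\frac{\partial f_i}{\partial x_j}$, matching the row and column layout of the matrix above.

```python
import math

def f(x):
    """Example map f: R^3 -> R^2 (a hypothetical choice for illustration)."""
    x1, x2, x3 = x
    return [x1 * x2, math.sin(x3) + x1 ** 2]

def analytic_jacobian(x):
    """Hand-computed m x n matrix of partial derivatives for the example f."""
    x1, x2, x3 = x
    return [[x2, x1, 0.0],
            [2 * x1, 0.0, math.cos(x3)]]

def numerical_jacobian(func, x, eps=1e-6):
    """Approximate the Jacobian by central differences, one column per input x_j."""
    m = len(func(x))
    n = len(x)
    J = [[0.0] * n for _ in range(m)]
    for j in range(n):
        x_plus = list(x)
        x_minus = list(x)
        x_plus[j] += eps
        x_minus[j] -= eps
        f_plus, f_minus = func(x_plus), func(x_minus)
        for i in range(m):
            # Row i is the component f_i, column j is the input x_j.
            J[i][j] = (f_plus[i] - f_minus[i]) / (2 * eps)
    return J

x = [1.0, 2.0, 0.5]
J_num = numerical_jacobian(f, x)
J_exact = analytic_jacobian(x)

# Largest entrywise disagreement between the two matrices.
max_err = max(abs(J_num[i][j] - J_exact[i][j])
              for i in range(len(J_exact))
              for j in range(len(J_exact[0])))
print("max entrywise error:", max_err)
```

The shape of `J_num` is $m \times n$ (here $2 \times 3$): one row per output component and one column per input variable.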

#### Defining coordinates from domain and codomain dimensions


### Defining Differentials of Compositions of Functions

Let $f: \mathbb{R}^n \rightarrow \mathbb{R}^m$ and $g: \mathbb{R}^m \rightarrow \mathbb{R}^l$ be differentiable functions. Then the composition
$$
g \circ f: \mathbb{R}^n \rightarrow \mathbb{R}^l
$$
is also differentiable, and if $f(a)=b$, its derivative at $a$ is given by:
$$
D(g\circ f)(a) = D(g)(b) \cdot D(f)(a)