<a href="https://colab.research.google.com/github/jfogarty/machine-learning-intro-workshop/blob/master/misc/math-examples.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Some Examples using LaTeX Math Notations in Jupyter Notebooks

The markdown in Jupyter Notebooks supports inline [mathematical notation LaTeX macros](http://docs.mathjax.org/en/latest/tex.html) via the [MathJax](https://www.mathjax.org/) JavaScript display engine. This is really useful for documenting math-heavy machine learning models.

I've converted a few [Mathematics Stack Exchange](https://math.stackexchange.com/) posts into Colab-compatible cells as examples to get you started.
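As a minimal sketch of what such a cell contains (the two formulas are illustrative and not taken from the posts below), a markdown cell can mix inline math in single dollar signs with display math in double dollar signs:

```latex
Inline math: the identity $\sin^2\theta + \cos^2\theta = 1$ sits inside a sentence.

Display math is centered on its own line:
$$\int_0^1 x^2 \, dx = \frac{1}{3}$$
```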

### NOTE! This Notebook is Not Yet GitHub Viewable

Unfortunately, the [GitHub Jupyter Notebook Viewer](https://help.github.com/en/articles/working-with-jupyter-notebook-files-on-github) does not handle MathJax rendering well, so you'll need to open this notebook in [nbviewer](https://nbviewer.jupyter.org/github/jfogarty/machine-learning-intro-workshop/blob/master/misc/math-examples.ipynb), [Colab](https://colab.research.google.com/github/jfogarty/machine-learning-intro-workshop/blob/master/misc/math-examples.ipynb) or some other [Jupyter](https://jupyter.org/) notebook server.



## Observation on rot (curl), div and grad on a vector field?

- [From](https://math.stackexchange.com/questions/2515744/observation-on-rot-curl-div-and-grad-on-a-vector-field) : https://math.stackexchange.com/questions/2515744/observation-on-rot-curl-div-and-grad-on-a-vector-field


Let $\vec{F}(x, y, z) = (F_1, F_2, F_3)$ be a vector-valued function describing a vector field. Then the rotation (curl) and divergence of the field are:

$$\nabla \times \vec{F} = \operatorname{curl}\vec{F} = \color{red}{\left(\frac{\partial{F_3}}{\partial{y}} - \frac{\partial{F_2}}{\partial{z}},\ \frac{\partial{F_1}}{\partial{z}} - \frac{\partial{F_3}}{\partial{x}},\ \frac{\partial{F_2}}{\partial{x}} - \frac{\partial{F_1}}{\partial{y}}\right)}$$

$$\nabla \cdot \vec{F} = \operatorname{div}\vec{F} = \color{blue}{\frac{\partial{F_1}}{\partial{x}} + \frac{\partial{F_2}}{\partial{y}} + \frac{\partial{F_3}}{\partial{z}}}$$

If the elements involved in the above relations are placed in a matrix, one gets:

$$A= \begin{bmatrix}
  \color{blue}{\frac{\partial{F_1}}{\partial{x}}} & \color{red}{\frac{\partial{F_1}}{\partial{y}}} & \color{red}{\frac{\partial{F_1}}{\partial{z}}} \\
  \color{red}{\frac{\partial{F_2}}{\partial{x}}} & 
  \color{blue}{\frac{\partial{F_2}}{\partial{y}}} &
  \color{red}{\frac{\partial{F_2}}{\partial{z}}} \\
   \color{red}{\frac{\partial{F_3}}{\partial{x}}} & 
  \color{red}{\frac{\partial{F_3}}{\partial{y}}} &
  \color{blue}{\frac{\partial{F_3}}{\partial{z}}}  
\end{bmatrix}$$
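To make the matrix concrete, here is a rough numerical sketch in plain Python. It approximates $A$ by central differences and then reads the divergence off the diagonal and the curl off the off-diagonal entries, using the standard right-handed convention for the curl components. The field `example_field`, the evaluation point, and the step size `h` are assumptions chosen only for illustration.

```python
def jacobian(F, p, h=1e-6):
    """Approximate A[i][j] = dF_i/dx_j at point p by central differences."""
    n = len(p)
    m = len(F(*p))
    A = [[0.0] * n for _ in range(m)]
    for j in range(n):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        f_plus, f_minus = F(*plus), F(*minus)
        for i in range(m):
            A[i][j] = (f_plus[i] - f_minus[i]) / (2 * h)
    return A


def example_field(x, y, z):
    # Hypothetical field for illustration: F = (x*y, y*z, z*x)
    return (x * y, y * z, z * x)


A = jacobian(example_field, (1.0, 2.0, 3.0))

# Divergence: sum of the diagonal (blue) entries.
div_F = A[0][0] + A[1][1] + A[2][2]

# Curl: differences of mirrored off-diagonal (red) entries.
curl_F = (A[2][1] - A[1][2], A[0][2] - A[2][0], A[1][0] - A[0][1])

print(div_F, curl_F)
```

For this field the analytic values are $\operatorname{div}\vec{F} = x + y + z$ and $\operatorname{curl}\vec{F} = (-y, -z, -x)$, so the printed numbers should be close to $6$ and $(-2, -3, -1)$.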

1. Isn't the above matrix the gradient, $grad$, of $\vec{F}$ ?

2. Isn't this what a [Jacobian](https://math.stackexchange.com/questions/1303362/remembering-the-definition-of-the-jacobian-any-tips) looks like?

Additionally, this matrix could be represented as the sum of a diagonal matrix and an antisymmetric matrix; the elements of the $curl$ would then be elements of the antisymmetric matrix, and the $div$ would be the $trace$ of the diagonal matrix.

3. If valid, what are all these three matrices called?

4. Could it be said that $curl\vec{F}$ measures how "unsymmetrical" the matrix $A$ is?
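One way to explore these questions numerically is the standard split of a square matrix into symmetric and antisymmetric parts, $A = S + W$ with $S = \frac{1}{2}(A + A^T)$ and $W = \frac{1}{2}(A - A^T)$. Note that this is a symmetric-plus-antisymmetric split, which is not necessarily the diagonal-plus-antisymmetric split proposed in the question. The sketch below hard-codes $A$ for the hypothetical field $\vec{F} = (xy, yz, zx)$ at the point $(1, 2, 3)$; both the field and the point are assumptions made for illustration.

```python
# Values of A for the hypothetical field F = (x*y, y*z, z*x) at (1, 2, 3).
A = [[2.0, 1.0, 0.0],
     [0.0, 3.0, 2.0],
     [3.0, 0.0, 1.0]]
n = len(A)

# Symmetric part S = (A + A^T) / 2 and antisymmetric part W = (A - A^T) / 2.
S = [[(A[i][j] + A[j][i]) / 2 for j in range(n)] for i in range(n)]
W = [[(A[i][j] - A[j][i]) / 2 for j in range(n)] for i in range(n)]

# The divergence is the trace of A, which equals the trace of S
# because the diagonal of W is zero.
div_from_S = sum(S[i][i] for i in range(n))

# Each curl component is twice one independent entry of W.
curl_from_W = (2 * W[2][1], 2 * W[0][2], 2 * W[1][0])

print(div_from_S, curl_from_W)
```

In this example the curl is zero exactly when $W$ vanishes, that is, when $A$ is symmetric, which is one way to read question 4.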

## Orthonormal basis matrix is the same as Jacobian matrix?

- [From](https://math.stackexchange.com/questions/2988371/orthonormal-basis-matrix-is-the-same-as-jacobian-matrix) : https://math.stackexchange.com/questions/2988371/orthonormal-basis-matrix-is-the-same-as-jacobian-matrix


Given a transformation from $\boldsymbol{x}$ to $\boldsymbol{y}$ defined as $\boldsymbol{y} = (y_{1}, \dots, y_{D})$ where $y_{i}=\boldsymbol{u}_{i}^T(\boldsymbol{x} - \boldsymbol{\mu})$ and $\{\boldsymbol{u}_i\}$ is an orthonormal basis, we have a Jacobian matrix $J$ with elements defined as:
$$J_{ij} = \frac{\partial x_i}{\partial y_j} = U_{ji} \qquad (1)
$$
where $U_{ji}$ are the elements of the matrix $\boldsymbol{U}^T$, which is a matrix of the orthonormal basis vectors $\boldsymbol{u}_i$ as columns.

I don't understand why (1) is true. How come the elements of the Jacobian are equal to the elements of $\boldsymbol{U}^T$?

### Answer

Let's use the notation $\mathbf{u}_i = (U_{i1}, \dots, U_{iD})^T$. Then, if a matrix has the $\mathbf{u}_i$ as columns, its transpose has the $\mathbf{u}_i^T$ as rows and therefore has entries $U_{ij}$.

Writing the expressions for $y_i$ in coordinates, we get:
$$y_i = \mathbf{u}_i^T (\mathbf{x} - \boldsymbol{\mu}) = \sum_j U_{ij}(x_j-\mu_j) = \sum_j U_{ij} x_j - \sum_j U_{ij} \mu_j$$

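I assume $\boldsymbol{\mu}$ is a vector of constants, so the second sum above doesn't contribute to any derivatives. The first sum is just a linear combination of the $x_j$, so the coefficients in the Jacobian are:
$$\frac{\partial y_i}{\partial x_j} = U_{ij}$$

Because the $\mathbf{u}_i$ form an orthonormal basis, the map is inverted by the transpose: $x_i = \mu_i + \sum_j U_{ji} y_j$. Differentiating gives $\frac{\partial x_i}{\partial y_j} = U_{ji}$, which is exactly (1).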
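The result can be checked numerically with a small plain-Python sketch. It builds a 2D orthonormal basis from a rotation, then compares finite-difference Jacobians of the forward map and of its inverse against $U_{ij}$ and $U_{ji}$, the latter being equation (1). The angle `theta`, the offset `mu`, the test point `x0`, and the helper names are all assumptions chosen for illustration.

```python
import math

theta = 0.3  # arbitrary rotation angle, for illustration only
# Rows are the orthonormal basis vectors u_1 and u_2, so U[i][j] = U_ij.
U = [[math.cos(theta), math.sin(theta)],
     [-math.sin(theta), math.cos(theta)]]
mu = [1.0, -2.0]


def forward(x):
    """y_i = u_i^T (x - mu)."""
    return [sum(U[i][j] * (x[j] - mu[j]) for j in range(2)) for i in range(2)]


def inverse(y):
    """x_i = mu_i + sum_j U_ji y_j, valid because the u_i are orthonormal."""
    return [mu[i] + sum(U[j][i] * y[j] for j in range(2)) for i in range(2)]


def jacobian(f, p, h=1e-6):
    """Central-difference approximation of J[i][j] = d f_i / d p_j."""
    n = len(p)
    J = [[0.0] * n for _ in range(n)]
    for j in range(n):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        f_plus, f_minus = f(plus), f(minus)
        for i in range(n):
            J[i][j] = (f_plus[i] - f_minus[i]) / (2 * h)
    return J


x0 = [0.5, 0.25]
dy_dx = jacobian(forward, x0)           # expected to match U
dx_dy = jacobian(inverse, forward(x0))  # expected to match U transposed

print(dy_dx)
print(dx_dy)
```

Since both maps are affine, the finite differences should agree with the entries of $U$ and $U^T$ up to floating-point rounding.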