![](sethna_title.png)

```python
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
```

## Fitting the model to data

We have experimental data $\mathbf{x}$ and we want to infer the parameters $\mathbf{p}$ of a model:

$$ \frac{d\mathbf{x}}{dt} = g(\mathbf{x} ; \mathbf{p}) + \epsilon$$

where $g$ is some function and $\epsilon$ is some random noise.

We can define a cost function of the parameters $C(\mathbf{p}\ | \ \mathbf{x})$ and pick the parameters that minimize the cost function:

$$\mathbf{p^{*}} = \text{argmin}_{\mathbf{p}}\ C(\mathbf{p}\ |\ \mathbf{x})$$
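
As a minimal sketch of this recipe, assume a hypothetical decay model $dx/dt = -p_1 x$ with initial value $x(0) = p_0$, a sum-of-squares cost, and a brute-force grid search standing in for a real optimizer. The names `model` and `cost`, the synthetic data, and the grid ranges are all illustrative assumptions rather than anything prescribed above.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "experimental" data from the assumed decay model
# dx/dt = -p1 * x, x(0) = p0, whose solution is x(t) = p0 * exp(-p1 * t).
t = np.linspace(0.0, 5.0, 50)
p_true = np.array([2.0, 0.7])
x_data = p_true[0] * np.exp(-p_true[1] * t) + 0.02 * rng.standard_normal(t.size)

def model(p, t):
    """Closed-form trajectory of the assumed decay model."""
    return p[0] * np.exp(-p[1] * t)

def cost(p, t, x):
    """Sum-of-squares cost C(p | x); one common choice, assumed here."""
    return 0.5 * np.sum((model(p, t) - x) ** 2)

# Brute-force argmin over a parameter grid (a stand-in for a real optimizer).
p0_grid = np.linspace(1.0, 3.0, 201)
p1_grid = np.linspace(0.1, 1.5, 281)
costs = np.array([[cost((a, b), t, x_data) for b in p1_grid] for a in p0_grid])
i, j = np.unravel_index(np.argmin(costs), costs.shape)
p_star = np.array([p0_grid[i], p1_grid[j]])
print("best-fit parameters:", p_star)
```

A grid search scales poorly beyond a handful of parameters; it is used here only because it makes the argmin explicit.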

## Probing the fit by exploring the cost manifold

Recall from elementary calculus that at any local minimum $\mathbf{p^*}$ of a smooth cost function, every first partial derivative vanishes:

$$\frac{\partial C}{\partial p_i}\bigg|_{\mathbf{p^*}}  = 0$$

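To see how the cost changes when a single parameter $p_i$ is shifted by a small amount $\delta p_i$ with all the others held fixed, we can Taylor-expand around $\mathbf{p^*}$. The first-order term drops out because the gradient vanishes at the minimum, so the leading correction is quadratic:

$$C(\mathbf{p^*} + \delta p_i\, \hat{\mathbf{e}}_i) = C(\mathbf{p^*}) + \frac{1}{2} \frac{\partial^2 C}{\partial p_i^2}\bigg|_{\mathbf{p^*}} \delta p_i^2 + \mathcal{O}(\delta p_i^3) $$

where $\hat{\mathbf{e}}_i$ is the unit vector along the $i$-th parameter axis.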

The larger this second derivative, the more sharply the cost rises as $p_i$ moves away from its best-fit value.
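
The curvature along each parameter can be estimated numerically with a central second difference. The sketch below assumes a hypothetical two-parameter cost whose minimum is known to sit at $\mathbf{p^*} = (1, 2)$; the function names and the step size `h` are illustrative choices.

```python
import numpy as np

def cost(p):
    """Hypothetical cost with its minimum at p* = (1, 2):
    sharply curved in p[0], nearly flat in p[1]."""
    return 50.0 * (p[0] - 1.0) ** 2 + 0.5 * (p[1] - 2.0) ** 2

def second_derivative(f, p_star, i, h=1e-3):
    """Central-difference estimate of d^2 f / dp_i^2 at p_star."""
    step = np.zeros_like(p_star)
    step[i] = h
    return (f(p_star + step) - 2.0 * f(p_star) + f(p_star - step)) / h**2

p_star = np.array([1.0, 2.0])
curvatures = [second_derivative(cost, p_star, i) for i in range(p_star.size)]
print(curvatures)  # roughly [100, 1] for this cost

# Quadratic prediction of the cost increase for a small step in each parameter.
dp = 0.1
predicted = [0.5 * c * dp**2 for c in curvatures]
actual = [cost(p_star + dp * np.eye(2)[i]) - cost(p_star) for i in range(2)]
print(predicted, actual)
```

Because this toy cost is exactly quadratic, the predicted and actual increases agree; for a real cost function they would differ at third order in the step.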

## Examples

Here are three surfaces with varying degrees of curvature.

![Three surfaces with varying degrees of curvature](three-surfaces.png)

And here is one that is sharply curved in the direction of one coordinate but almost flat in the other.

![A surface that is sharply curved in one direction but almost flat in the other](asymmetric_surface.png)

## Generalization

The single-parameter curvature generalizes to $N$ parameters through the *Hessian matrix*, the $N \times N$ matrix of second partial derivatives of the cost evaluated at the minimum, with elements

$$H_{ij} = \frac{\partial^2 C(\mathbf{p})}{\partial p_i \partial p_j} \bigg|_{\mathbf{p}^*}$$
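
When analytic derivatives are not available, the Hessian can be approximated by finite differences. The following is a sketch under stated assumptions: the `cost` function is a hypothetical quadratic with coupled parameters and a minimum at the origin, and `hessian` is an illustrative helper, not a library routine.

```python
import numpy as np

def cost(p):
    """Hypothetical cost with coupled parameters and a minimum at the origin."""
    return 2.0 * p[0] ** 2 + 1.5 * p[0] * p[1] + 0.5 * p[1] ** 2

def hessian(f, p_star, h=1e-4):
    """Central-difference estimate of H_ij = d^2 f / dp_i dp_j at p_star."""
    n = p_star.size
    H = np.zeros((n, n))
    eye = np.eye(n)
    for i in range(n):
        for j in range(n):
            ei, ej = h * eye[i], h * eye[j]
            H[i, j] = (f(p_star + ei + ej) - f(p_star + ei - ej)
                       - f(p_star - ei + ej) + f(p_star - ei - ej)) / (4.0 * h**2)
    return H

p_star = np.zeros(2)
H = hessian(cost, p_star)
print(H)  # close to [[4, 1.5], [1.5, 1]] for this cost
```

This needs four cost evaluations per matrix entry, so it is practical only for small $N$ or cheap cost functions.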

## Combinations of parameters

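There is no reason to expect the parameters we chose to act independently on the cost: in general the off-diagonal entries of the Hessian are nonzero, so changing one parameter alters how the cost responds to another. Because the Hessian is symmetric, however, its eigenvectors can always be chosen to be mutually orthogonal. Each eigenvector is a *combination* of parameters, and its eigenvalue is the curvature of the cost along that combination, so together they quantify how sensitive the fit is to variations in combinations of parameters.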
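
A short NumPy sketch of this idea, assuming a hypothetical $2 \times 2$ Hessian at the best fit (the numbers are made up for illustration). `np.linalg.eigh` is used because it is designed for symmetric matrices.

```python
import numpy as np

# Hypothetical Hessian at the best fit; the nonzero off-diagonal entries
# mean the two parameters trade off against each other.
H = np.array([[4.0, 1.5],
              [1.5, 1.0]])

# eigh returns eigenvalues in ascending order; the columns of `vecs`
# are the corresponding orthonormal eigenvectors.
vals, vecs = np.linalg.eigh(H)

flat_direction = vecs[:, 0]    # smallest curvature: cost barely changes
sharp_direction = vecs[:, -1]  # largest curvature: cost changes fastest

# In the quadratic approximation, a unit step along an eigenvector
# raises the cost by 0.5 * (its eigenvalue).
for lam, v in zip(vals, vecs.T):
    print(f"curvature {lam:.3f} along parameter combination {v}")
```

The ratio of the largest to the smallest eigenvalue gives a rough measure of how unevenly the data constrain different parameter combinations.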