## Appendix 4: Flux Through Energy Minimization

The following describes one argument for energy minimization for
flux terms. This has been studied extensively by mathematicians through
the so-called "Dirichlet energy". To start, suppose we have a control
volume, $V$, where we are given information about what is happening on
the boundary and we want to find a function, $u$, that interpolates the
data. To spread the energy evenly, we want an interpolating function that
does not change rapidly over the volume; in other words, we want the
function to be "smoothed" out. A way of quantifying this is to
integrate (sum) the squared magnitude of the gradient of $u$ over the
domain; this gives a global quantifier for the "change" in $u$ over $V$.
Therefore, we define the Dirichlet energy, $E$,

$$\ E\left(u\right):=\frac{1}{2}\iiint_V{\left|\nabla u\right| ^2 \, dV}.$$
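To make the definition concrete, the following is a minimal sketch, assuming a one-dimensional analogue of the control volume (the interval $[0,1]$) and a simple forward-difference gradient. The function name `dirichlet_energy_1d` and the two sample profiles are hypothetical choices made for illustration; both profiles take the same values at the two boundary points.

```python
import numpy as np

def dirichlet_energy_1d(u, h):
    """Approximate E(u) = 1/2 * integral of |u'|^2 on a uniform grid of spacing h."""
    du = np.diff(u) / h              # forward-difference gradient on each cell
    return 0.5 * np.sum(du**2) * h   # sum over cells, weighted by the cell width

n = 201
x = np.linspace(0.0, 1.0, n)
h = x[1] - x[0]

smooth = x                                  # straight line from 0 to 1
wiggly = x + 0.1 * np.sin(8 * np.pi * x)    # same end values, rapid oscillation

E_smooth = dirichlet_energy_1d(smooth, h)
E_wiggly = dirichlet_energy_1d(wiggly, h)
print(E_smooth, E_wiggly)
```

The straight line has energy 0.5 (up to rounding), while the oscillating profile has an energy of about 2.08, even though both interpolate the same boundary data.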

Notice that if the function changes rapidly in parts of the control volume,
the gradient will be large there and thus the integral, and with it the energy, will be
large. To get a smoothed-out function, we want the energy to be *minimized*
globally (over the entire volume); that is, the change in $u$
at every point should be as small as possible. Physically, $\nabla u$ is
(up to sign) the generalized force acting on the system while $u$ is the potential
energy. We want to find a potential energy function, $u$, that minimizes
the Dirichlet energy. This entails introducing a "functional
derivative" so that we can speak of minimization. We define the
functional derivative of $E$ at $u$ in the direction $v$ as,

$$\frac{\delta E}{\delta v}:=\lim_{\varepsilon\to0}\frac{E\left(u+\varepsilon v\right)-E\left(u\right)}{\varepsilon}.$$

Notice its resemblance to the directional derivative from vector
calculus,

$$\ \nabla E\cdot \mathbf{v}:=\lim_{\varepsilon\to0}\frac{E\left(\mathbf{u}+\varepsilon \mathbf{v}\right)-E\left(\mathbf{u}\right)}{\varepsilon}.$$

These are interpreted as a measure of how $E$ changes as the input, $u$,
changes by a small amount $\varepsilon v$. Here $u$ and $v$ are any "nice"
functions, but we often choose the variation function, $v$, such that,

$$\|v\|^2:=\iiint_V{v^2 \, dV}=1,$$

which means the "norm" or size of $v$ is 1. Now to minimize the Dirichlet energy, we
take the functional derivative, set it equal to 0, and use
$\left|\nabla u\right|^2=\nabla u\cdot\nabla u$,

$$\frac{\delta E}{\delta v}=\lim_{\varepsilon\to 0}\frac{1}{2\varepsilon}\iiint_V{\Bigl[\nabla \left(u+\varepsilon v\right)\cdot\nabla \left(u+\varepsilon v\right)-\nabla u\cdot\nabla u\Bigr] \, dV}$$

$$\ =\lim_{\varepsilon\to 0}\frac{1}{2\varepsilon}\iiint_V{\Bigl[{{\nabla u}\cdot{\nabla u}}+2\varepsilon{\nabla u}\cdot{\nabla v}+\varepsilon^2{\nabla v}\cdot{\nabla v}-{{\nabla u}\cdot{\nabla u}}\Bigr] \, dV}$$

$$\ =\lim_{\varepsilon\to 0}\frac{1}{2}\iiint_V{\Bigl[ 2{\nabla u}\cdot{\nabla v}+\underbrace{\varepsilon{\nabla v}\cdot{\nabla v}}_{\to\ 0} \Bigr] \, dV}$$

$$\ =\iiint_V{{\nabla u}\cdot{\nabla v} \, dV}.$$ 
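The limit can be checked numerically. The following is a minimal sketch, assuming a one-dimensional domain $[0,1]$ with forward differences; the field `u` and the variation `v` are hypothetical examples, with `v` chosen to vanish at both ends. It compares the difference quotient from the definition against the closed form $\int u'v'\,dx$ for shrinking $\varepsilon$.

```python
import numpy as np

def energy(u, h):
    """Discrete 1D Dirichlet energy: 1/2 * sum of squared slopes times cell width."""
    return 0.5 * np.sum((np.diff(u) / h) ** 2) * h

n = 401
x = np.linspace(0.0, 1.0, n)
h = x[1] - x[0]

u = x**2                 # hypothetical smooth field
v = np.sin(np.pi * x)    # hypothetical variation, zero at both ends

# Closed form from the derivation: integral of u' v' over the domain.
first_variation = np.sum((np.diff(u) / h) * (np.diff(v) / h)) * h

# Difference quotients from the definition of the functional derivative.
quotients = []
for eps in (1e-1, 1e-2, 1e-3):
    quotients.append((energy(u + eps * v, h) - energy(u, h)) / eps)
    print(eps, quotients[-1], first_variation)
```

The gap between the quotient and the closed form is the leftover $\varepsilon$ term from the expansion, so it should shrink in proportion to $\varepsilon$.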

Integration by parts (Green's first identity) gives,

$$\iiint_V{{\nabla u}\cdot{\nabla v} \, dV}=\underbrace{\iint_{\partial V}{v\nabla u \cdot \mathbf{n} \, dA}}_{=\,0}-\iiint_V{v\nabla^2 u \, dV}.$$

Here the boundary term vanishes because we require $v\equiv0$ on
$\partial V$: the values of $u$ on the boundary are prescribed, so no
variation is allowed there. Setting the functional derivative to zero, we now have,

$$\ -\iiint_V{v\nabla^2 u \, dV}=0,$$ 

which, for the equation to hold for every admissible variation $v$, requires (by the fundamental lemma of the calculus of variations) that

$$\ -\nabla^2 u = 0,$$ 

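which is Laplace's equation. This states that
solutions to Laplace's equation, "harmonic" functions, are the
minimizers of the Dirichlet energy among all functions with the same
boundary values. (The stationary point is indeed a minimum: for harmonic
$u$ and any $v$ vanishing on $\partial V$, the expansion above gives
$E\left(u+v\right)=E\left(u\right)+E\left(v\right)\geq E\left(u\right)$.)
Physically, this states that steady-state diffusion settles on the
potential, $u$, with the smallest Dirichlet energy, so that the local
change in $u$ is as small as the boundary data allow. Thus, we now have some justification for why
diffusion and flux capture nature's tendency to minimize energy.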
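The link between diffusion and energy minimization can be observed numerically. The following is a minimal sketch, assuming a two-dimensional square grid, explicit diffusion steps with a hypothetical step factor `alpha = 0.2` (below the explicit stability limit of 0.25 for this stencil), and made-up boundary data with one edge held at 1 and the others at 0. The function names are illustrative only.

```python
import numpy as np

def dirichlet_energy_2d(u):
    """Discrete Dirichlet energy on a uniform square grid.

    In 2D the cell area h^2 cancels the 1/h^2 from the squared differences,
    so the grid spacing drops out.
    """
    dx = np.diff(u, axis=0)
    dy = np.diff(u, axis=1)
    return 0.5 * (np.sum(dx**2) + np.sum(dy**2))

def diffuse(u, steps, alpha=0.2):
    """Apply explicit diffusion steps to interior nodes; boundary values stay fixed."""
    u = u.copy()
    energies = [dirichlet_energy_2d(u)]
    for _ in range(steps):
        # Five-point Laplacian (times h^2) evaluated on the interior nodes.
        lap = (u[2:, 1:-1] + u[:-2, 1:-1] + u[1:-1, 2:] + u[1:-1, :-2]
               - 4.0 * u[1:-1, 1:-1])
        u[1:-1, 1:-1] += alpha * lap
        energies.append(dirichlet_energy_2d(u))
    return u, np.array(energies)

n = 21
u0 = np.zeros((n, n))
u0[0, :] = 1.0   # hypothetical boundary data: one edge at 1, the rest at 0

u, energies = diffuse(u0, steps=2000)
print(energies[0], energies[-1])
```

Under these assumptions each diffusion step is a gradient-descent step on the discrete energy, so `energies` should never increase, and the final field should satisfy the discrete Laplace equation to within a small residual.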

As an aside for those interested in computational work, the integration
by parts step suggests an alternative (weak) formulation of Laplace's
equation: find $u$ such that, for every variation $v$ vanishing on $\partial V$,

$$\iiint_V{{\nabla u}\cdot{\nabla v} \, dV}=0,$$

which requires that $u$ be only once differentiable instead of twice differentiable as
in the classical form of Laplace's equation. This important
observation is most commonly exploited in finite element methods,
which often use piecewise-linear approximations of $u$ to reduce computational
cost. For more information, research the "Galerkin Method" and
"Lagrange Polynomials".
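To illustrate how the weak form is used in practice, the following is a minimal sketch of a Galerkin discretization, assuming a one-dimensional domain, piecewise-linear "hat" basis functions (first-order Lagrange polynomials), and hypothetical boundary values. The function name `solve_laplace_fem_1d` and the mesh are invented for this example and do not come from any particular finite element library.

```python
import numpy as np

def solve_laplace_fem_1d(nodes, u_left, u_right):
    """Galerkin solve of the weak form  integral(u' v') dx = 0  with linear elements."""
    n = len(nodes)
    K = np.zeros((n, n))
    for e in range(n - 1):
        h = nodes[e + 1] - nodes[e]
        # Element stiffness: integrals of products of the two hat-function
        # slopes (+1/h and -1/h) over an element of length h.
        K[e:e + 2, e:e + 2] += np.array([[1.0, -1.0], [-1.0, 1.0]]) / h

    u = np.zeros(n)
    u[0], u[-1] = u_left, u_right
    interior = np.arange(1, n - 1)
    # Move the known boundary values to the right-hand side.
    rhs = -K[np.ix_(interior, [0, n - 1])] @ np.array([u_left, u_right])
    u[interior] = np.linalg.solve(K[np.ix_(interior, interior)], rhs)
    return u

nodes = np.array([0.0, 0.1, 0.25, 0.5, 0.8, 1.0])   # deliberately non-uniform mesh
u = solve_laplace_fem_1d(nodes, u_left=2.0, u_right=5.0)
print(u)
```

In one dimension the harmonic functions are straight lines, so the nodal values should match the linear interpolant $2+3x$ at every node. Only first derivatives of the basis functions appear in the assembly, which is why piecewise-linear approximations suffice.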