# Lecture 6: Review of classical statistical mechanics

## Physics 7810, Spring 2020

## 6.1 - Phase space and ensembles

In classical statistical mechanics, the state of an $N$-particle system is specified (in 3 dimensions) by the set of $3N$ coordinates ${\bf r}^{N}$ and $3N$ momenta ${\bf p}^{N}$, which together define a point $\Gamma = ({\bf p}^{N},{\bf r}^{N})$ in the $6N$-dimensional *phase space*.

We will consider several thermodynamic ensembles that are commonly used in statistical physics and computer simulations:
* constant-$NVE$, or *microcanonical* ensemble
* constant-$NVT$, or *canonical* ensemble
* constant-$NPT$, or *isothermal-isobaric* ensemble
* constant-$\mu VT$, or *grand canonical* ensemble

Note that the thermodynamic variables held constant in the microcanonical ensemble are all extensive variables, while the other ensembles constrain a combination of extensive and intensive variables.

## 6.2 - Microcanonical ensemble

The natural statistical mechanical ensemble for molecular dynamics (MD) simulation is the one we have already encountered in the context of Hamiltonian dynamics, namely the $NVE$ (*microcanonical*) ensemble.

The probability density for the microcanonical ensemble is proportional to

$$
\delta\left[H( \Gamma) - E \right],
$$

where $\delta$ is the Dirac delta function, $H(\Gamma)$ is the Hamiltonian, and $E$ is the energy. This probability density selects those points in phase space having energy $E$, and gives all such states equal weight ('phase space democracy').

The multiplicity $\Omega(N,V,E)$ is the sum of this probability density over all states. In the classical limit ($h \rightarrow 0$, where $h$ is Planck's constant) this becomes

$$
\Omega(N,V,E) = \frac{1}{N!} \frac{1}{h^{3N}} \int d{\bf p}^N d{\bf r}^N \delta\left[H({\bf p}^N,{\bf r}^N) - E \right],
$$

where the factor of $1/N!$ accounts for indistinguishability of particles, and the factor $h^{3N}$ is a measure of the phase space volume per quantum state.

The entropy $S(N,V,E)$ is proportional to the logarithm of the multiplicity,

$$
S(N,V,E) = k_B \ln \Omega(N,V,E).
$$

The entropy plays the role of a *thermodynamic potential* for the microcanonical ensemble, which is *maximized* in thermal equilibrium (second law of thermodynamics), and $\Omega(N,V,E)$ is the *partition function* for this ensemble.

A partition function is a *normalization factor* that transforms relative probability densities into absolute probability densities.

In a system of particles with no external forces, Hamilton's equations of motion also conserve the total linear momentum ${\bf P}$, so MD simulations typically sample a subset of the microcanonical ensemble, the $NVE{\bf P}$ ensemble.

The total angular momentum ${\bf L}$ is not conserved in most MD simulations, e.g., MD simulations of systems of particles with periodic boundary conditions.

## 6.3 - Canonical ensemble

The probability density for the $NVT$ (*canonical*) ensemble is proportional to

$$
\exp \left[- \beta H( \Gamma) \right],
$$

where $\beta = (k_B T)^{-1}$, and $T$ is the absolute temperature. Note that all energies are now allowed, but aren't weighted in a 'democratic' way (lower energy states are more probable).

In the classical limit, the canonical partition function $Z(N,V,T)$ is

$$
Z(N,V,T) = \frac{1}{N!} \frac{1}{h^{3N}} \int d{\bf p}^N d{\bf r}^N \exp \left[- \beta H({\bf p}^N,{\bf r}^N) \right],
$$

where the factor of $1/N!$ accounts for indistinguishability of particles, and the factor $h^{3N}$ is a measure of the phase space volume per quantum state.

The thermodynamic potential for the canonical ensemble is the *Helmholtz free energy* $A(N,V,T)$, defined as

$$
A(N,V,T) = - k_B T \ln Z(N,V,T),
$$

which is *minimized* in thermal equilibrium.

MD simulations don't naturally sample the $NVT$ ensemble, but there are variants of the MD method that do, which we'll discuss later. *Monte Carlo methods* can also be used to sample states in a way consistent with the canonical ensemble, as we'll see shortly.

For most classical systems of interest, the Hamiltonian can be written in the form $H({\bf p}^N,{\bf r}^N) = K({\bf p}^N) + U({\bf r}^N)$, where the kinetic energy $K$ depends only on the momenta ${\bf p}^N$ and the potential energy $U$ depends only on the coordinates ${\bf r}^N$.

In this case, the total canonical partition function factors into *ideal gas* and *interaction* contributions,

$$
Z(N,V,T) = \frac{1}{N!} \frac{1}{h^{3N}} \int d{\bf p}^N \exp \left[- \beta K({\bf p}^N) \right]
\int d{\bf r}^N \exp \left[- \beta U({\bf r}^N) \right]
$$

$$
= Z^\mathrm{id}(N,V,T)\ Z^\mathrm{int}(N,V,T).
$$

Here, the (monatomic) *ideal gas* partition function is (after some algebra involving Gaussian integrals)

$$
Z^\mathrm{id}(N,V,T) = \frac{(V / \Lambda^3)^N}{N!},
$$

and the *interaction* or *configurational* partition function is

$$
Z^\mathrm{int}(N,V,T) = \frac{1}{V^N} \int d{\bf r}^N e^{- \beta U({\bf r}^N)},
$$

where $V$ is the volume and $\Lambda$ is the *thermal de Broglie wavelength*,

$$
\Lambda = \left( \frac{h^2}{2 \pi m k_B T} \right)^{1/2}.
$$

Here, $m$ is the particle mass. Quantum effects can be neglected for $\Lambda \ll a$, where $a = (V/N)^{1/3}$ is the characteristic interparticle spacing.
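To get a feel for the criterion $\Lambda \ll a$, here is a minimal sketch that evaluates both lengths for one state point. The choice of argon at 300 K and 1 atm (with the density taken from the ideal gas law) is purely an illustrative assumption, and the function names are hypothetical.

```python
import math

# h and k_B are exact in the 2019 SI; the atomic mass unit is the CODATA 2018 value.
H_PLANCK = 6.62607015e-34   # J s
K_B = 1.380649e-23          # J / K
AMU = 1.66053906660e-27     # kg


def thermal_wavelength(mass_kg, temperature_k):
    """Thermal de Broglie wavelength, Lambda = h / sqrt(2 pi m k_B T), in metres."""
    return H_PLANCK / math.sqrt(2.0 * math.pi * mass_kg * K_B * temperature_k)


def mean_spacing(number_density):
    """Characteristic interparticle spacing a = (V/N)^(1/3), in metres."""
    return number_density ** (-1.0 / 3.0)


# Assumed state point: argon gas at 300 K and 1 atm, ideal-gas density N/V = P / (k_B T).
T = 300.0
m_ar = 39.948 * AMU
rho = 101325.0 / (K_B * T)

lam = thermal_wavelength(m_ar, T)
a = mean_spacing(rho)
print(f"Lambda = {lam:.3e} m, a = {a:.3e} m, Lambda/a = {lam / a:.3e}")
```

For this state point the ratio $\Lambda / a$ comes out far below 1, so a classical treatment is reasonable. Light particles (small $m$) or low temperatures push $\Lambda$ up and can invalidate it.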

For interacting classical systems, the calculation of thermodynamic properties reduces to computation of the configurational partition function $Z^\mathrm{int}$, which is a $3N$-dimensional integral.

This is generally impossible to do directly, but it's possible to compute thermodynamic properties in the canonical ensemble using *Markov chain Monte Carlo* methods (*importance sampling*) or appropriately designed MD algorithms.

We'll learn how to do this soon.

## 6.4 - Digression on Gaussian integrals

Consider the integral
 
$$
I = \int_{- \infty}^\infty dx\ e^{- \alpha x^2}.
$$ 
 
We can write
 
$$
I^2 = \left( \int_{- \infty}^\infty dx\ e^{- \alpha x^2} \right) \left( \int_{- \infty}^\infty dy\ e^{- \alpha y^2} \right) 
= \int_{- \infty}^\infty dx \int_{- \infty}^\infty dy\ e^{- \alpha (x^2 + y^2)}
$$

$$
= \int_0^\infty dr\ r \int_0^{2 \pi} d\phi \ e^{- \alpha r^2} = 2 \pi \int_0^\infty dr\ r \ e^{- \alpha r^2}
$$

$$
= \frac{\pi}{\alpha} \int_0^\infty dz\ e^{-z} =  \frac{\pi}{\alpha} \left. \left( - e^{-z} \right) \right\rvert_0^\infty = \frac{\pi}{\alpha},
$$

so
$$
I = \int_{- \infty}^\infty dx\ e^{- \alpha x^2} = \sqrt{ \frac{\pi}{\alpha} }.
$$
 
From this we can immediately obtain
$$
\int_{- \infty}^\infty dx\ x^2 e^{- \alpha x^2} = - \frac{\partial}{\partial \alpha} \int_{- \infty}^\infty dx\ e^{- \alpha x^2}
= - \frac{\partial}{\partial \alpha} \sqrt{ \frac{\pi}{\alpha} } = \frac{1}{2} \sqrt{ \frac{\pi}{\alpha^3} }.
$$
 
The integral $\int_{- \infty}^{\infty} dx\ x^n e^{- \alpha x^2}$ for any even $n$ can be obtained by iterating this procedure. Odd $n$ is even easier: the integrand is then an odd function of $x$, so the integral vanishes by symmetry.
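These results are easy to check numerically. The sketch below compares a simple midpoint-rule quadrature against the closed form obtained by repeated differentiation with respect to $\alpha$, namely $(n-1)!!\,(2\alpha)^{-n/2}\sqrt{\pi/\alpha}$ for even $n$ and zero for odd $n$. The function names, the truncation range, and the value of $\alpha$ are illustrative assumptions.

```python
import math


def gaussian_moment_numeric(n, alpha, half_width=12.0, steps=20_000):
    """Midpoint-rule estimate of the integral of x^n exp(-alpha x^2) over the real line.

    The range is truncated at +/- half_width / sqrt(alpha), beyond which the
    integrand is negligible.
    """
    x_max = half_width / math.sqrt(alpha)
    dx = 2.0 * x_max / steps
    total = 0.0
    for k in range(steps):
        x = -x_max + (k + 0.5) * dx
        total += x**n * math.exp(-alpha * x * x)
    return total * dx


def gaussian_moment_exact(n, alpha):
    """Closed form: 0 for odd n, (n-1)!! / (2 alpha)^(n/2) * sqrt(pi / alpha) for even n."""
    if n % 2 == 1:
        return 0.0
    double_factorial = math.prod(range(n - 1, 0, -2))  # (n-1)!!, equal to 1 for n = 0
    return double_factorial / (2.0 * alpha) ** (n // 2) * math.sqrt(math.pi / alpha)


alpha = 1.7  # arbitrary test value
for n in range(5):
    numeric = gaussian_moment_numeric(n, alpha)
    exact = gaussian_moment_exact(n, alpha)
    print(f"n = {n}: numeric = {numeric:.10f}, exact = {exact:.10f}")
```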


## 6.5 - Classical equipartition theorem

Let $x_\alpha$ be either a coordinate $q_\alpha$ or a momentum component $p_\alpha$, and assume that the Hamiltonian has a quadratic dependence on $x_\alpha$:

$$
H(x^{6N}) = a x_\alpha^2 + H^\prime(x_{\beta \neq \alpha}^{6N-1}).
$$
 
The contribution of $x_\alpha$ to the average energy is thus

$$
\langle E_\alpha \rangle = \langle a x_\alpha^2 \rangle
= \frac{\int dx^{6N} a x_\alpha^2\ e^{- \beta H(x^{6N})}}{\int dx^{6N} e^{- \beta H(x^{6N})}}
= \frac{a \int_{-\infty}^{\infty} dx_\alpha x_\alpha^2\ e^{- \beta a x_\alpha^2}}{\int_{-\infty}^{\infty} dx_\alpha e^{- \beta a x_\alpha^2}}
$$
$$
= a \frac{1}{2} \sqrt{ \frac{\pi}{(\beta a)^3} } \sqrt{ \frac{(\beta a)}{\pi} } = \frac{1}{2 \beta} = \frac{1}{2} k_B T
$$

which is the classical equipartition theorem.

Note that we've assumed *nothing* about the dependence of $H^\prime$ on the remaining $6N - 1$ degrees of freedom; it need not be quadratic.
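As a quick sanity check, we can sample a single quadratic degree of freedom from its Boltzmann distribution and average its energy. Since $e^{-\beta a x^2}$ is a Gaussian with variance $k_B T / (2a)$, direct Gaussian sampling is enough here; no Monte Carlo machinery is needed. This is only a sketch, and the function name, sample size, and parameter values are assumptions chosen for illustration.

```python
import math
import random


def mean_quadratic_energy(a, kT, n_samples=200_000, seed=0):
    """Estimate <a x^2> for x drawn from the Boltzmann weight exp(-a x^2 / kT)."""
    rng = random.Random(seed)
    sigma = math.sqrt(kT / (2.0 * a))  # standard deviation of the Gaussian weight
    total = 0.0
    for _ in range(n_samples):
        x = rng.gauss(0.0, sigma)
        total += a * x * x
    return total / n_samples


kT = 1.5  # arbitrary units
for a in (0.5, 3.0):
    ratio = mean_quadratic_energy(a, kT) / kT
    print(f"a = {a}: <a x^2> / kT = {ratio:.4f}")
```

Within statistical error the ratio should be close to $1/2$ regardless of the stiffness $a$, as the theorem predicts.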

## 6.6 - Generalized equipartition theorem

For classical systems, we can prove a more general result, the *generalized equipartition theorem*:

$$
\left\langle x_\alpha \frac{\partial H}{\partial x_\alpha} \right\rangle = k_B T.
$$
 
Consider the integral

$$
I_\alpha = \int dx_\alpha e^{-\beta H(x^{6N})}.
$$

This can be integrated by parts to obtain:

$$
I_\alpha = {\left. x_\alpha e^{-\beta H(x^{6N})} \right\rvert_{x_\alpha^-}^{x_\alpha^+} }
+ \beta \int dx_\alpha\ x_\alpha \frac{\partial H}{\partial x_\alpha} e^{-\beta H(x^{6N})}
= \beta \int dx_\alpha\ x_\alpha \frac{\partial H}{\partial x_\alpha} e^{-\beta H(x^{6N})}.
$$

Note that the integrated term is assumed to vanish at the limits of integration.

Thus,

$$
\frac{I_\alpha}{I_\alpha} = 1 = \frac{\beta \int dx_\alpha\ x_\alpha \frac{\partial H}{\partial x_\alpha} e^{-\beta H(x^{6N})}}{\int dx_\alpha e^{-\beta H(x^{6N})}}
$$

$$
= \frac{\beta \int dx^{6N}\ x_\alpha \frac{\partial H}{\partial x_\alpha} e^{-\beta H(x^{6N})}}{\int dx^{6N} e^{-\beta H(x^{6N})}}
= \beta \left\langle x_\alpha \frac{\partial H}{\partial x_\alpha} \right\rangle
$$

or

$$
\left\langle x_\alpha \frac{\partial H}{\partial x_\alpha} \right\rangle = \frac{1}{\beta} = k_B T.
$$

This can be used to derive many useful results (e.g., the virial theorem).
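The generalized theorem also covers non-quadratic terms. As a sketch, take a single degree of freedom with the (assumed, purely illustrative) quartic energy $H = c x^4$. Then $x\, \partial H / \partial x = 4 c x^4$, so the theorem predicts $\langle c x^4 \rangle = k_B T / 4$ rather than $k_B T / 2$. The code below checks this by direct numerical quadrature; all names and parameter values are hypothetical.

```python
import math


def boltzmann_average(observable, energy, kT, x_max, steps=20_000):
    """<observable> under the weight exp(-energy(x) / kT), by midpoint quadrature on [-x_max, x_max]."""
    dx = 2.0 * x_max / steps
    numerator = 0.0
    denominator = 0.0
    for k in range(steps):
        x = -x_max + (k + 0.5) * dx
        weight = math.exp(-energy(x) / kT)
        numerator += observable(x) * weight
        denominator += weight
    return numerator / denominator


c, kT = 2.0, 0.8  # arbitrary illustrative values


def quartic_energy(x):
    return c * x**4


def x_dH_dx(x):
    # x * dH/dx for H = c x^4
    return x * (4.0 * c * x**3)


avg = boltzmann_average(x_dH_dx, quartic_energy, kT, x_max=5.0)
print(f"<x dH/dx> = {avg:.8f}, kT = {kT}")
```

The integration range is wide enough that the Boltzmann weight is negligible at $\pm x_{\max}$, which is the numerical counterpart of assuming the boundary term vanishes.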


## 6.7 - Virial theorem

We now choose Cartesian coordinates and make use of Hamilton's equations of motion and the generalized equipartition theorem to obtain:

$$
\sum_{i=1}^N \sum_{\alpha=1}^3 \left\langle r_{i \alpha} \frac{\partial H}{\partial r_{i \alpha}} \right\rangle
= 3 N k_B T
= \sum_{i=1}^N \sum_{\alpha=1}^3 \left\langle r_{i \alpha} \frac{\partial U}{\partial r_{i \alpha}} \right\rangle
$$
$$
= - \sum_{i=1}^N \sum_{\alpha=1}^3 \left\langle r_{i \alpha} f_{i \alpha}^{\rm tot} \right\rangle
= - \left\langle \sum_{i=1}^N {\bf r}_{i} \cdot {\bf f}_{i}^{\rm tot} \right\rangle,
$$

where the total force ${\bf f}_i^{\rm tot}$ is the sum of intermolecular forces ${\bf f}_i^{\rm int}$ and external forces ${\bf f}_i^{\rm ext}$. This can be rearranged to obtain

$$
\frac{1}{3} \left\langle \sum_{i=1}^N {\bf r}_{i} \cdot {\bf f}_{i}^{\rm tot} \right\rangle
= \frac{1}{3} \left\langle \sum_{i=1}^N {\bf r}_{i} \cdot {\bf f}_{i}^{\rm ext} \right\rangle
+ \frac{1}{3} \left\langle \sum_{i=1}^N {\bf r}_{i} \cdot {\bf f}_{i}^{\rm int} \right\rangle
= - N k_B T.
$$


If the external forces are exerted by the container walls, then the force exerted on the system by a surface element $dS$ located at ${\bf r}$ is $- P {\bf n} dS$, where ${\bf n}$ is a unit vector normal to the surface and directed outwards, and its contribution to the average $\langle \sum_{i=1}^N {\bf r}_{i} \cdot {\bf f}_{i}^{\rm ext} \rangle$ is $-P {\bf r} \cdot {\bf n} dS$. Integrating over the surface and applying the divergence theorem then gives

$$
\left\langle \sum_{i=1}^N {\bf r}_{i} \cdot {\bf f}_{i}^{\rm ext} \right\rangle = - P \int {\bf r} \cdot {\bf n} dS
= - P \int \nabla \cdot {\bf r} dV = - 3 P V,
$$

so

$$
\frac{1}{3} \left\langle \sum_{i=1}^N {\bf r}_i \cdot {\bf f}_i^{\rm ext} \right\rangle
= - PV.
$$


We now define the *internal virial* $W$ in terms of the intermolecular potential energy and forces:

$$
- \frac{1}{3} \sum_{i=1}^N {\bf r}_i \cdot \nabla_{{\bf r}_i} U^{\rm int}
= \frac{1}{3} \sum_{i=1}^N {\bf r}_i \cdot {\bf f}_i^{\rm int}
= W.
$$

This leads to the following virial expression for the pressure,

$$
PV = N k_B T + \langle W \rangle,
$$

or

$$
P = \frac{N k_B T}{V} + \frac{1}{V} \langle W \rangle = P^\mathrm{id} + P^\mathrm{int}.
$$



## 6.8 - Virial theorem another way

We can derive the same result in another way, via the thermodynamic identity

$$
P = - \left( \frac{\partial A}{\partial V} \right)_{N,T},
$$

where $A(N,V,T)$ is the Helmholtz free energy (the thermodynamic potential for the canonical ensemble). This derivation will introduce the concept of *scaled coordinates*, which we'll make extensive use of later. Now recall that $A$ is defined in statistical mechanics by

$$
A(N,V,T) = - \frac{1}{\beta} \ln Z(N,V,T),
$$

where $\beta = (k_B T)^{-1}$ and $Z$ is the canonical partition function. Thus,

$$
P = \frac{1}{\beta} \frac{\partial \ln Z}{\partial V} = \frac{1}{\beta} \frac{1}{Z} \frac{\partial Z}{\partial V}.
$$

Now recall that
$$
Z(N,V,T) = Z^\mathrm{id}(N,V,T)\ Z^\mathrm{int}(N,V,T),
$$

where
$$
Z^\mathrm{id}(N,V,T) = \frac{(V / \Lambda^3)^N}{N!} = V^N \left( \frac{\Lambda^{-3N}}{N!} \right),
$$

and
$$
Z^\mathrm{int}(N,V,T) = \frac{1}{V^N} \int d{\bf r}^N e^{- \beta U({\bf r}^N)},
$$

so
$$
P = \frac{1}{\beta} \frac{\partial \ln Z}{\partial V}
= \frac{1}{\beta} \frac{1}{Z^\mathrm{id}} \frac{\partial Z^\mathrm{id}}{\partial V}
+ \frac{1}{\beta} \frac{1}{Z^\mathrm{int}} \frac{\partial Z^\mathrm{int}}{\partial V}
\equiv P^\mathrm{id} + P^\mathrm{int} 
$$

Here

$$
P^\mathrm{id} = \frac{1}{\beta} \frac{1}{Z^\mathrm{id}} \frac{\partial Z^\mathrm{id}}{\partial V}
= \frac{1}{\beta} \frac{1}{V^N} \frac{\partial V^N}{\partial V} = \frac{N}{\beta V} = \frac{N k_B T}{V},
$$

which is just the ideal gas equation of state. The interaction contribution to the pressure is

$$
P^\mathrm{int} = \frac{1}{\beta} \frac{1}{Z^\mathrm{int}} \frac{\partial Z^\mathrm{int}}{\partial V},
$$

which can be evaluated by introducing *scaled coordinates* defined (for a cubic box of side $L = V^{1/3}$) via $r_{i \alpha} = L s_{i \alpha}$, where $r_{i \alpha}$ is the $\alpha$th Cartesian component of the position of particle $i$. If $-L/2 < r_{i \alpha} \leq L/2$, then the scaled coordinates lie within a unit cube: $-1/2 < s_{i \alpha} \leq 1/2$.

A similar scaling transformation can be employed for a unit cell of arbitrary shape, not just a cubic cell. This will come in handy later, for applying periodic boundary conditions or in the implementation of fast cell search algorithms.
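For a cubic box, the scaling transformation is a one-liner in each direction. The sketch below also folds a scaled coordinate back into the unit cell $-1/2 < s \leq 1/2$, which is one possible way to handle particles that have drifted outside the box. The function names are hypothetical, and a non-cubic cell would need a matrix transformation instead of a single length.

```python
import math


def to_scaled(r, box_length):
    """Scaled coordinate s = r / L for one Cartesian component."""
    return r / box_length


def to_cartesian(s, box_length):
    """Inverse map r = L s."""
    return box_length * s


def wrap_scaled(s):
    """Fold s into the unit cell, -1/2 < s <= 1/2, by subtracting an integer."""
    return s - math.ceil(s - 0.5)


box = 10.0
for r in (4.0, 5.0, 6.0, -5.0, -17.0):
    s = wrap_scaled(to_scaled(r, box))
    print(f"r = {r:6.1f} -> s = {s:+.3f} -> wrapped r = {to_cartesian(s, box):+.3f}")
```

Using `math.ceil(s - 0.5)` rather than rounding keeps the upper edge $s = 1/2$ inside the cell and maps $s = -1/2$ onto it, matching the half-open interval used above.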

In scaled coordinates, the interaction partition function can be written

$$
Z^\mathrm{int}(N,V,T) = \frac{1}{V^N} \int d{\bf r}^N e^{- \beta U({\bf r}^N)}
= \int d{\bf s}^N e^{- \beta U({\bf s}^N;L)} = \int d{\bf s}^N e^{- \beta U({\bf s}^N;V^{1/3})}
$$

where the dependence on $V$ in the Boltzmann factor comes in *implicitly* through $r_{i \alpha} = L s_{i \alpha} = V^{1/3} s_{i \alpha}$. Thus,

$$
\frac{\partial Z^\mathrm{int}}{\partial V} = \int d{\bf s}^N \left[ - \beta \frac{\partial U}{\partial V} \right] e^{- \beta U}
= \beta \int d{\bf s}^N \left[ - \sum_{i = 1}^N \sum_{\alpha = 1}^3
\frac{\partial U}{\partial r_{i \alpha}} \frac{\partial r_{i \alpha}}{\partial V} \right] e^{- \beta U}
$$
$$
= \beta \int d{\bf s}^N \left[ \sum_{i = 1}^N \sum_{\alpha = 1}^3
f_{i \alpha} \frac{1}{3} V^{-2/3} s_{i \alpha} \right] e^{- \beta U}
= \beta \int d{\bf s}^N \left[ \frac{1}{3 V} \sum_{i = 1}^N \sum_{\alpha = 1}^3
f_{i \alpha} r_{i \alpha} \right] e^{- \beta U}
$$
$$
= \frac{\beta}{V} \frac{1}{V^N} \int d{\bf r}^N \left[\frac{1}{3} \sum_{i = 1}^N
{\bf f}_i \cdot {\bf r}_i \right] e^{- \beta U}
= \frac{\beta}{V} \frac{1}{V^N} \int d{\bf r}^N W e^{- \beta U},
$$

where $W = (1/3) \sum_{i = 1}^N {\bf r}_i \cdot {\bf f}_i$ is the virial defined above.

Thus,

$$
P^\mathrm{int} = \frac{1}{\beta} \frac{1}{Z^\mathrm{int}} \frac{\partial Z^\mathrm{int}}{\partial V}
= \frac{1}{V} \frac{1}{Z^\mathrm{int}} \frac{1}{V^N} \int d{\bf r}^N W e^{- \beta U}
= \frac{1}{V} \int d{\bf r}^N W \rho({\bf r}^N) = \frac{1}{V} \langle W \rangle
$$

where $\rho({\bf r}^N) = e^{- \beta U({\bf r}^N)} / (V^N Z^\mathrm{int})$ is the normalized configuration-space probability density in the canonical ensemble, $\int d{\bf r}^N \rho({\bf r}^N) = 1$.

Thus the total pressure is

$$
P = P^\mathrm{id} + P^\mathrm{int} = \frac{N k_B T}{V} + \frac{1}{V} \langle W \rangle,
$$

as shown earlier.

## 6.9 - Virial for pairwise interactions

The virial $W$ defined above depends on the origin of the coordinate system. For systems of particles interacting through pair potentials, $W$ can be expressed in a form independent of the choice of origin, by writing ${\bf f}_i$ as the sum of forces ${\bf f}_{ij}$ on particle $i$ due to particle $j$,

$$
\sum_{i = 1}^N {\bf r}_i \cdot {\bf f}_i = \sum_{i = 1}^N \sum_{j \neq i} {\bf r}_i \cdot {\bf f}_{ij}
= \frac{1}{2} \sum_{i = 1}^N \sum_{j \neq i} \left( {\bf r}_i \cdot {\bf f}_{ij} + {\bf r}_j \cdot {\bf f}_{ji} \right).
$$

But ${\bf f}_{ji} = - {\bf f}_{ij}$ (Newton's third law), so
$$
\sum_{i = 1}^N {\bf r}_i \cdot {\bf f}_i
= \frac{1}{2} \sum_{i = 1}^N \sum_{j \neq i} {\bf r}_{ij} \cdot {\bf f}_{ij}
= \sum_{i = 1}^N \sum_{j > i} {\bf r}_{ij} \cdot {\bf f}_{ij},
$$
where ${\bf r}_{ij} = {\bf r}_i - {\bf r}_j$. The virial is therefore
$$
W = \frac{1}{3} \sum_{i = 1}^N \sum_{j > i} {\bf r}_{ij} \cdot {\bf f}_{ij}.
$$
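It's essential to use this form for systems with periodic boundary conditions, where the absolute positions ${\bf r}_i$ are defined only up to a lattice translation, while the pair separations ${\bf r}_{ij}$ (taken between nearest periodic images) are well defined.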
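Here is a minimal sketch of the pairwise virial for a cubic periodic box, using nearest-image separations. The Lennard-Jones pair potential is an assumption made only to have a concrete force law, there is no cutoff, and the double loop is the naive $O(N^2)$ version; the function names are hypothetical.

```python
def lj_pair_virial(r2, epsilon=1.0, sigma=1.0):
    """r_ij . f_ij for a Lennard-Jones pair at squared separation r2.

    U(r) = 4 eps [(sigma/r)^12 - (sigma/r)^6], so
    r . f = -r dU/dr = 24 eps [2 (sigma/r)^12 - (sigma/r)^6].
    """
    sr6 = (sigma * sigma / r2) ** 3
    return 24.0 * epsilon * (2.0 * sr6 * sr6 - sr6)


def internal_virial(positions, box_length):
    """W = (1/3) sum_{i<j} r_ij . f_ij in a cubic periodic box (nearest-image separations)."""
    n = len(positions)
    w = 0.0
    for i in range(n - 1):
        for j in range(i + 1, n):
            r2 = 0.0
            for a in range(3):
                d = positions[i][a] - positions[j][a]
                d -= box_length * round(d / box_length)  # nearest periodic image
                r2 += d * d
            w += lj_pair_virial(r2)
    return w / 3.0


box = 10.0
pair = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
shifted = [(x + 3.3, y - 1.2, z + 0.7) for (x, y, z) in pair]
print(internal_virial(pair, box), internal_virial(shifted, box))
```

Translating every particle by the same vector leaves $W$ unchanged, as expected for the origin-independent form. For a single configuration the instantaneous interaction pressure is $W/V$; averaging it over sampled configurations estimates $P^\mathrm{int}$.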