# Lecture 4: A lightning overview of statistical mechanics

## Physics 7810, Spring 2020

## 3.1 - Overview 

In this lecture I'll review some results from statistical mechanics, focusing on those aspects of most relevance to computer simulation. Of particular interest are the various ways we can measure observables in computer simulations.

## 3.2 Subsystems

An isolated macroscopic system with fixed $N,V,E$ can be divided into subsystems in thermal, mechanical, or diffusive contact, with $E = E_1 + E_2$, $V = V_1 + V_2$, and $N = N_1 + N_2$.

<img src="images/Sethna_Fig_3_3.png" alt="Drawing" style="width: 400px;">

Figure from *Statistical Mechanics: Entropy, Order Parameters, and Complexity*, by Jim Sethna

All of statistical mechanics and thermodynamics can be derived by considering how the *total number of microstates* 
(*multiplicity*) $\Omega(N,V,E)$ depends on $E_1$, $V_1$, or $N_1$.
 
Various thermodynamic ensembles are obtained by considering a small subsystem in thermal, mechanical, or diffusive contact with a much larger *reservoir* (of energy, volume, or particles).

## 3.3 Basic postulate of statistical mechanics

*An isolated system with constant particle number $N$, volume $V$, and energy $E$ is equally likely to be found in any of its available microstates*.
  
The number of microstates corresponding to the macrostate $N,V,E$ is called the *multiplicity*, denoted $\Omega(N,V,E)$.
 
Given this postulate, the macroscopic properties of a system can be calculated as an unweighted average over the $\Omega(N,V,E)$ microstates ('microcanonical ensemble').
 
In fact, we can obtain all thermodynamic properties (pressure, chemical potential, ...) from $\Omega(N,V,E)$ itself, but calculating $\Omega(N,V,E)$ is hard (except for the classical ideal gas and a few other simple models).

Justification for the basic postulate:

* Liouville theorem: a uniform phase space density is stationary under Hamiltonian dynamics
* Microscopic chaos: mixing flow in phase space causes chaotic systems to evolve toward this stationary state (ergodic hypothesis)
* Eigenstate thermalization hypothesis (?)

## 3.4 Large and very large numbers, entropy


The number of atoms in a macroscopic volume of matter is a *large* number:
 
$$N \sim 10^{23}.$$
 
The multiplicity $\Omega(N,V,E)$, which measures the number of ways of assigning $N$ particles to available states with total energy $E$, is a *very large* (*combinatorially large*) number:
 
$$\Omega(N,V,E) \sim e^N \sim e^{10^{23}}.$$

Such unimaginably large numbers are hard to deal with, so we usually work with their logarithms, which are merely large:
 
$$S(N,V,E) = k_B \ln \Omega(N,V,E) \sim O(N).$$
 
$S(N,V,E)$ is the *entropy*, a logarithmic measure of the number of microstates for a given macrostate $N,V,E$. Boltzmann's constant $k_B$ is introduced for historical and practical reasons.
 
$S(N,V,E)$ usually increases with increasing $N$, $V$, or $E$.
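
To get a feel for "large" versus "very large", here is a minimal sketch using a toy model that is an assumption for illustration only (it is not part of the lecture): $N$ two-state spins with half of them up, so that $\Omega = \binom{N}{N/2}$. The function name `ln_multiplicity` is hypothetical. The point is that $\Omega$ itself overflows any floating-point type almost immediately, while $\ln \Omega$ grows linearly with $N$ (here $\ln \Omega / N \rightarrow \ln 2$).

```python
import math

def ln_multiplicity(n_spins: int, n_up: int) -> float:
    """ln of the binomial coefficient C(n_spins, n_up).

    Uses log-gamma so that we never form the (astronomically large)
    multiplicity itself.
    """
    return (math.lgamma(n_spins + 1)
            - math.lgamma(n_up + 1)
            - math.lgamma(n_spins - n_up + 1))

# ln(Omega) is O(N): the ratio ln(Omega)/N settles down to a constant.
for n in (10, 1_000, 100_000, 10**7):
    ln_omega = ln_multiplicity(n, n // 2)
    print(f"N = {n:>9d}   ln(Omega) = {ln_omega:.4e}   ln(Omega)/N = {ln_omega / n:.4f}")
```

Working with `lgamma` rather than factorials is the numerical counterpart of working with $S$ rather than $\Omega$.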


## 3.5 Entropy is an extensive quantity

Consider an isolated system with total energy $E$ consisting of two weakly coupled subsystems that can exchange energy, whose states can be enumerated independently.
 
The total multiplicity is the product of the multiplicities of the two subsystems, integrated over the energy $E_1$ of system 1:
 
$$\Omega(E) = \int_0^E \Omega_1(E_1) \Omega_2(E-E_1) dE_1.$$
 
Note that the integrand divided by the total integral is just the probability density for subsystem 1 to have energy $E_1$:
 
$$\rho(E_1) = \Omega_1(E_1) \Omega_2(E-E_1) / \Omega(E).$$

For large $N$, the integrand is very sharply peaked around $E_1^{\rm max}$, with (small) width $\Delta E_1$, so
 
$$\Omega(E) \approx \Omega_1(E_1^{\rm max}) \Omega_2(E-E_1^{\rm max}) \Delta E_1.$$

(Note: this expression relies on the integrand being extremely sharply peaked. The area under the curve is then well estimated by the area of the peak alone, i.e., its height times its width. The larger $N$ is, the sharper the peak, so the estimate is particularly good for very large $N$.)

The total entropy is therefore:
 
$$S(E) = k_B \ln \Omega(E) \approx S_1(E_1^{\rm max}) + S_2(E - E_1^{\rm max}).$$
 
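For large $N$, $\Delta E_1 \sim O(N^{1/2})$, so the neglected term $k_B \ln \Delta E_1 \sim O(\ln N)$ is negligible compared with the $O(N)$ entropies, and this is an extremely good approximation.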
 
Thus, $S$ is an additive (extensive) property (proportional to $N$).
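
The sharp-peak argument can be checked numerically. The sketch below assumes a specific toy model that does not appear in the lecture: two identical collections of `n_osc` harmonic oscillators sharing `q = 2 * n_osc` energy quanta, for which the multiplicity is the standard count $\Omega(N, q) = \binom{q + N - 1}{q}$. Energy is discrete here, so the integral over $E_1$ becomes a sum over `q1`. The names `ln_omega` and `peak_statistics` are hypothetical.

```python
import math

def ln_omega(n_osc: int, q: int) -> float:
    """ln C(q + n_osc - 1, q): ways to distribute q quanta over n_osc oscillators."""
    return math.lgamma(q + n_osc) - math.lgamma(q + 1) - math.lgamma(n_osc)

def peak_statistics(n_osc: int):
    """Return (ln Omega_total, ln of the largest term, relative width of the peak)."""
    q = 2 * n_osc
    # ln of the summand Omega_1(q1) * Omega_2(q - q1) for every split of the energy
    ln_w = [ln_omega(n_osc, q1) + ln_omega(n_osc, q - q1) for q1 in range(q + 1)]
    ln_w_max = max(ln_w)
    # log-sum-exp: rescale by the largest term so exp() cannot overflow
    weights = [math.exp(x - ln_w_max) for x in ln_w]
    norm = sum(weights)
    ln_total = ln_w_max + math.log(norm)
    mean = sum(q1 * w for q1, w in enumerate(weights)) / norm
    var = sum((q1 - mean) ** 2 * w for q1, w in enumerate(weights)) / norm
    return ln_total, ln_w_max, math.sqrt(var) / q

for n_osc in (10, 100, 1_000, 10_000):
    ln_total, ln_peak, rel_width = peak_statistics(n_osc)
    print(f"N = {n_osc:>6d}   ln(Omega) = {ln_total:12.3f}   "
          f"S1 + S2 at peak = {ln_peak:12.3f}   relative width = {rel_width:.5f}")
```

In this model the relative width of the peak shrinks roughly like $N^{-1/2}$, and the difference between $\ln \Omega$ and the peak value grows only logarithmically with $N$, so it becomes invisible next to the $O(N)$ entropies.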


## 3.6 Second law of thermodynamics

Isolated chaotic or stochastic many-body systems evolve toward more probable (higher entropy) macrostates.
 
The equilibrium macrostate is the state of maximum entropy.
  
**The change in entropy associated with *any process* in an *isolated* system is non-negative:**
 
$$\Delta S \geq 0.$$

For large systems, this law is *never* observably violated.
 
*The force of probability is strong*.


## 3.7 Temperature, pressure, and chemical potential

An isolated system (constant $N,V,E$) is *overwhelmingly* likely to be found in the macrostate that maximizes the entropy $S(E)$.
 
What's the condition for thermal equilibrium between two weakly coupled subsystems that share $E$ (but not $N$ or $V$)?

Maximize the total entropy $S(E) = S_1(E_1) + S_2(E - E_1)$ as a function of $E_1$:
 
$$\left( {{\partial S(E)} \over {\partial E_1}} \right) = 0 
 = \left( {{\partial S_1(E_1)} \over {\partial E_1}} \right) + \left( {{\partial S_2(E - E_1)} \over {\partial E_1}} \right)$$
$$ = \left( {{\partial S_1(E_1)} \over {\partial E_1}} \right) - \left( {{\partial S_2(E_2)} \over {\partial E_2}} \right) = {1 \over T_1} - {1 \over T_2}.$$
 
Here, the absolute *statistical temperature* $T$ is defined as:
 
$${1 \over T} = \left( {{\partial S(N,V,E)} \over {\partial E}} \right)_{N,V} $$
 
$T_1 = T_2$ *is the condition for thermal equilibrium between subsystems*.

The inverse temperature $1/T$ is a logarithmic measure of how rapidly the number of microstates increases with increasing $E$, when $N$ and $V$ are held constant:
  
$$ {1 \over T} = \left( {{\partial S(N,V,E)} \over {\partial E}} \right)_{N,V}.$$
 
Low $T$ (large $1/T$) implies a relatively large increase in entropy (probability) with the addition of a given amount of energy.
 
The flow of energy from high-temperature regions to low-temperature regions leads to an increase in the overall entropy, and is *overwhelmingly* probable.
 
This is a consequence of the second law of thermodynamics.
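
Here is a small numerical illustration of the equal-temperature condition. It again assumes a toy model that is not in the lecture: two collections of harmonic oscillators of different sizes (`n1 = 300`, `n2 = 700`) sharing `q_total = 1000` quanta, with $\Omega(N, q) = \binom{q + N - 1}{q}$. Working in units where $k_B = 1$ and the energy quantum is 1, $1/T$ is estimated as a finite difference of $\ln \Omega$. All names are hypothetical.

```python
import math

def ln_omega(n_osc: int, q: int) -> float:
    """ln C(q + n_osc - 1, q): entropy S/k_B of n_osc oscillators holding q quanta."""
    return math.lgamma(q + n_osc) - math.lgamma(q + 1) - math.lgamma(n_osc)

def beta(n_osc: int, q: int) -> float:
    """1/(k_B T) = d ln(Omega)/dE, estimated by a central difference in q."""
    return 0.5 * (ln_omega(n_osc, q + 1) - ln_omega(n_osc, q - 1))

n1, n2, q_total = 300, 700, 1000

def total_entropy(q1: int) -> float:
    """S_1(E_1) + S_2(E - E_1) in units of k_B."""
    return ln_omega(n1, q1) + ln_omega(n2, q_total - q1)

# Equilibrium: the split of the energy that maximizes the total entropy.
q1_star = max(range(1, q_total), key=total_entropy)
beta1, beta2 = beta(n1, q1_star), beta(n2, q_total - q1_star)
print(f"most probable q1 = {q1_star},  1/T1 = {beta1:.4f},  1/T2 = {beta2:.4f}")

# Away from equilibrium: subsystem 1 holds too little energy, so it is colder
# (larger 1/T), and moving one quantum from 2 to 1 raises the total entropy.
q1_cold = 100
gain = total_entropy(q1_cold + 1) - total_entropy(q1_cold)
print(f"q1 = {q1_cold}: 1/T1 = {beta(n1, q1_cold):.4f}, "
      f"1/T2 = {beta(n2, q_total - q1_cold):.4f}, entropy gain = {gain:.4f} k_B")
```

The two inverse temperatures agree at the entropy maximum up to small finite-size corrections, and the entropy gain for moving energy from the hot to the cold subsystem is approximately $1/T_1 - 1/T_2 > 0$.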

What's the condition for mechanical equilibrium between two subsystems that share $V$ (but not $N$ or $E$)?
 
$$\left( {{\partial S_1(V_1)} \over {\partial V_1}} \right) = \left( {{\partial S_2(V_2)} \over {\partial V_2}} \right).$$

This can be written:
$${P_1 \over T_1} = {P_2 \over T_2},$$
 
where the *statistical pressure* $P$ is defined via:
 
$${P \over T} = \left( {{\partial S(N,V,E)} \over {\partial V}} \right)_{N,E}.$$
 
$P/T$ is a logarithmic measure of how rapidly the number of available states increases with increasing $V$, when $N$ and $E$ are held constant.

What's the condition for diffusive equilibrium between two subsystems that share $N$ (but not $V$ or $E$)?

$$\left( {{\partial S_1(N_1)} \over {\partial N_1}} \right) = \left( {{\partial S_2(N_2)} \over {\partial N_2}} \right).$$

This can be written:
$${\mu_1 \over T_1} = {\mu_2 \over T_2},$$ 

where the *chemical potential* $\mu$ is defined via: 

$${\mu \over T} = - \left( {{\partial S(N,V,E)} \over {\partial N}} \right)_{V,E}.$$

$\mu/T$ is a logarithmic measure of how rapidly the number of available states *decreases* with increasing $N$, when $V$ and $E$ are held constant.
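
The three definitions above can be tried out on the classical ideal gas, one of the few cases where $S(N,V,E)$ is known in closed form. The sketch below assumes the Sackur-Tetrode entropy of a monatomic ideal gas, $S = N k_B \left[ \ln \left( \frac{V}{N} \left( \frac{4 \pi m E}{3 N h^2} \right)^{3/2} \right) + \frac{5}{2} \right]$, which is not derived in this lecture, and works in reduced units with $k_B = m = h = 1$. It also treats $N$ as a continuous variable when differentiating. The helper names `entropy` and `partial` are hypothetical.

```python
import math

def entropy(n: float, v: float, e: float, mass: float = 1.0, h: float = 1.0) -> float:
    """Sackur-Tetrode entropy S/k_B of a classical monatomic ideal gas (reduced units)."""
    return n * (math.log((v / n) * (4.0 * math.pi * mass * e / (3.0 * n * h**2)) ** 1.5) + 2.5)

def partial(f, args, index: int, rel_step: float = 1e-6) -> float:
    """Central finite-difference derivative of f with respect to args[index]."""
    step = rel_step * abs(args[index])
    up = list(args)
    down = list(args)
    up[index] += step
    down[index] -= step
    return (f(*up) - f(*down)) / (2.0 * step)

n, v, e = 1000.0, 5000.0, 3000.0
state = (n, v, e)

temperature = 1.0 / partial(entropy, state, 2)   # 1/T  =  (dS/dE)_{N,V}
pressure = temperature * partial(entropy, state, 1)   # P/T  =  (dS/dV)_{N,E}
mu = -temperature * partial(entropy, state, 0)   # mu/T = -(dS/dN)_{V,E}

print(f"T  = {temperature:.6f}   (expect 2E/(3N) = {2 * e / (3 * n):.6f})")
print(f"P  = {pressure:.6f}   (expect N T / V  = {n * temperature / v:.6f})")
print(f"mu = {mu:.6f}")
```

The numerical derivatives recover $E = \frac{3}{2} N k_B T$ and $PV = N k_B T$, and give a negative chemical potential for this dilute state.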

## 3.8 - First law of thermodynamics 

We can now write a general expression for how $S(N,V,E)$ varies under arbitrary (small) variations in $N$, $V$, and $E$:

$$dS = \left( {{\partial S} \over {\partial E}} \right)_{N,V} dE + \left( {{\partial S} \over {\partial V}} \right)_{N,E} dV + \left( {{\partial S} \over {\partial N}} \right)_{V,E} dN$$

$$ = {1 \over T} dE + {P \over T} dV - {{\mu} \over T} dN.$$

This can be rearranged to give:
 
$$dE = T dS - P dV + \mu dN$$

$$= dQ  + dW + dW_{\rm chem}.$$
 
Here $dQ = T dS$ is the heat (thermal energy) *added to* the system, and $dW$ and $dW_{\rm chem}$ are, respectively, the mechanical and chemical work *done on* the system.
 
This is just a statement of energy conservation, also called the *first law of thermodynamics* or the *fundamental thermodynamic identity*.

## 3.9 - Third law of thermodynamics 

In the limit of zero absolute temperature, a quantum system should settle into its unique lowest-energy ground state:
 
$$\lim_{T \rightarrow 0} \Omega(N,V,E) = 1.$$
 
This implies that the entropy goes to zero for $T \rightarrow 0$ (the *third law of thermodynamics*):

$$\lim_{T \rightarrow 0} S(N,V,E) = 0.$$

Caveats:

* The ground state may be degenerate.
* Systems tend to fall out of equilibrium well before reaching $T = 0$ (glassy dynamics, non-ergodic behavior).

## 4.0 - Canonical ensemble 

Consider a system A in thermal contact with a much larger system B ('heat bath') such that $T_B = T$ is constant.
 
This defines the constant $N,V,T$ ensemble ('canonical' ensemble).
 
The probability of finding system A in a specific energy eigenstate $i$ with energy $E_i$ is:
 
$$P_i = {{\Omega_A(E_i) \Omega_B(E - E_i)} \over {\sum_j \Omega_A(E_j) \Omega_B(E - E_j)}} 
 = {{\Omega_B(E - E_i)} \over {\sum_j \Omega_B(E - E_j)}}.$$
 
Note that $\Omega_A(E_i) = 1$ for any pure energy eigenstate $i$.

$E_i \ll E$, so we can expand $\ln \Omega_B(E - E_i)$ around $E_i = 0$:
 
$$\ln \Omega_B(E - E_i) \approx \ln \Omega_B(E) - E_i \left( {{\partial \ln \Omega_B(E)} \over {\partial E}} \right)_{N,V} = \ln \Omega_B(E) - \beta E_i,$$
 
where $\beta = (k_B T)^{-1}$. Recall that $S(E) = k_B \ln \Omega_B(E)$ and $1/T = (\partial S(E) / \partial E)_{N,V}$.

We therefore have
  
$$P_i = \frac{e^{-\beta E_i}}{\sum_j e^{-\beta E_j}} = \frac{e^{-\beta E_i}}{Z}.$$
 
This is the famous *Boltzmann distribution*, where $Z$ is the *partition function*:
  
$$Z(N,V,T) =  \sum_j e^{-\beta E_j}.$$
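
The derivation can be tested by brute-force counting. The sketch below assumes a concrete toy setup that is not part of the lecture: system A is a single harmonic oscillator with levels $E_n = n$ (in units of the quantum), and the bath B is a collection of `n_bath` oscillators with $\Omega_B(q) = \binom{q + N_B - 1}{q}$, with `q_total` quanta shared between A and B. The exact probabilities $P_n \propto \Omega_B(q_{\rm total} - n)$ are compared with the Boltzmann form, using $\beta$ obtained from the bath alone. All names are hypothetical.

```python
import math

def ln_omega(n_osc: int, q: int) -> float:
    """ln C(q + n_osc - 1, q): ln multiplicity of the bath holding q quanta."""
    return math.lgamma(q + n_osc) - math.lgamma(q + 1) - math.lgamma(n_osc)

n_bath, q_total = 10_000, 20_000

# beta = d ln(Omega_B)/dE, estimated by a central difference (energy unit = one quantum)
beta_eps = 0.5 * (ln_omega(n_bath, q_total + 1) - ln_omega(n_bath, q_total - 1))

levels = range(41)  # levels above n = 40 carry negligible weight here

# Exact microcanonical counting: P_n is proportional to Omega_B(q_total - n)
ln_w = [ln_omega(n_bath, q_total - n) for n in levels]
w_exact = [math.exp(x - ln_w[0]) for x in ln_w]
norm_exact = sum(w_exact)
p_exact = [w / norm_exact for w in w_exact]

# Boltzmann distribution with the bath temperature
w_boltz = [math.exp(-beta_eps * n) for n in levels]
norm_boltz = sum(w_boltz)
p_boltz = [w / norm_boltz for w in w_boltz]

print(f"beta * quantum = {beta_eps:.5f}")
for n in (0, 1, 2, 5, 10):
    print(f"n = {n:2d}   exact P_n = {p_exact[n]:.6f}   Boltzmann P_n = {p_boltz[n]:.6f}")
```

The agreement improves as the bath gets larger relative to the energies accessible to A, which is exactly the condition $E_i \ll E$ used in the expansion.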

$Z$ plays the same role in the $N,V,T$ ensemble that $\Omega$ plays in the $N,V,E$ ensemble (it counts available states), and is a *very large* number if $N$ is large.

To obtain an extensive quantity that's just a *large* number, we can define the *Helmholtz free energy* $A$:

$$A(N,V,T) = -k_B T \ln Z(N,V,T).$$

$A$ plays an analogous role to $S$, and is *minimized* in thermal equilibrium for constant $N,V,T$.

From $Z$ and $A$, we can calculate all thermodynamic properties, for example the average energy,

$$\langle E \rangle = \sum_i E_i P_i = {1 \over Z} \sum_i E_i e^{-\beta E_i} = - {1 \over Z} {{\partial Z} \over {\partial \beta}} = - {{\partial \ln Z} \over {\partial \beta}} = {{\partial (\beta A)} \over {\partial \beta}},$$

the entropy,

$$S = {1 \over T} \left[\langle E \rangle - A \right],$$

and the heat capacity, 
$$C_V = \left( {{\partial \langle E \rangle } \over {\partial T}} \right)_{N,V} = {1 \over {k_B T^2}} {{\partial^2 \ln Z} \over {\partial \beta^2}}. $$

The canonical ensemble is generally *much* more convenient for calculations than the microcanonical ensemble.
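
For a system with a small, explicitly known set of energy levels, all of the formulas above can be evaluated directly. The sketch below uses a made-up three-level spectrum and $k_B = 1$; both are assumptions for illustration, as is the function name `canonical_stats`. It also uses the standard identity $\partial^2 \ln Z / \partial \beta^2 = \langle E^2 \rangle - \langle E \rangle^2$, which is not stated above, to get $C_V$ from energy fluctuations.

```python
import math

def canonical_stats(energies, beta):
    """Return (ln Z, Boltzmann probabilities, <E>, variance of E) for given levels."""
    e_min = min(energies)
    # Shift by the lowest level so the exponentials cannot overflow
    weights = [math.exp(-beta * (e - e_min)) for e in energies]
    z_shifted = sum(weights)
    probs = [w / z_shifted for w in weights]
    ln_z = -beta * e_min + math.log(z_shifted)
    mean_e = sum(p * e for p, e in zip(probs, energies))
    mean_e2 = sum(p * e * e for p, e in zip(probs, energies))
    return ln_z, probs, mean_e, mean_e2 - mean_e**2

energies = [0.0, 1.0, 2.5]   # hypothetical three-level system
temperature = 0.8            # in units where k_B = 1
beta = 1.0 / temperature

ln_z, probs, mean_e, var_e = canonical_stats(energies, beta)
free_energy = -temperature * ln_z                 # A = -k_B T ln Z
entropy = (mean_e - free_energy) / temperature    # S = (<E> - A) / T
heat_capacity = var_e / temperature**2            # C_V = var(E) / (k_B T^2)

# Cross-check <E> = -d ln Z / d beta with a central finite difference
d_beta = 1e-5
mean_e_fd = -(canonical_stats(energies, beta + d_beta)[0]
              - canonical_stats(energies, beta - d_beta)[0]) / (2.0 * d_beta)

print(f"P_i = {[round(p, 4) for p in probs]}")
print(f"<E> = {mean_e:.6f}   (finite difference of ln Z: {mean_e_fd:.6f})")
print(f"A = {free_energy:.6f}   S = {entropy:.6f}   C_V = {heat_capacity:.6f}")
```

In a simulation the levels are not enumerable, but the same averages and fluctuations are estimated from sampled configurations.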

## 4.1 - Planck radiation law 

The energy levels of a 1D harmonic oscillator (neglecting zero-point energy) are $E_n = n \hbar \omega$, with $n = 0, 1, 2, \ldots$, so the canonical partition function is

$$Z = \sum_{n=0}^\infty e^{- \beta n \hbar \omega} = \sum_{n=0}^\infty \left( e^{-\beta \hbar \omega} \right)^n
= {1 \over {1 - e^{-\beta \hbar \omega} }}.$$

Here we've used the identity $1 / (1-x) = \sum_{n=0}^\infty x^n$ for $|x| < 1$.

Thus, the average energy is

$$\left\langle E \right \rangle = - {1 \over Z} {{\partial Z} \over {\partial \beta}}
 = {{\hbar \omega} \over {e^{\beta \hbar \omega} - 1}}.$$

*This is the Planck distribution for the mean energy of a mode of frequency $\omega$, which underlies the Planck radiation law!*
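
As a quick sanity check, the closed-form results can be compared against a direct (truncated) sum over oscillator levels. This is a minimal sketch in units where $\hbar \omega = 1$ by default; the function names and the truncation `n_max` are assumptions chosen for illustration.

```python
import math

def oscillator_sum(beta: float, hbar_omega: float = 1.0, n_max: int = 2000):
    """Return (<E>, Z) from an explicit sum over levels E_n = n * hbar_omega."""
    weights = [math.exp(-beta * n * hbar_omega) for n in range(n_max + 1)]
    z = sum(weights)
    mean_e = sum(n * hbar_omega * w for n, w in enumerate(weights)) / z
    return mean_e, z

def planck_mean_energy(beta: float, hbar_omega: float = 1.0) -> float:
    """Closed form hbar_omega / (exp(beta * hbar_omega) - 1)."""
    return hbar_omega / math.expm1(beta * hbar_omega)

for beta in (0.1, 1.0, 5.0):
    mean_e, z = oscillator_sum(beta)
    print(f"beta = {beta:4.1f}   sum: <E> = {mean_e:.8f}   "
          f"closed form: {planck_mean_energy(beta):.8f}   k_B T = {1.0 / beta:.3f}")
```

At high temperature ($\beta \hbar \omega \ll 1$) the mean energy approaches the classical value $k_B T$ (up to a constant offset of order $\hbar \omega$), while at low temperature it is exponentially suppressed.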
