## Complex equilibria

To represent complex equilibria accurately, equilibrium constants and, consequently, binding free energies must be calculated accurately. This tutorial summarizes the important points that need to be addressed, mainly based on [this paper](https://doi.org/10.1039/C5CP00628G). We assume water as the solvent, but the results apply to any solvent with minor changes.

The general approach for predicting the standard free energy of binding $\Delta G^\circ_\text{b, aq}$ of a host (R) and a guest (L) in water (aq) starts from the equilibrium

$$
\require{mhchem}\ce{R(aq) + L(aq) <=> RL(aq)}
$$

which translates to the thermodynamic relationship

$$
\Delta G^\circ_\text{b, aq}
  = G^\circ_\text{aq}(RL) - (G^\circ_\text{aq}(R) + G^\circ_\text{aq}(L))
$$

where $G^\circ_\text{aq}(\ce{X})$ is the absolute free energy of molecule X, which encompasses the electronic energy, the rigid rotor-harmonic oscillator (RRHO) energy and the solvation free energy, and contains the zero-point energy through the RRHO approximation. The standard state ($^\circ$) is assumed to be 1 M.
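As a minimal sketch of the relationship above, the snippet below computes a binding free energy from three absolute free energies and converts it to an equilibrium constant through $K = \exp(-\Delta G^\circ / RT)$. The function names and the numbers in the usage lines are hypothetical, and energies are assumed to be given in kcal/mol.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def binding_free_energy(g_complex, g_host, g_guest):
    """Return G(RL) - (G(R) + G(L)), all in kcal/mol."""
    return g_complex - (g_host + g_guest)


def equilibrium_constant(delta_g, temperature=298.15):
    """Return K = exp(-delta_g / RT) for delta_g in kcal/mol."""
    return math.exp(-delta_g / (R_KCAL * temperature))


# Hypothetical free energies (kcal/mol), for illustration only.
delta_g = binding_free_energy(g_complex=-10.0, g_host=-3.0, g_guest=-2.0)
k_binding = equilibrium_constant(delta_g)
```

With a 1 M standard state, the resulting constant for a 1:1 association is expressed relative to 1 M concentrations.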

It's nowadays very common to do fully solvated electronic calculations, without the use of thermodynamic cycles. Furthermore, it's very important to take dispersion, and three-body dispersion in particular, into account.

Most electronic structure codes (such as ORCA) compute RRHO energy corrections for ideal gases, for which the standard state is a pressure of 1 bar (or sometimes 1 atm). In order to correct gas phase free energies, the following transformation is applied

$$
G^{\circ \text{(1 M)}}(\ce{X})
  = G^{\circ \text{(1 bar)}}(\ce{X}) - R T \ln \left( V^{-1} \right)
$$

where $V$ is the molar volume (in L/mol) of an ideal gas at temperature $T$ and the gas-phase standard pressure. At 298 K this correction increases the absolute free energy by 1.90 kcal/mol.
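The correction can be checked numerically. The sketch below assumes $V$ is the ideal-gas molar volume in L/mol, $V = RT/P$, so that $-RT \ln(V^{-1}) = RT \ln V$; the function name is hypothetical.

```python
import math

R_KCAL = 1.987204e-3   # gas constant in kcal/(mol K)
R_L_BAR = 0.0831446    # gas constant in L bar/(mol K)


def standard_state_correction(temperature=298.15, pressure_bar=1.0):
    """Free energy change (kcal/mol) from the gas standard state to 1 M."""
    molar_volume = R_L_BAR * temperature / pressure_bar  # L/mol
    return R_KCAL * temperature * math.log(molar_volume)


correction_1bar = standard_state_correction()                       # about 1.90
correction_1atm = standard_state_correction(pressure_bar=1.01325)   # about 1.89
```

The difference between the 1 bar and 1 atm conventions is below 0.01 kcal/mol, so either gives essentially the same correction.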

The rigid-rotor rotational entropy is affected by the molecule's symmetry number ($\sigma$). If the electronic structure code hasn't added the correct term, you can add it manually as

$$
G^{\circ \text{(sym)}}(\ce{X})
  = G^{\circ \text{(C$_1$)}}(\ce{X}) + R T \ln \left( \sigma_X \right)
$$

For instance, CB7 has $D_{7h}$ symmetry and a corresponding $\sigma$ of 14, which contributes 1.56 kcal/mol to the absolute free energy at 298 K.
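A short numerical check of the symmetry term, with a hypothetical function name, reproduces the CB7 value:

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def symmetry_correction(sigma, temperature=298.15):
    """Return RT ln(sigma) in kcal/mol, to be added to a C1 free energy."""
    return R_KCAL * temperature * math.log(sigma)


cb7_correction = symmetry_correction(14)  # about 1.56 kcal/mol
```

Before applying it, check whether your electronic structure code has already used the correct symmetry number, so that the term is not counted twice.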

overreact assumes that each molecular structure is a different entity, which means that conformers and acid-base related structures should be explicitly included. (It may be wise to include conformers that are within 1.36 kcal/mol of the global minimum, since each of those contributes more than 0.1 to the Boltzmann sum at 298 K.) You can always calculate the final apparent equilibrium constant from the final simulated populations.

(On the other hand, accurately calculating acid-base equilibria is another matter altogether due to charge errors. Coordination of other ions, including the treatment of ionic strength, follows the same guidelines as for pH.)
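The 1.36 kcal/mol conformer cutoff mentioned above can be verified with a few lines. This is only an illustrative sketch: the function names and the conformer energies are hypothetical, and energies are taken relative to the global minimum in kcal/mol.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def boltzmann_factor(relative_energy, temperature=298.15):
    """Return exp(-E / RT) for an energy relative to the global minimum."""
    return math.exp(-relative_energy / (R_KCAL * temperature))


def boltzmann_populations(energies, temperature=298.15):
    """Return normalized populations for a list of conformer energies."""
    lowest = min(energies)
    factors = [boltzmann_factor(e - lowest, temperature) for e in energies]
    total = sum(factors)
    return [f / total for f in factors]


# Hypothetical conformer energies (kcal/mol), for illustration only.
populations = boltzmann_populations([0.0, 0.5, 1.36, 3.0])
```

A conformer at exactly 1.36 kcal/mol has a Boltzmann factor just above 0.1 at 298.15 K, which is where the suggested cutoff comes from.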

Solvation must be treated accurately with a proper solvation method such as SMD, which is perfectly fine for optimizing geometries. Also bear in mind that the addition of explicit water molecules, especially around ionic sites, can improve binding free energies. (Besides, with tools such as xtb and CREST, there's no excuse to not do conformational searches or find optimum interaction structures for explicit waters nowadays.)

Explicit water molecules can be added and the solvation energy can be calculated as follows:

$$
G^\circ_\text{aq, n}(\ce{X})
  = G^\circ_\text{aq}(\ce{X(H2O)_{n}}) - G^\circ_\text{aq}(\ce{(H2O)_{n}})
$$

The value $n$ can be chosen such that convergence is achieved according to an energy criterion (e.g. 1 kcal/mol). This corresponds to the following free energy change for a host-guest system:

$$
\ce{R(H2O)_{m} (aq) + L(H2O)_{n} (aq) + (H2O)_{l} (liq) <=> RL(H2O)_{l} (aq) + (H2O)_{n} (liq) + (H2O)_{m} (liq)}
$$
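One possible way to choose $n$ is sketched below, assuming convergence means that adding one more water changes $G^\circ_\text{aq, n}(\ce{X})$ by less than the chosen threshold. The function names and all free energies are hypothetical; index $n$ of each list holds the value for $n$ explicit waters, with the free energy of the empty water cluster taken as zero.

```python
def cluster_free_energies(g_solute_clusters, g_water_clusters):
    """Return G(X(H2O)_n) - G((H2O)_n) for each n, in kcal/mol."""
    return [gx - gw for gx, gw in zip(g_solute_clusters, g_water_clusters)]


def converged_n(values, threshold=1.0):
    """Return the smallest n whose value differs from n - 1 by < threshold."""
    for n in range(1, len(values)):
        if abs(values[n] - values[n - 1]) < threshold:
            return n
    return None  # not converged within the available cluster sizes


# Hypothetical free energies (kcal/mol) for n = 0, 1, 2, 3 explicit waters.
g_solute = [-100.0, -110.5, -119.2, -128.0]
g_water = [0.0, -6.0, -13.0, -21.5]

g_n = cluster_free_energies(g_solute, g_water)
n_converged = converged_n(g_n, threshold=1.0)
```

Returning `None` when no step falls below the threshold makes it explicit that larger clusters would be needed.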

When using explicit water molecules, errors in solvation free energy can be reduced by ensuring that the continuum solvation free energy of a single water molecule matches the experimental value of -6.32 kcal/mol at 298.15 K.