# Simulated Annealing


## Introduction and Context

**Simulated Annealing (SA)** is a global optimization metaheuristic first proposed by **Kirkpatrick, Gelatt & Vecchi in 1983**. It is inspired by the **thermodynamic process of metal annealing**. In metallurgy, annealing involves heating a material to high temperatures and then gradually cooling it to reduce defects in its crystalline structure, ultimately reaching a minimum-energy state. Analogously, SA searches for optimal solutions in complex problems by allowing stochastic "moves" that can temporarily accept worse solutions, thus avoiding premature convergence to local minima.

### Physical Mechanism vs. Algorithm

* **Initial Heating**:
  In physical annealing, high temperatures allow atoms to move freely and reorganize.
  In SA, a **high initial temperature** $(T_0)$ enables the acceptance of higher-cost solutions $(\Delta f > 0)$, promoting exploration of distant regions in the search space.

* **Controlled Cooling**:
  In physical annealing, slow cooling $(T \to 0)$ allows atoms to stabilize into low-energy configurations.
  In SA, as $T$ decreases, the algorithm becomes more selective, favoring better solutions $(\Delta f \leq 0)$ and refining the search.
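
To make the effect of cooling concrete, the sketch below tabulates how likely a fixed uphill move is to be accepted as the temperature falls. It assumes a geometric schedule $T_{k+1} = \alpha T_k$ and the usual $e^{-\Delta f / T}$ acceptance rule. The name `geometric_schedule` and the values of `t0`, `alpha`, and `delta_f` are illustrative choices, not taken from the text.

```python
import math

def geometric_schedule(t0, alpha, steps):
    """Yield temperatures T_k = t0 * alpha**k (an assumed cooling rule)."""
    t = t0
    for _ in range(steps):
        yield t
        t *= alpha

delta_f = 1.0  # hypothetical cost increase of a worse candidate
for k, t in enumerate(geometric_schedule(t0=10.0, alpha=0.5, steps=6)):
    p = math.exp(-delta_f / t)  # chance of accepting the worse candidate at T = t
    print(f"step {k}: T = {t:7.4f}, P(accept worse) = {p:.4f}")
```

The printed probabilities shrink at every step: the same uphill move that is accepted most of the time at $T_0$ becomes unlikely once the system has cooled.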


**Table 1** maps thermodynamic concepts to optimization:  

| Thermodynamics         | Optimization               |  
|------------------------|---------------------------|  
| System States          | Feasible Solutions        |  
| Energy                 | Cost $f(x)$               |  
| State Change           | Neighborhood Structure    |  
| Temperature $T$        | Exploration Control       |  
| Frozen State           | Heuristic Solution        |  

**Source:** Dowsland (1995).

## Theoretical Foundations

### Boltzmann Distribution and Acceptance Criterion  

The basic idea of SA is to carry out a random search in the form of a Markov chain, which <u>not only accepts changes that improve the objective function but also keeps some changes that worsen it</u>.

SA’s theoretical basis lies in the **Boltzmann distribution** from statistical mechanics, which describes the probability of a thermodynamic system occupying a state with energy $E$ at temperature $T$:

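$$
P(E) \propto e^{-E / (k_B T)},
$$

where $k_B$ is the Boltzmann constant, so higher-energy states are exponentially less likely. The corresponding probability of moving to a state whose energy is higher by $\Delta E$, namely $e^{-\Delta E / (k_B T)}$, is called the **transition probability**.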

In SA, this distribution is adapted to define the **acceptance probability** of worse solutions, with the constant $k_B$ absorbed into $T$. Letting $\Delta f = f(x_{\text{new}}) - f(x_{\text{current}})$, the probability $p(\Delta f, T)$ is defined as

$$
p(\Delta f, T) = \begin{cases}
      1  & \Delta f \leq 0 \\
      e^{-\Delta f / T} & \Delta f \gt 0
\end{cases}
$$

* **Interpretation**:

  * If $\Delta f \leq 0$ (better solution), then $p = 1$: the new solution is always accepted.
  * If $\Delta f > 0$ (worse solution), $p = e^{-\Delta f / T}$: it decays exponentially as $\Delta f$ grows and approaches $0$ as $T \to 0$.
  * A higher temperature $T$ increases the probability of accepting worse solutions, promoting **exploration** in the early stages of the search.
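
The acceptance rule can be sketched in a few lines of Python. The function names `acceptance_probability` and `accept`, and the optional `rng` parameter, are hypothetical. This is one minimal way to encode the criterion, not a reference implementation.

```python
import math
import random

def acceptance_probability(delta_f, temperature):
    """Return p(delta_f, T): 1 for non-worsening moves, exp(-delta_f / T) otherwise."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if delta_f <= 0:
        return 1.0
    return math.exp(-delta_f / temperature)

def accept(delta_f, temperature, rng=random):
    """Accept the candidate if a uniform draw in [0, 1) falls below p."""
    return rng.random() < acceptance_probability(delta_f, temperature)

# The same worsening move (delta_f = 2) at high, moderate, and low temperature.
for t in (100.0, 1.0, 0.1):
    print(f"T = {t:6.1f}: p = {acceptance_probability(2.0, t):.6f}")
```

For $\Delta f = 2$, the probability is roughly $0.98$ at $T = 100$ and about $0.14$ at $T = 1$, and it is effectively zero at $T = 0.1$. Because the uniform draw is always below $1$, improving moves are accepted unconditionally.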