Skip to content

Commit dcf7264

Browse files
authored
Merge pull request #13 from Qihang-Zhang/master
Qihang/Add Blog: Why the Exponential? From Max‑Entropy RL to the Boltzmann Distribution
2 parents a0dd74f + a56d0dc commit dcf7264

File tree

1 file changed

+17
-0
lines changed

1 file changed

+17
-0
lines changed

_posts/2025-10-11-max-ent-rl.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
---
2+
layout: distill
3+
title: Why the Exponential? From Max‑Entropy RL to the Boltzmann Distribution
4+
description: This blog post explores why the exponential function appears ubiquitously across modern RL, energy-based modeling, and statistical mechanics. We examine the connection between max-entropy reinforcement learning and the Boltzmann distribution, uncovering the fundamental principles that make the exponential form inevitable and explaining what "temperature" actually does in these frameworks.
5+
tags: reinforcement-learning information-theory boltzmann-distribution
6+
giscus_comments: true
7+
date: 2025-10-11
8+
featured: true
9+
redirect: https://qihang-zhang.com/Learning-Sys-Blog/2025/10/06/max-ent-rl-and-boltzmann-distribution.html
10+
11+
authors:
12+
- name: Qihang Zhang
13+
url: "https://qihang-zhang.com/"
14+
affiliations:
15+
name: UBC
16+
17+
---

0 commit comments

Comments
 (0)