Gibbs variational principle (canonical ensemble)

The canonical Gibbs state minimizes the free-energy functional, equivalently maximizing entropy under an energy penalty; the gap is relative entropy.
Gibbs variational principle (canonical ensemble)

Statement

Fix inverse temperature β=1/(kBT)\beta=1/(k_B T) and a Hamiltonian on a classical with reference measure λ\lambda. Let μ\mu range over probability measures absolutely continuous with respect to λ\lambda, with density ρ=dμ/dλ\rho = d\mu/d\lambda.

Define the entropy functional (Shannon/Gibbs entropy)

S(μ)  =  kBρlogρdλ, S(\mu) \;=\; -k_B \int \rho \log \rho \, d\lambda,

and the mean energy Hμ=Hdμ\langle H\rangle_\mu = \int H\, d\mu.

Then the canonical free energy satisfies the variational formula

F(β)  =  infμ{HμTS(μ)}, F(\beta) \;=\; \inf_{\mu}\Big\{\langle H\rangle_\mu - T\, S(\mu)\Big\},

and the unique minimizer is the μβ\mu_\beta with density ρβ(x)=eβH(x)/Z(β)\rho_\beta(x) = e^{-\beta H(x)}/Z(\beta), where Z(β)Z(\beta) is the .

Equivalently, for any μ\mu,

HμTS(μ)  =  F(β)+kBTD(μμβ), \langle H\rangle_\mu - T S(\mu) \;=\; F(\beta) + k_B T\, D(\mu \,\|\, \mu_\beta),

where D(μμβ)D(\mu\|\mu_\beta) is . In particular, HμTS(μ)F(β)\langle H\rangle_\mu - T S(\mu) \ge F(\beta).

Key hypotheses

  • Z(β)<Z(\beta)<\infty so the exists.
  • μ\mu is absolutely continuous with respect to λ\lambda so that S(μ)S(\mu) is meaningful (otherwise interpret S(μ)=S(\mu)=-\infty).
  • Integrability conditions ensuring Hμ\langle H\rangle_\mu and S(μ)S(\mu) are well-defined for the class of measures considered.

Conclusion

  • The canonical Gibbs measure is characterized as the unique minimizer of the free-energy functional μHμTS(μ)\mu \mapsto \langle H\rangle_\mu - T S(\mu).
  • The “suboptimality gap” to equilibrium is exactly kBTk_B T times the KL divergence to the canonical state.

Proof idea / significance

Rewrite the functional at inverse temperature β\beta by adding and subtracting logZ(β)\log Z(\beta), then identify the remaining term as a KL divergence using the density ρβeβH\rho_\beta \propto e^{-\beta H}. Nonnegativity of KL divergence (a form of ) yields the lower bound and characterizes the minimizer. This principle is the backbone of maximum-entropy and free-energy methods, and it underlies many convexity/duality results such as .