Cumulant generating function

The log moment-generating function; its derivatives at zero produce cumulants, and in statistical mechanics it is realized by log Z with sources.
Cumulant generating function

The cumulant generating function (CGF) is the standard tool for organizing fluctuations beyond the mean and variance. In statistical mechanics, the role of the CGF is played by the logarithm of a suitably “sourced” partition function, making cumulants accessible via derivatives of logZ\log Z.

Definition in probability

Let XX be a real-valued on a . Its (MGF), when finite near t=0t=0, is

MX(t)  =  E ⁣[etX]. M_X(t) \;=\; \mathbb{E}\!\left[e^{tX}\right].

The cumulant generating function is

KX(t)  =  logMX(t)  =  logE ⁣[etX]. K_X(t) \;=\; \log M_X(t) \;=\; \log \mathbb{E}\!\left[e^{tX}\right].

The nnth cumulant of XX is obtained by differentiating at the origin:

κn(X)  =  dndtnKX(t)t=0. \kappa_n(X) \;=\; \left.\frac{\mathrm{d}^n}{\mathrm{d}t^n}K_X(t)\right|_{t=0}.

In particular,

κ1(X)=E[X],κ2(X)=Var(X), \kappa_1(X)=\mathbb{E}[X],\qquad \kappa_2(X)=\mathrm{Var}(X),

linking cumulants to and . For two random variables X,YX,Y, mixed derivatives of K(X,Y)K_{(X,Y)} yield covariances (see ).

Multi-variable CGF and mixed cumulants

For a vector X=(X1,,Xm)X=(X_1,\dots,X_m), define

KX(t)  =  logE ⁣[exp ⁣(i=1mtiXi)], K_X(t) \;=\; \log \mathbb{E}\!\left[\exp\!\left(\sum_{i=1}^m t_i X_i\right)\right],

with t=(t1,,tm)t=(t_1,\dots,t_m). Then mixed partial derivatives at t=0t=0 give joint cumulants:

κ(Xi1,,Xin)  =  nKX(t)ti1tint=0. \kappa(X_{i_1},\dots,X_{i_n}) \;=\; \left.\frac{\partial^n K_X(t)}{\partial t_{i_1}\cdots \partial t_{i_n}}\right|_{t=0}.

These cumulants vanish when the variables split into independent groups, capturing “genuine” correlation structure.

Statistical mechanics realization: log Z as a CGF

In the , introduce “sources” (fields) λi\lambda_i conjugate to observables AiA_i by considering

Z(β,λ)  =  Γexp ⁣(βH0(x)+i=1mλiAi(x))dμ(x). Z(\beta,\lambda) \;=\; \int_{\Gamma}\exp\!\left(-\beta H_0(x) + \sum_{i=1}^m \lambda_i A_i(x)\right)\,\mathrm{d}\mu(x).

Then logZ(β,λ)\log Z(\beta,\lambda) is exactly a CGF for the observables (A1,,Am)(A_1,\dots,A_m) with respect to the canonical weight. Concretely, derivatives of logZ\log Z produce cumulants of the AiA_i (up to powers of β\beta depending on conventions), which is the basis of .

For example, in the common convention where the coupling is β(H0ihiAi)-\beta(H_0-\sum_i h_i A_i), one finds

  • hilogZ=βAi\partial_{h_i}\log Z = \beta\,\langle A_i\rangle,
  • hihjlogZ=β2Cov(Ai,Aj)\partial_{h_i}\partial_{h_j}\log Z = \beta^2\,\mathrm{Cov}(A_i,A_j), and higher derivatives give higher cumulants.

Physical interpretation

  • logZ\log Z packages equilibrium statistics: means, variances, and all higher connected moments are encoded in its derivatives.
  • The convexity of logZ\log Z in the sources (when it exists) implies nonnegativity of variances and a hierarchy of stability inequalities.
  • Connected correlation functions in many-body systems are precisely these cumulants in space-dependent sources; see for the field-theoretic/many-body formulation.

This “CGF viewpoint” also interfaces naturally with large-deviation asymptotics through exponential tilting and Laplace-type limits (compare ).