Jensen's inequality (lemma)
For a convex function φ, one has φ(E[X]) ≤ E[φ(X)] whenever the expectations exist.
Jensen’s inequality (lemma)
Definitions and notation
Statement
Let be a probability space and let be a random variable taking values in a convex set . Let be convex.
Assume is integrable and is integrable, i.e.
- and
- .
Then
If is concave, the inequality is reversed:
Key hypotheses and conclusions
Hypotheses
- is a random variable with values in a convex set .
- is convex on .
- The expectations and exist (finite).
Conclusions
- Convexity pulls outside expectations: .
- If is strictly convex (and mild regularity/“non-degenerate support” assumptions hold), equality forces to be almost surely constant.
Proof idea / significance
A standard proof uses supporting hyperplanes: convexity implies that at the point there exists a subgradient such that for all . Substitute and take expectations; the linear term vanishes because .
In statistical mechanics, Jensen’s inequality is a basic engine behind entropy/variational bounds, including Gibbs' inequality and exponential-moment bounds such as Chernoff bounds .