Sanov's theorem

Large deviations for empirical measures of an independent identically distributed sample.
Sanov’s theorem

Sanov’s theorem: Let (Xi)i1(X_i)_{i\ge 1} be an taking values in a Polish space EE, with common μ\mu. Define the empirical measure

Ln=1ni=1nδXi, L_n=\frac{1}{n}\sum_{i=1}^n \delta_{X_i},

viewed as a random element of P(E)\mathcal{P}(E) (Borel probability measures on EE) equipped with the topology of weak convergence. Then (Ln)(L_n) satisfies a on P(E)\mathcal{P}(E) with speed nn and

I(ν)=H(νμ)={Elog ⁣(dνdμ)dν,νμ,+,otherwise. I(\nu)=H(\nu\|\mu)= \begin{cases} \displaystyle \int_E \log\!\left(\frac{d\nu}{d\mu}\right)\,d\nu, & \nu\ll \mu,\\[6pt] +\infty, & \text{otherwise.} \end{cases}

Here H(νμ)H(\nu\|\mu) is . Combined with the , Sanov’s theorem yields many LDPs for functionals of empirical measures.