Cramér's theorem is a fundamental result in the theory of large deviations, a subdiscipline of probability theory. It determines the rate function of a series of iid random variables. A weak version of this result was first shown by Harald Cramér in 1938.
Statement
The logarithmic moment generating function (which is the cumulant-generating function) of a random variable is defined as:
-
Λ
(
t
)
=
log
E
[
exp
(
t
X
1
)
]
.
{\displaystyle \Lambda (t)=\log \operatorname {E} [\exp(tX_{1})].}
Let
X
1
,
X
2
,
…
{\displaystyle X_{1},X_{2},\dots }
be a sequence of iid real random variables with finite logarithmic moment generating function, i.e.
Λ
(
t
)
<
∞
{\displaystyle \Lambda (t)<\infty }
for all
t
∈
R
{\displaystyle t\in \mathbb {R} }
.
Then the Legendre transform of
Λ
{\displaystyle \Lambda }
:
-
Λ
∗
(
x
)
:=
sup
t
∈
R
(
t
x
−
Λ
(
t
)
)
{\displaystyle \Lambda ^{*}(x):=\sup _{t\in \mathbb {R} }\left(tx-\Lambda (t)\right)}
satisfies,
-
lim
n
→
∞
1
n
log
(
P
(
∑
i
=
1
n
X
i
≥
n
x
)
)
=
−
Λ
∗
(
x
)
{\displaystyle \lim _{n\to \infty }{\frac {1}{n}}\log \left(P\left(\sum _{i=1}^{n}X_{i}\geq nx\right)\right)=-\Lambda ^{*}(x)}
for all
x
>
E
[
X
1
]
.
{\displaystyle x>\operatorname {E} [X_{1}].}
[1]: 508
In the terminology of the theory of large deviations the result can be reformulated as follows:
If
X
1
,
X
2
,
…
{\displaystyle X_{1},X_{2},\dots }
is a series of iid random variables, then the distributions
(
L
(
1
n
∑
i
=
1
n
X
i
)
)
n
∈
N
{\displaystyle \left({\mathcal {L}}({\tfrac {1}{n}}\sum _{i=1}^{n}X_{i})\right)_{n\in \mathbb {N} }}
satisfy a large deviation principle with rate function
Λ
∗
{\displaystyle \Lambda ^{*}}
, where
L
(
X
)
{\displaystyle {\mathcal {L}}(X)}
denotes the distribution of the random variable
X
{\displaystyle X}
.[1]: 514
References
- Klenke, Achim (2008). Probability Theory. Berlin: Springer. doi:10.1007/978-1-84800-048-3. ISBN 978-1-84800-047-6.
- "Cramér theorem", Encyclopedia of Mathematics, EMS Press, 2001 [1994]