请输入您要查询的百科知识:

 

词条 Glivenko–Cantelli theorem
释义

  1. Proof

  2. Empirical measures

  3. Glivenko–Cantelli class

  4. Examples

  5. See also

  6. References

  7. Further reading

{{Multiple issues|{{Refimprove|date=November 2008}}{{More footnotes|date=February 2010}}
}}

In the theory of probability, the Glivenko–Cantelli theorem, named after Valery Ivanovich Glivenko and Francesco Paolo Cantelli, determines the asymptotic behaviour of the empirical distribution function as the number of independent and identically distributed observations grows.[1] The uniform convergence of more general empirical measures becomes an important property of the Glivenko–Cantelli classes of functions or sets.[2] The Glivenko–Cantelli classes arise in Vapnik–Chervonenkis theory, with applications to machine learning. Applications can be found in econometrics making use of M-estimators.

Assume that are independent and identically-distributed random variables in with common cumulative distribution function . The empirical distribution function for is defined by

where is the indicator function of the set . For every (fixed) , is a sequence of random variables which converge to almost surely by the strong law of large numbers, that is, converges to pointwise. Glivenko and Cantelli strengthened this result by proving uniform convergence of to .

Theorem

almost surely.[3]

This theorem originates with Valery Glivenko,[4] and Francesco Cantelli,[5] in 1933.

Remarks
  • If is a stationary ergodic process, then converges almost surely to . The Glivenko–Cantelli theorem gives a stronger mode of convergence than this in the iid case.
  • An even stronger uniform convergence result for the empirical distribution function is available in the form of an extended type of law of the iterated logarithm.[6] See asymptotic properties of the empirical distribution function for this and related results.

Proof

For simplicity, consider a case of continuous random variable . Fix such that for . Now for all there exists such that . Note that

Therefore, almost surely

Since by strong law of large numbers, we can guarantee that for any integer we can find such that for all

,

which is the definition of almost sure convergence.

Empirical measures

One can generalize the empirical distribution function by replacing the set by an arbitrary set C from a class of sets to obtain an empirical measure indexed by sets

Where is the indicator function of each set .

Further generalization is the map induced by on measurable real-valued functions f, which is given by

Then it becomes an important property of these classes that the strong law of large numbers holds uniformly on or .

Glivenko–Cantelli class

Consider a set with a sigma algebra of Borel subsets A and a probability measure P. For a class of subsets,

and a class of functions

define random variables

where is the empirical measure, is the corresponding map, and

, assuming that it exists.

Definitions
  • A class is called a Glivenko–Cantelli class (or GC class) with respect to a probability measure P if any of the following equivalent statements is true.

1. almost surely as .

2. in probability as .

3. , as (convergence in mean).

The Glivenko–Cantelli classes of functions are defined similarly.

  • A class is called a universal Glivenko–Cantelli class if it is a GC class with respect to any probability measure P on (S,A).
  • A class is called uniformly Glivenko–Cantelli if the convergence occurs uniformly over all probability measures P on (S,A):

Theorem (Vapnik and Chervonenkis, 1968)[7]

A class of sets is uniformly GC if and only if it is a Vapnik–Chervonenkis class.

Examples

  • Let and . The classical Glivenko–Cantelli theorem implies that this class is a universal GC class. Furthermore, by Kolmogorov's theorem,

, that is is uniformly Glivenko–Cantelli class.

  • Let P be a nonatomic probability measure on S and be a class of all finite subsets in S. Because , , , we have that and so is not a GC class with respect to P.

See also

  • Donsker's theorem
  • Dvoretzky–Kiefer–Wolfowitz inequality – strengthens Glivenko–Cantelli theorem by quantifying the rate of convergence.

References

1. ^{{Cite journal|author = Howard G.Tucker| year = 1959 | title = A Generalization of the Glivenko-Cantelli Theorem | journal = The Annals of Mathematical Statistics | volume = 30| issue = 3 | doi = 10.1214/aoms/1177706212 | pages=828–830| jstor = 2237422 }}
2. ^{{cite book |last=van der Vaart |first=A. W. |year=1998 |title=Asymptotic Statistics |location= |publisher=Cambridge University Press |isbn=978-0-521-78450-4 |page=279 }}
3. ^{{cite book |last=van der Vaart |first=A. W. |year=1998 |title=Asymptotic Statistics |location= |publisher=Cambridge University Press |isbn=978-0-521-78450-4 |page=265 }}
4. ^Glivenko, V. (1933). Sulla determinazione empirica delle leggi di probabilità.Giorn. Ist. Ital. Attuari 4, 92-99.
5. ^Cantelli, F. P. (1933). Sulla determinazione empirica delle leggi di probabilità.Giorn. Ist. Ital. Attuari 4, 421-424.
6. ^{{cite book |last=van der Vaart |first=A. W. |year=1998 |title=Asymptotic Statistics |location= |publisher=Cambridge University Press |isbn=978-0-521-78450-4 |page=268 }}
7. ^{{cite journal |last=Vapnik |first=V. N. |last2=Chervonenkis |first2=A. Ya |year=1971 |title=On uniform convergence of the frequencies of events to their probabilities |journal=Theor. Prob. Appl. |volume=16 |issue=2 |pages=264–280 |doi=10.1137/1116025 }}

Further reading

  • Dudley, R. M. (1999). Uniform Central Limit Theorems, Cambridge University Press. {{ISBN|0-521-46102-2}}.
  • Shorack, G.R., Wellner J.A. (1986) Empirical Processes with Applications to Statistics, Wiley. {{ISBN|0-471-86725-X}}.
  • van der Vaart, A.W. and Wellner, J.A. (1996) Weak Convergence and Empirical Processes, Springer. {{ISBN|0-387-94640-3}}.
  • Aad W. van der Vaart, Jon A. Wellner (1996) Glivenko-Cantelli Theorems, Springer.
  • Aad W. van der Vaart, Jon A. Wellner (2000) Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes, Springer
{{DEFAULTSORT:Glivenko-Cantelli Theorem}}

4 : Empirical process|Asymptotic theory (statistics)|Probability theorems|Statistical theorems

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/11 0:34:49