Let be a sequence of independent identically distributed random variables with values in the state space S with probability measure P.
Definition
- The empirical measure is defined for measurable subsets of S and given by
- where is the indicator function and is the Dirac measure.
For a fixed measurable set A, nPn(A) is a binomial random variable with mean nP(A) and variance nP(A)(1 − P(A)). In particular, is an unbiased estimator of P(A).
Definition
- is the empirical measure indexed by, a collection of measurable subsets of S.
To generalize this notion further, observe that the empirical measure Pn maps measurable functions to their empirical mean,
In particular, the empirical measure of A is simply the empirical mean of the indicator function, .
For a fixed measurable function f, is a random variable with mean and variance .
By the strong law of large numbers, converges to P(A) almost surely for fixed A. Similarly converges to almost surely for a fixed measurable function f. The problem of uniform convergence of to P was open until Vapnik and Chervonenkis solved it in 1968.
If the class (or ) is Glivenko–Cantelli with respect to P then converges to P uniformly over (or ). In other words, with probability 1 we have
Read more about this topic: Empirical Measure
Famous quotes containing the word definition:
“The definition of good prose is proper words in their proper places; of good verse, the most proper words in their proper places. The propriety is in either case relative. The words in prose ought to express the intended meaning, and no more; if they attract attention to themselves, it is, in general, a fault.”
—Samuel Taylor Coleridge (17721834)
“One definition of man is an intelligence served by organs.”
—Ralph Waldo Emerson (18031882)
“... we all know the wags definition of a philanthropist: a man whose charity increases directly as the square of the distance.”
—George Eliot [Mary Ann (or Marian)