Rule of Succession - Generalization To Any Number of Possibilities

Generalization To Any Number of Possibilities

This section gives a heuristic derivation to that given in Probability Theory: The Logic of Science.

The rule of succession has many different intuitive interpretations, and depending on which intuition one uses, the generalisation may be different. Thus, the way to proceed from here is very carefully, and to re-derive the results from first principles, rather than to introduce an intuitively sensible generalisation. The full derivation can be found in Jaynes' book, but it does admit an easier to understand alternative derivation, once the solution is known. Another point to emphasise is that the prior state of knowledge described by the rule of succession is given as an enumeration of the possibilities, with the additional information that it is possible for each category to be observed. This can be equivalently stated as observing each category once prior to gathering the data. To denote that this is the knowledge being used, an Im will be put as part of the conditions in the probability assignments.

The rule of succession comes from setting a binomial likelihood, and a uniform prior distribution. Thus a straight forward generalisation is just the multivariate extensions of these two distributions: 1)Setting a uniform prior over the initial m categories, and 2) using the multinomial distribution as the likelihood function (which is the multivariate generalisation of the binomial distribution). It can be shown that the uniform distribution is a special case of the Dirichlet distribution with all of its parameters equal to 1 (just as the uniform is Beta(1,1) in the binary case). The Dirichlet distribution is the conjugate prior for the multinomial distribution, which means that the posterior distribution will also be a Dirichlet distribution with different parameters. Let pi denote the probability that category i will be observed, and let ni denote the number of times category i (i = 1, ..., m) actually was observed. Then the joint posterior distribution of the probabilities p1, ..., pm is given by;


f(p_1,\ldots,p_m \mid n_1,\ldots,n_m,I) = \begin{cases} { \displaystyle \frac{\Gamma\left( \sum_{i=1}^m (n_i+1) \right)}{\prod_{i=1}^m \Gamma(n_i+1)}
p_1^{n_1}\cdots p_m^{n_m}
}, \quad & \sum_{i=1}^m p_i=1 \\ \\
0 & \text{otherwise.} \end{cases}

To get the generalised rule of succession, note that the probability of observing category i on the next observation, conditional on the pi is just pi, we simply require its expectation. Letting Ai denote the event that the next observation is in category i (i = 1, ..., m), and let n = n1 + ... + nm be the total number of observations made. The result, using the properties of the dirichlet distribution is:

This solution reduces to the probability which would be assigned using the principle of indifference before any observations made (i.e. n = 0), consistent with the original rule of succession. It also contains the rule of succession as a special case, when m = 2, as a generalisation should.

Because the propositions or events Ai are mutually exclusive, it is possible to collapse the m categories into 2. Simply add up the Ai probabilities which correspond to "success" to get the probability of success. Supposing that this aggregates c categories as "success" and m-c categories as "failure". Let s denote the sum of the relevant ni values which have been termed "success". The probability of "success" at the next trial is then:

which is different from the original rule of succession. But note that the original rule of succession is based on I2, whereas the generalisation is based on Im. This means that the information contained in Im is different to that contained in I2. This indicates that mere knowledge of more than two outcomes which we are sure are possible is relevant information when collapsing these categories down into just two. This illustrates the subtlety in describing the prior information, and why it is important to specify which prior information one is using.

Read more about this topic:  Rule Of Succession

Famous quotes containing the word number:

    The world is so full of a number of things,
    I’m sure we should all be as happy as kings.
    Robert Louis Stevenson (1850–1894)