Adjusted Mutual Information - Mutual Information of Two Partitions

Mutual Information of Two Partitions

Given a set S of N elements, consider two partitions of S, namely with R clusters, and with C clusters. It is presumed here that the partitions are so-called hard clusters, the partitions are pairwise disjoint:

for all, and complete:

The mutual information of cluster overlap between U and V can be summarized in the form of an RxC contingency table, where denotes the number of objects that are common to clusters and . That is,

Suppose an object is picked at random from S; the probability that the object falls into cluster is:

The entropy associated with the partitioning U is:

H(U) is non-negative and takes the value 0 only when there is no uncertainty determining an object's cluster membership, i.e., when there is only one cluster. Similarly, the entropy of the clustering V can be calculated as:

where . The mutual information (MI) between two partitions:

where P(i,j) denotes the probability that a point belongs to both the cluster in U and cluster in V:

MI is a non-negative quantity upper bounded by the entropies H(U) and H(V). It quantifies the information shared by the two clusterings and thus can be employed as a clustering similarity measure.

Read more about this topic:  Adjusted Mutual Information

Famous quotes containing the words mutual, information and/or partitions:

    Louise Bryant: I’m sorry if you don’t believe in mutual independence and free love and respect.
    Eugene O’Neill: Don’t give me a lot of parlor socialism that you learned in the village. If you were mine, I wouldn’t share you with anybody or anything. It would be just you and me. You’d be at the center of it all. You know it would feel a lot more like love than being left alone with your work.
    Warren Beatty (b. 1937)

    Phenomenal nature shadows him wherever he goes. Clouds in the staring sky transmit to one another, by means of slow signs, incredibly detailed information regarding him. His inmost thoughts are discussed at nightfall, in manual alphabet, by darkly gesticulating trees. Pebbles or stains or sunflecks form patterns representing in some awful way messages which he must intercept. Everything is a cipher and of everything he is the theme.
    Vladimir Nabokov (1899–1977)

    Walls have cracks and partitions ears.
    Chinese proverb.