Association Rule Learning - Statistically Sound Associations

Statistically Sound Associations

One limitation of the standard approach to discovering associations is that by searching massive numbers of possible associations to look for collections of items that appear to be associated, there is a large risk of finding many spurious associations. These are collections of items that co-occur with unexpected frequency in the data, but only do so by chance. For example, suppose we are considering a collection of 10,000 items and looking for rules containing two items in the left-hand-side and 1 item in the right-hand-side. There are approximately 1,000,000,000,000 such rules. If we apply a statistical test for independence with a significance level of 0.05 it means there is only a 5% chance of accepting a rule if there is no association. If we assume there are no associations, we should nonetheless expect to find 50,000,000,000 rules. Statistically sound association discovery controls this risk, in most cases reducing the risk of finding any spurious associations to a user-specified significance level.

Read more about this topic:  Association Rule Learning

Famous quotes containing the words sound and/or associations:

    You can’t, in sound morals, condemn a man for taking care of his own integrity. It is his clear duty. And least of all can you condemn an artist pursuing, however humbly and imperfectly, a creative aim. In that interior world where his thought and his emotions go seeking for the experience of imagined adventures, there are no policemen, no law, no pressure of circumstance or dread of opinion to keep him within bounds. Who then is going to say Nay to his temptations if not his conscience?
    Joseph Conrad (1857–1924)

    There are many ways of discarding [books]. You can give them to friends,—or enemies,—or to associations or to poor Southern libraries. But the surest way is to lend them. Then they never come back to bother you.
    Carolyn Wells (1862?–1942)