Sequence Mining - Itemset Mining

Itemset Mining

Some problems in sequence mining lend themselves discovering frequent itemsets and the order they appear, for example, one is seeking rules of the form "if a {customer buys a car}, he or she is likely to {buy insurance} within 1 week", or in the context of stock prices, "if {Nokia up and Ericsson Up}, it is likely that {Motorolla up and Samsung up} within 2 days". Traditionally, itemset mining is used in marketing applications for discovering regularities between frequently co-occurring items in large transactions. For example, by analysing transactions of customer shopping baskets in a supermarket, one can produce a rule which reads "if a customer buys onions and potatoes together, he or she is likely to also buy hamburger meat in the same transaction".

A survey and taxonomy of the key algorithms for item set mining is presented in the paper Frequent pattern mining: current status and future directions.

The two common techniques that are applied to sequence databases for frequent itemset mining are the influential apriori algorithm and the more-recent FP-Growth technique.

Read more about this topic:  Sequence Mining

Famous quotes containing the word mining:

    For every nineteenth-century middle-class family that protected its wife and child within the family circle, there was an Irish or a German girl scrubbing floors in that home, a Welsh boy mining coal to keep the home-baked goodies warm, a black girl doing the family laundry, a black mother and child picking cotton to be made into clothes for the family, and a Jewish or an Italian daughter in a sweatshop making “ladies” dresses or artificial flowers for the family to purchase.
    Stephanie Coontz (20th century)