Itemset Mining
Some problems in sequence mining lend themselves discovering frequent itemsets and the order they appear, for example, one is seeking rules of the form "if a {customer buys a car}, he or she is likely to {buy insurance} within 1 week", or in the context of stock prices, "if {Nokia up and Ericsson Up}, it is likely that {Motorolla up and Samsung up} within 2 days". Traditionally, itemset mining is used in marketing applications for discovering regularities between frequently co-occurring items in large transactions. For example, by analysing transactions of customer shopping baskets in a supermarket, one can produce a rule which reads "if a customer buys onions and potatoes together, he or she is likely to also buy hamburger meat in the same transaction".
A survey and taxonomy of the key algorithms for item set mining is presented in the paper Frequent pattern mining: current status and future directions.
The two common techniques that are applied to sequence databases for frequent itemset mining are the influential apriori algorithm and the more-recent FP-Growth technique.
Read more about this topic: Sequence Mining
Famous quotes containing the word mining:
“In strict science, all persons underlie the same condition of an infinite remoteness. Shall we fear to cool our love by mining for the metaphysical foundation of this elysian temple? Shall I not be as real as the things I see? If I am, I shall not fear to know them for what they are.”
—Ralph Waldo Emerson (18031882)