Stemming Algorithm - Error Metrics

Error Metrics

There are two error measurements in stemming algorithms, overstemming and understemming. Overstemming is an error where two separate inflected words are stemmed to the same root, but should not have been—a false positive. Understemming is an error where two separate inflected words should be stemmed to the same root, but are not—a false negative. Stemming algorithms attempt to minimize each type of error, although reducing one type can lead to increasing the other.

Read more about this topic:  Stemming Algorithm

Famous quotes containing the word error:

    Mistakes are made on two counts: an argument is either based on error or incorrectly developed.
    Thomas Aquinas (c. 1225–1274)