Error Metrics
There are two error measurements in stemming algorithms, overstemming and understemming. Overstemming is an error where two separate inflected words are stemmed to the same root, but should not have been—a false positive. Understemming is an error where two separate inflected words should be stemmed to the same root, but are not—a false negative. Stemming algorithms attempt to minimize each type of error, although reducing one type can lead to increasing the other.
Read more about this topic: Stemming Algorithm
Famous quotes containing the word error:
“I have often been reproached with the aridity of my genius; a deficiency of imagination has been imputed to me as a crime; and the Pyrrhonism of my opinions has at all times rendered me notorious. Indeed, a strong relish for physical philosophy has, I fear, tinctured my mind with a very common error of this ageI mean the habit of referring occurrences, even the least susceptible of such reference, to the principles of that science.”
—Edgar Allan Poe (18091849)