Ground Truth - Statistics and Machine Learning

Statistics and Machine Learning

In machine learning, the term "ground truth" refers to the accuracy of the training set's classification for supervised learning techniques. This is used in statistical models to prove or disprove research hypotheses. The verb "ground truthing" refers to the process of gathering the proper objective data for this test. Compare with Gold standard (test).

Bayesian spam filtering is a common example of supervised learning. In this system, the algorithm is manually taught the differences between spam and non-spam. This depends on the ground truth of the messages used to train the algorithm; inaccuracies in that ground truth will correlate to inaccuracies in the resulting spam/non-spam verdicts.

Read more about this topic:  Ground Truth

Famous quotes containing the words statistics, machine and/or learning:

    July 4. Statistics show that we lose more fools on this day than in all the other days of the year put together. This proves, by the number left in stock, that one Fourth of July per year is now inadequate, the country has grown so.
    Mark Twain [Samuel Langhorne Clemens] (1835–1910)

    I brush my hair,
    waiting in the pain machine for my bones to get hard,
    for the soft, soft bones that were laid apart
    and were screwed together. They will knit.
    And the other corpse, the fractured heart,
    I feed it piecemeal, little chalice. I’m good to it.
    Anne Sexton (1928–1974)

    Isn’t it odd that networks accept billions of dollars from advertisers to teach people to use products and then proclaim that children aren’t learning about violence from their steady diet of it on television!
    Toni Liebman (20th century)