Training Set - Use in Artificial Intelligence, Machine Learning, and Statistics

Use in Artificial Intelligence, Machine Learning, and Statistics

In artificial intelligence or machine learning, a training set consists of an input vector and an answer vector, and is used together with a supervised learning method to train a knowledge database (e.g. a neural net or a naive bayes classifier) used by an AI machine.

In statistical modeling, a training set is used to fit a model that can be used to predict a "response value" from one or more "predictors." The fitting can include both variable selection and parameter estimation. Statistical models used for prediction are often called regression models, of which linear regression and logistic regression are two examples.

In these fields, a major emphasis is placed on avoiding overfitting, so as to achieve the best possible performance on an independent test set that follows the same probability distribution as the training set.

Read more about this topic:  Training Set

Famous quotes containing the words artificial, machine and/or statistics:

    Merely external emancipation has made of the modern woman an artificial being.... Now, woman is confronted with the necessity of emancipating herself from emancipation, if she really desires to be free.
    Emma Goldman (1869–1940)

    The machine is impersonal, it takes the pride away from a piece of work, the individual merits and defects that go along with all work that is not done by a machine—which is to say, its little bit of humanity.
    Friedrich Nietzsche (1844–1900)

    O for a man who is a man, and, as my neighbor says, has a bone in his back which you cannot pass your hand through! Our statistics are at fault: the population has been returned too large. How many men are there to a square thousand miles in this country? Hardly one.
    Henry David Thoreau (1817–1862)