General
Decision tree learning is a method commonly used in data mining. The goal is to create a model that predicts the value of a target variable based on several input variables. An example is shown on the right. Each interior node corresponds to one of the input variables; there are edges to children for each of the possible values of that input variable. Each leaf represents a value of the target variable given the values of the input variables represented by the path from the root to the leaf.
A tree can be "learned" by splitting the source set into subsets based on an attribute value test. This process is repeated on each derived subset in a recursive manner called recursive partitioning. The recursion is completed when the subset at a node has all the same value of the target variable, or when splitting no longer adds value to the predictions. This process of top-down induction of decision trees (TDIDT) is an example of a greedy algorithm, and it is by far the most common strategy for learning decision trees from data, but it is not the only strategy. In fact, some approaches have been developed recently allowing tree induction to be performed in a bottom-up fashion.
In data mining, decision trees can be described also as the combination of mathematical and computational techniques to aid the description, categorisation and generalisation of a given set of data.
Data comes in records of the form:
The dependent variable, Y, is the target variable that we are trying to understand, classify or generalise. The vector x is composed of the input variables, x1, x2, x3 etc., that are used for that task.
Read more about this topic: Decision Tree Learning
Famous quotes containing the word general:
“There is a mortifying experience in particular, which does not fail to wreak itself also in the general history; I mean the foolish face of praise, the forced smile which we put on in company where we do not feel at ease, in answer to conversation which does not interest us. The muscles, not spontaneously moved but moved, by a low usurping wilfulness, grow tight about the outline of the face, with the most disagreeable sensation.”
—Ralph Waldo Emerson (18031882)
“The man who would change the name of Arkansas is the original, iron-jawed, brass-mouthed, copper-bellied corpse-maker from the wilds of the Ozarks! He is the man they call Sudden Death and General Desolation! Sired by a hurricane, damd by an earthquake, half-brother to the cholera, nearly related to the smallpox on his mothers side!”
—Administration in the State of Arka, U.S. public relief program (1935-1943)
“Some people are under the impression that all that is required to make a good fisherman is the ability to tell lies easily and without blushing; but this is a mistake. Mere bald fabrication is useless; the veriest tyro can manage that. It is in the circumstantial detail, the embellishing touches of probability, the general air of scrupulousalmost of pedanticveracity, that the experienced angler is seen.”
—Jerome K. Jerome (18591927)