Process
The Knowledge Discovery in Databases (KDD) process is commonly defined with the stages:
- (1) Selection
- (2) Pre-processing
- (3) Transformation
- (4) Data Mining
- (5) Interpretation/Evaluation.
Many variations on this theme exist, however, such as the Cross Industry Standard Process for Data Mining (CRISP-DM), which defines six phases:
- (1) Business Understanding
- (2) Data Understanding
- (3) Data Preparation
- (4) Modeling
- (5) Evaluation
- (6) Deployment
or a simplified process such as (1) pre-processing, (2) data mining, and (3) results validation.
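The simplified three-step process can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names `preprocess`, `mine`, and `validate`, the toy records, and the choice of a one-feature threshold as the "pattern" being mined are all hypothetical and not part of KDD, CRISP-DM, or any particular library.

```python
from statistics import mean

def preprocess(records):
    """Pre-processing: drop records that have a missing feature or label."""
    return [r for r in records if r[0] is not None and r[1] is not None]

def mine(train):
    """Data mining: learn a threshold as the midpoint of the two class means."""
    pos = [x for x, label in train if label]
    neg = [x for x, label in train if not label]
    return (mean(pos) + mean(neg)) / 2

def validate(threshold, test):
    """Results validation: accuracy of the threshold on held-out records."""
    hits = sum((x > threshold) == label for x, label in test)
    return hits / len(test)

# Toy (feature, label) records; one has a missing feature value.
raw = [(1.0, False), (2.0, False), (None, True), (8.0, True), (9.0, True),
       (1.5, False), (8.5, True)]

clean = preprocess(raw)               # step 1: pre-processing
train, test = clean[:4], clean[4:]    # hold out records for validation
threshold = mine(train)               # step 2: data mining
accuracy = validate(threshold, test)  # step 3: results validation
```

Holding back part of the cleaned data for the validation step is what lets the third stage check that the mined pattern generalizes beyond the records it was learned from.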
Polls conducted in 2002, 2004, and 2007 show that CRISP-DM is the leading methodology used by data miners. The only other data mining standard named in these polls was SEMMA, though three to four times as many respondents reported using CRISP-DM. Azevedo and Santos published a comparison of CRISP-DM and SEMMA in 2008.