Feature Selection

In machine learning and statistics, feature selection, also known as variable selection, feature reduction, attribute selection or variable subset selection, is the process of selecting a subset of relevant features for use in model construction. The central assumption when using a feature selection technique is that the data contains many redundant or irrelevant features. Redundant features are those which provide no more information than the currently selected features, and irrelevant features provide no useful information in any context. Feature selection techniques are a subset of the more general field of feature extraction. Feature extraction creates new features from functions of the original features, whereas feature selection returns a subset of the features. Feature selection techniques are often used in domains where there are many features and comparatively few samples (or datapoints). The archetypal case is the use of feature selection in analysing DNA microarrays, where there are many thousands of features, and a few tens to hundreds of samples. Feature selection techniques provide three main benefits when constructing predictive models:

  • improved model interpretability,
  • shorter training times,
  • enhanced generalisation by reducing Overfitting.

Feature selection is also useful as part of the data analysis process, as shows which features are important for prediction, and how these features are related.

Read more about Feature Selection:  Introduction, Subset Selection, Optimality Criteria, Minimum-redundancy-maximum-relevance (mRMR) Feature Selection, Correlation Feature Selection, General L1-norm Support Vector Machine For Feature Selection, Regularized Trees, Embedded Methods Incorporating Feature Selection, Software For Feature Selection

Famous quotes containing the words feature and/or selection:

    When delicate and feeling souls are separated, there is not a feature in the sky, not a movement of the elements, not an aspiration of the breeze, but hints some cause for a lover’s apprehension.
    Richard Brinsley Sheridan (1751–1816)

    Judge Ginsburg’s selection should be a model—chosen on merit and not ideology, despite some naysaying, with little advance publicity. Her treatment could begin to overturn a terrible precedent: that is, that the most terrifying sentence among the accomplished in America has become, “Honey—the White House is on the phone.”
    Anna Quindlen (b. 1952)