Data Set - Classic Data Sets

Classic Data Sets

Several classic data sets have been used extensively in the statistical literature:

  • Iris flower data set - multivariate data set introduced by Ronald Fisher (1936).
  • Categorical data analysis - Data sets used in the book, An Introduction to Categorical Data Analysis, by Agresti are provided on-line by StatLib.
  • Robust statistics - Data sets used in Robust Regression and Outlier Detection (Rousseeuw and Leroy, 1986). Provided on-line at the University of Cologne.
  • Time series - Data used in Chatfield's book, The Analysis of Time Series, are provided on-line by StatLib.
  • Extreme values - Data used in the book, An Introduction to the Statistical Modeling of Extreme Values are provided on-line by Stuart Coles, the book's author.
  • Bayesian Data Analysis - Data used in the book are provided on-line by Andrew Gelman, one of the book's authors.
  • The Bupa liver data, used in several papers in the machine learning (data mining) literature.
  • Anscombe's quartet Small dataset illustrating the importance of graphing the data to avoid statistical fallacies

Read more about this topic:  Data Set

Famous quotes containing the words classic, data and/or sets:

    On desperate seas long wont to roam,
    The hyacinth hair, thy classic face,
    Thy Naiad airs have brought me home
    To the glory that was Greece,
    And the grandeur that was Rome.
    Edgar Allan Poe (1809–1849)

    To write it, it took three months; to conceive it three minutes; to collect the data in it—all my life.
    F. Scott Fitzgerald (1896–1940)

    It is time to be old,
    To take in sail:—
    The god of bounds,
    Who sets to seas a shore,
    Came to me in his fatal rounds,
    And said: ‘No more!’
    Ralph Waldo Emerson (1803–1882)