Exploratory Data Analysis - EDA Development

EDA Development

Tukey held that too much emphasis in statistics was placed on statistical hypothesis testing (confirmatory data analysis); more emphasis needed to be placed on using data to suggest hypotheses to test. In particular, he held that confusing the two types of analyses and employing them on the same set of data can lead to systematic bias owing to the issues inherent in testing hypotheses suggested by the data.

The objectives of EDA are to:

  • Suggest hypotheses about the causes of observed phenomena
  • Assess assumptions on which statistical inference will be based
  • Support the selection of appropriate statistical tools and techniques
  • Provide a basis for further data collection through surveys or experiments

Many EDA techniques have been adopted into data mining and are being taught to young students as a way to introduce them to statistical thinking.

Read more about this topic:  Exploratory Data Analysis

Famous quotes containing the word development:

    This was the Eastham famous of late years for its camp- meetings, held in a grove near by, to which thousands flock from all parts of the Bay. We conjectured that the reason for the perhaps unusual, if not unhealthful development of the religious sentiment here, was the fact that a large portion of the population are women whose husbands and sons are either abroad on the sea, or else drowned, and there is nobody but they and the ministers left behind.
    Henry David Thoreau (1817–1862)