Problem With Previous Methods
Previous clustering algorithms performed less effectively over very large databases and did not adequately consider the case wherein a data-set was too large to fit in main memory. As a result, there was a lot of overhead maintaining high clustering quality while minimizing the cost of addition IO (input/output) operations. Furthermore, most of Birch's predecessors inspect all data points (or all currently existing clusters) equally for each 'clustering decision' and do not perform heuristic weighting based on the distance between these data points.
Read more about this topic: BIRCH (data Clustering)
Famous quotes containing the words problem, previous and/or methods:
“The problem with marriage is that it ends every night after making love, and it must be rebuilt every morning before breakfast.”
—Gabriel García Márquez (b. 1928)
“I believe that there was a great age, a great epoch when man did not make war: previous to 2000 B.C. Then the self had not really become aware of itself, it had not separated itself off, the spirit was not yet born, so there was no internal conflict, and hence no permanent external conflict.”
—D.H. (David Herbert)
“The ancient bitter opposition to improved methods [of production] on the ancient theory that it more than temporarily deprives men of employment ... has no place in the gospel of American progress.”
—Herbert Hoover (18741964)