Kernel Principal Component Analysis - Large Datasets

Large Datasets

In practice, a large data set leads to a large K, and storing K may become a problem. One way to deal with this is to perform clustering on your large dataset, and populate the kernel with the means of those clusters. Since even this method may yield a relatively large K, it is common to compute only the top P eigenvalues and eigenvectors of K.

Read more about this topic:  Kernel Principal Component Analysis

Famous quotes containing the word large:

    ... nothing seems completely to differentiate the poor but poverty. We find no adjectives to fit them, as a whole, only those of which Want is the mother. “Miserable” covers many; “shabby” most, and I am sadly aware that, in a large majority of minds, “disagreeable” includes them all.
    Albion Fellows Bacon (1865–1933)