Large Datasets
In practice, a large data set leads to a large K, and storing K may become a problem. One way to deal with this is to perform clustering on your large dataset, and populate the kernel with the means of those clusters. Since even this method may yield a relatively large K, it is common to compute only the top P eigenvalues and eigenvectors of K.
Read more about this topic: Kernel Principal Component Analysis
Famous quotes containing the word large:
“Unionism seldom, if ever, uses such power as it has to insure better work; almost always it devotes a large part of that power to safeguarding bad work.”
—H.L. (Henry Lewis)