Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis

Probabilistic latent semantic analysis (PLSA), also known as probabilistic latent semantic indexing (PLSI, especially in information retrieval circles) is a statistical technique for the analysis of two-mode and co-occurrence data. In effect, one can derive a low dimensional representation of the observed variables in terms of their affinity to certain hidden variables, just as in latent semantic analysis. PLSA evolved from latent semantic analysis.

Compared to standard latent semantic analysis which stems from linear algebra and downsizes the occurrence tables (usually via a singular value decomposition), probabilistic latent semantic analysis is based on a mixture decomposition derived from a latent class model.

Read more about Probabilistic Latent Semantic Analysis:  Model, Application, Extensions, History

Famous quotes containing the words latent, semantic and/or analysis:

    Perhaps having built a barricade when you’re sixteen provides you with a sort of safety rail. If you’ve once taken part in building one, even inadvertently, doesn’t its usually latent image reappear like a warning signal whenever you’re tempted to join the police, or support any manifestation of Law and Order?
    Jean Genet (1910–1986)

    Watt’s need of semantic succour was at times so great that he would set to trying names on things, and on himself, almost as a woman hats.
    Samuel Beckett (1906–1989)

    Ask anyone committed to Marxist analysis how many angels on the head of a pin, and you will be asked in return to never mind the angels, tell me who controls the production of pins.
    Joan Didion (b. 1934)