Semantic Analysis (machine Learning)

In machine learning, semantic analysis of a corpus is the task of building structures that approximate concepts from a large set of documents. It generally does not involve prior semantic understanding of the documents.

Latent semantic analysis (sometimes latent semantic indexing), is a class of techniques where documents are represented as vectors in term space. A prominent example is PLSI.

Latent Dirichlet allocation involves attributing document terms to topics.

n-grams and hidden Markov models work by representing the term stream as a markov chain where each term is derived from the few terms before it.


Famous quotes containing the words semantic and/or analysis:

    Watt’s need of semantic succour was at times so great that he would set to trying names on things, and on himself, almost as a woman hats.
    Samuel Beckett (1906–1989)

    A commodity appears at first sight an extremely obvious, trivial thing. But its analysis brings out that it is a very strange thing, abounding in metaphysical subtleties and theological niceties.
    Karl Marx (1818–1883)