Semantic Analysis (machine Learning)

In machine learning, semantic analysis of a corpus is the task of building structures that approximate concepts from a large set of documents. It generally does not involve prior semantic understanding of the documents.

Latent semantic analysis (sometimes latent semantic indexing), is a class of techniques where documents are represented as vectors in term space. A prominent example is PLSI.

Latent Dirichlet allocation involves attributing document terms to topics.

n-grams and hidden Markov models work by representing the term stream as a markov chain where each term is derived from the few terms before it.


Famous quotes containing the words semantic and/or analysis:

    Watt’s need of semantic succour was at times so great that he would set to trying names on things, and on himself, almost as a woman hats.
    Samuel Beckett (1906–1989)

    Cubism had been an analysis of the object and an attempt to put it before us in its totality; both as analysis and as synthesis, it was a criticism of appearance. Surrealism transmuted the object, and suddenly a canvas became an apparition: a new figuration, a real transfiguration.
    Octavio Paz (b. 1914)