Sliding Window Based Part-of-speech Tagging

Sliding window based part-of-speech tagging is a method for assigning part-of-speech tags to the words of a text.

A high percentage of words in a natural language are ambiguous out of context: they can be assigned more than one part of speech. The share of such ambiguous words is typically around 30%, although it varies greatly between languages. Resolving this ambiguity matters in many areas of natural language processing. In machine translation, for example, changing the part of speech of a word can dramatically change its translation.

Sliding window based part-of-speech taggers are programs that assign a single part of speech to a given lexical form of a word by looking at a fixed-size "window" of words around the word to be disambiguated.
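
The idea of a fixed-size window can be illustrated with a short Python sketch. It is not taken from any particular tagger: the function name `context_windows`, the `left` and `right` sizes, and the `<pad>` boundary marker are all assumptions made for illustration.

```python
from typing import List, Tuple

Window = Tuple[Tuple[str, ...], str, Tuple[str, ...]]


def context_windows(tokens: List[str], left: int = 1, right: int = 1,
                    pad: str = "<pad>") -> List[Window]:
    """Return (left context, word, right context) for every token.

    `left` and `right` are the number of neighbouring words kept on each
    side; `pad` is a hypothetical marker used past the sentence boundaries.
    """
    padded = [pad] * left + list(tokens) + [pad] * right
    windows = []
    for i, token in enumerate(tokens):
        centre = i + left  # position of the token inside the padded list
        left_ctx = tuple(padded[centre - left:centre])
        right_ctx = tuple(padded[centre + 1:centre + 1 + right])
        windows.append((left_ctx, token, right_ctx))
    return windows


for window in context_windows(["the", "can", "rusts"]):
    print(window)
```

Each tuple holds everything the tagger is allowed to look at when it decides the tag of the middle word; nothing outside the window influences the decision.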

The two main advantages of this approach are:

  • It is possible to train the tagger automatically, removing the need to tag a corpus manually.
  • The tagger can be implemented as a finite-state automaton (a Mealy machine).
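
One way to picture why a finite-state implementation is plausible is a tagger whose decisions come entirely from a fixed lookup table keyed by the window contents. The Python sketch below is a toy illustration under stated assumptions, not the method of any specific system: the lexicon, the tag names, the one-word left context, the decision table, and the fallback rule are all hypothetical, and the table is written by hand here rather than learned from an untagged corpus.

```python
from typing import Dict, FrozenSet, List, Tuple

TagSet = FrozenSet[str]

# Hypothetical toy lexicon: each word form maps to its possible tags.
LEXICON: Dict[str, TagSet] = {
    "the": frozenset({"DET"}),
    "they": frozenset({"PRON"}),
    "can": frozenset({"NOUN", "VERB"}),
    "fish": frozenset({"NOUN", "VERB"}),
    "rusts": frozenset({"VERB"}),
}

# Hypothetical marker for "no word to the left".
BOUNDARY: TagSet = frozenset({"<s>"})

# Hypothetical decision table: (tags of left neighbour, tags of word) -> tag.
# In a trained tagger these entries would be estimated from data.
TABLE: Dict[Tuple[TagSet, TagSet], str] = {
    (frozenset({"DET"}), frozenset({"NOUN", "VERB"})): "NOUN",
    (frozenset({"PRON"}), frozenset({"NOUN", "VERB"})): "VERB",
}


def tag(words: List[str]) -> List[str]:
    """Tag each word using only its own tag set and its left neighbour's."""
    tag_sets = [LEXICON[w] for w in words]  # raises KeyError on unknown words
    result = []
    for i, current in enumerate(tag_sets):
        if len(current) == 1:
            # Unambiguous word: its only tag is the answer.
            result.append(next(iter(current)))
            continue
        left = tag_sets[i - 1] if i > 0 else BOUNDARY
        chosen = TABLE.get((left, current))
        # Assumed fallback: alphabetically first tag when the table is silent.
        result.append(chosen if chosen is not None else sorted(current)[0])
    return result


print(tag(["the", "can", "rusts"]))
print(tag(["they", "fish"]))
```

Because the output for each word depends only on a bounded amount of context and a finite table, the same behaviour could in principle be compiled into a machine that reads the text once and emits one tag per word.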
