List of Natural Language Processing Toolkits - Structures Used in Natural Language Processing

Structures Used in Natural Language Processing

  • Corpus – body of data, optionally tagged (for example, through part-of-speech tagging), providing real world samples for analysis and comparison.
    • Text corpus – large and structured set of texts, nowadays usually electronically stored and processed. They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific subject (or domain).
    • Speech corpus – database of speech audio files and text transcriptions. In Speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognition engine). In Linguistics, spoken corpora are used to do research into phonetic, conversation analysis, dialectology and other fields.

Read more about this topic:  List Of Natural Language Processing Toolkits

Famous quotes containing the words structures, natural and/or language:

    The American who has been confined, in his own country, to the sight of buildings designed after foreign models, is surprised on entering York Minster or St. Peter’s at Rome, by the feeling that these structures are imitations also,—faint copies of an invisible archetype.
    Ralph Waldo Emerson (1803–1882)

    Poetry is the most direct and simple means of expressing oneself in words: the most primitive nations have poetry, but only quite well developed civilizations can produce good prose. So don’t think of poetry as a perverse and unnatural way of distorting ordinary prose statements: prose is a much less natural way of speaking than poetry is. If you listen to small children, and to the amount of chanting and singsong in their speech, you’ll see what I mean.
    Northrop Frye (1912–1991)

    While you are divided from us by geographical lines, which are imaginary, and by a language which is not the same, you have not come to an alien people or land. In the realm of the heart, in the domain of the mind, there are no geographical lines dividing the nations.
    Anna Howard Shaw (1847–1919)