Treebank - List of Treebanks Sorted By Language

List of Treebanks Sorted By Language

  • Arabic:
    • Penn Arabic Treebank
    • Prague Arabic Dependency Treebank (PADT)
    • Quranic Arabic Dependency Treebank (QADT)
  • Bulgarian: BulTreeBank (HPSG-based Syntactic Treebank)
  • Catalan: Cat3LB
  • Chinese: Penn Chinese Treebank, Sinica Treebank by CKIP, a tentative Chinese Dependency Treebank
  • Croatian: Croatian Dependency Treebank
  • Czech: Prague Dependency Treebank
  • Danish: Danish Dependency Treebank, Arboretum: A syntactic tree corpus of Danish
  • Dutch: CGN, Alpino
  • English:
    • Penn;
    • Prague English Dependency Treebank;
    • BLLIP WSJ corpus;
    • British Component of the International Corpus of English (ICE-GB);
    • Diachronic Corpus of Present-Day Spoken English (DCPSE);
    • Lancaster Parsed Corpus;
    • Susanne Corpus, Christine Corpus, Lucy Corpus;
    • Verbmobil treebanks: Tübingen Treebank of English / Spontaneous Speech (TüBa-E/S)
    • LinGO Redwoods;
    • Multi-Treebank;
    • The PARC 700 Dependency Bank;
    • CHILDES Brown Eve corpus with dependency annotation, see Sagae, K., MacWhinney, B., and Lavie, A. (2004) Adding syntactic annotations to transcripts of parent-child dialogs. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004). Lisbon, Portugal.
    • SMULTRON - Parallel Treebank EN-DE-SV
  • English-historical:
    • Penn Parsed Corpora of Historical English;
    • York-Toronto-Helsinki Parsed Corpus of Old English Prose (YCOE);
  • Estonian: Syntactically analyzed and disambiguated text corpus, see also Arborest
  • Farsi: see Persian.
  • Finnish: Turku Dependency Treebank (TDT)
  • French: Paris 7, L'Arboratoire
  • French-historical: Corpus MCVF;
  • German:
    • NEGRA;
    • TIGER;
    • Tübingen Treebank of Written German (TüBa-D/Z);
    • Tübingen Treebank of German / Spontaneous Speech (TüBa-D/S);
    • Tübingen Partially Parsed Corpus of Written German (TüPP-D/ZS)
    • SMULTRON - Parallel Treebank EN-DE-SV
  • Greek, Modern: Greek Dependency Treebank
  • Greek, Ancient:
    • Ancient Greek Dependency Treebank
    • PROIEL Corpus
  • Hebrew: Hebrew Treebank
  • Hindi: AnnCorra
  • Hungarian: Hungarian treebank
  • Icelandic: IcePaHC - Icelandic Parsed Historical Corpus
  • Italian:
    • TUT - Turin University Treebank
    • VIT - Venice Italian Treebank
    • ISST - Italian Syntactic-Semantic Treebank
    • SUT - Siena University Treebank
  • Japanese:
    • ATR Dependency corpus;
    • Kyoto Text Corpus;
    • Verbmobil treebanks: Tübingen Treebank of Japanese / Spontaneous Speech (TüBa-J/S)
  • Korean: Korean Treebank
  • Latin:
    • Latin Dependency Treebank;
    • Index Thomisticus Treebank.
    • PROIEL Corpus
  • Norwegian: INESS treebanking infrastructure
  • Persian:
    • PerTreeBank (HPSG-based Syntactic Treebank)
    • Persian Dependency Treebank (PerDT) (Dependency-based Syntactic Treebank)
  • Polish: A Treebank / Test Suite for Polish (HPSG treebank)
  • Portuguese: Projecto Floresta Sintá(c)tica
  • Portuguese-historical: Tycho Brahe corpus
  • Romanian: Romanian Dependency Treebank
  • Russian: SynTagRus Dependency Treebank incorporated in the Russian National Corpus
  • Slovene: Slovene Dependency Treebank
  • Spanish: Cast3LB, UAM Treebank of Spanish
  • Swedish: Talbanken05, Swedish Treebank, SMULTRON - Parallel Treebank EN-DE-SV
  • Thai: NAiST Thai Treebank
  • Turkish: METU-Sabanci Treebank
  • Urdu NU-FAST Treebank):
  • Vietnamese: Viet-Treebank

Read more about this topic:  Treebank

Famous quotes containing the words list of, list, sorted and/or language:

    I made a list of things I have
    to remember and a list
    of things I want to forget,
    but I see they are the same list.
    Linda Pastan (b. 1932)

    A man’s interest in a single bluebird is worth more than a complete but dry list of the fauna and flora of a town.
    Henry David Thoreau (1817–1862)

    It hurts the spirit, somehow, to read the word environments, when the plural means that there are so many alternatives there to be sorted through, as in a market, and voted on.
    Lewis Thomas (b. 1913)

    Language is filled
    with words for deprivation
    images so familiar
    it is hard to crack language open
    into that other country
    the country of being.
    Susan Griffin (b. 1943)