Skip to search
Skip to main content
Skip to first result
Search
Search Results
Type:
corpus
Language:
Estonian
Description:
written general; 600 000 words; local tagset; manually disambiguated
Rights:
Not specified
Publisher:
Universität Bamberg, World Language Documentation Centre
Format:
application/octet-stream
Type:
lexicalConceptualResource
Language:
Afrikaans , Arabic , Basque , Bulgarian , Catalan , Chinese , Czech , Danish , Dutch , English , Esperanto , Estonian , Finnish , French , Galician , Georgian , Modern Greek (1453-) , Hebrew , Hungarian , Icelandic , Indonesian , Interlingua (International Auxiliary Language Association) , Irish , Italian , Japanese , Khmer , Norwegian , Polish , Portuguese , Romanian , Russian , Serbian , Slovak , Spanish , Swedish , Turkish , Ukrainian , and Welsh
Rights:
GFDL or CC and http://www.omegawiki.org/Licensing
Publisher:
University of Tartu
Type:
corpus
Subject:
speech corpus
Language:
Estonian
Description:
Studio recordings of spontaneous Estonian segmented phonetically on word, sound, and other linguistic levels. Current size about 22 hours of speech, 155 000 words. Online search engine lets you search from word-level segments and returns matching 2 second sequences of sound and segmentation.
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
written general; 300 000 words; local tagset (POS, syntactic functions)
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
200 sentences, TIGER-XML
Rights:
Not specified
Publisher:
University of Tartu
Type:
toolService
Language:
Estonian
Rights:
Not specified
Publisher:
University of Tartu
Type:
lexicalConceptualResource
Language:
Estonian
Description:
Estonian Wordnet, 10000 synsets
Rights:
Not specified
Publisher:
University of Leipzig
Type:
corpus
Language:
Afrikaans , Albanian , Bulgarian , Catalan , Chinese , Croatian , Czech , Danish , Dutch , English , Esperanto , Estonian , Finnish , French , German , Hungarian , Icelandic , Indonesian , Italian , Japanese , Korean , Latin , Latvian , Lithuanian , Malay (macrolanguage) , Norwegian , Occitan (post 1500) , Romanian , Russian , Slovak , Slovenian , Spanish , Sundanese , Swedish , Tagalog , Turkish , Vietnamese , and Welsh
Description:
Collected from newspaper texts, webcrawling, etc.: words (+frequency), cooccurrences (+graph), left/right neighbours, example sentences
Rights:
Not specified