Search Results
Publisher:
University of Tartu
Format:
application/octet-stream
Type:
corpus
Language:
Estonian
Description:
Recordings of various Estonian dialects; 900 000 words, transcribed, of which 400 000 words are morphologically annotated
Rights:
Not specified
Publisher:
University of Tartu
Format:
text/plain
Type:
lexicalConceptualResource
Language:
Estonian
Description:
10 000 most frequent lemmas and 1000 most frequent word forms, based on 1 million words of journals and fiction
Rights:
Not specified
Publisher:
University of Tartu
Format:
application/tei+xml
Type:
corpus
Language:
Estonian
Description:
Collection of Estonian texts (divided into subcorpora); ca. 175 million words; TEI format
Rights:
Not specified
Type:
corpus
Language:
English and Estonian
Description:
Written EU legislation; 5 million words Estonian, 7.8 million words English; sentence-aligned
Rights:
Not specified
Publisher:
Tilde
Format:
application/octet-stream
Type:
lexicalConceptualResource
Language:
Estonian and Latvian
Description:
Estonian-Latvian dictionary based on the dictionary of K. Aben and supplemented with new lexical entries of modern lexica; ca. 26 000 lexical entries
Rights:
Not specified
Publisher:
Tilde and Eurotermbank consortium
Format:
application/octet-stream
Type:
lexicalConceptualResource
Language:
English, Estonian, French, German, Hungarian, Latvian, and Lithuanian
Description:
EuroTermBank is a single access point to European multilingual terminology resources. It contains more than 1.9 million terms across 25 languages.
Rights:
Not specified
Publisher:
Joint Research Centre of the EU
Type:
corpus
Language:
Bulgarian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Modern Greek (1453-), Hungarian, Italian, Latvian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, and Swedish
Description:
The largest parallel corpus; contains EU law (the Acquis Communautaire) in 22 languages.
Rights:
Not specified
Publisher:
Filosoft
Type:
toolService
Language:
Estonian
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
General written texts; 600 000 words; local tagset; manually disambiguated
Rights:
Not specified
Publisher:
University of Tartu
Type:
corpus
Subject:
speech corpus
Language:
Estonian
Description:
Studio recordings of spontaneous Estonian, phonetically segmented at the word, sound, and other linguistic levels. Current size is about 22 hours of speech (155 000 words). The online search engine lets you search word-level segments and returns matching 2-second sequences of sound and segmentation.
Rights:
Not specified