Skip to search
Skip to main content
Skip to first result
Search
Search Results
Type:
corpus
Language:
Estonian
Description:
149 sentences, VISL tagset
Rights:
Not specified
Publisher:
Institute of Cybernetics at Tallinn University of Technology
Type:
corpus
Language:
Estonian
Description:
The database consists of three sets: - Many Talker Set: 30 males, 30 females; each to read 50 numbers, 1-2 connected passages, 1 block of "filler" sentences, and 1 block of syllables. - Few Talker Set: 4 males, 4 females; each to read 50 numbers, 10 connected passages, 1 block of "filler" sentences, and 2-3 blocks of syllables. - Very Few Talker Set: 1 male, 1 female; each to read 2 blocks of 50 numbers, 40 connected passages, 4 blocks of "filler" sentences, and 9 blocks of syllables. Total amount ca 12 hours of speech.
Rights:
Not specified
Publisher:
University of Tartu
Type:
corpus
Language:
Estonian
Description:
Corpus of texts written fully or partly in Estonian, from 13.-19. century; 1,5 million words
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
written general; 95 mio words; TEI/SGML
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
spoken general; 1 mio words; local tagset
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
4.4 mio words; TEI/SGML
Rights:
Not specified
Publisher:
University of Tartu
Type:
corpus
Language:
Estonian
Description:
100000 words, word senses based on TEKsaurus (Estonian Wordnet)
Rights:
Not specified
Publisher:
University of Tartu
Format:
application/octet-stream
Type:
corpus
Language:
Estonian
Description:
Recordings of different Estonian dialects, 900000 words, transcribed and partly (400000 words) morphologically annotated
Rights:
Not specified
Publisher:
University of Tartu
Format:
application/tei+xml
Type:
corpus
Language:
Estonian
Description:
Collection of Estonian texts (divided into subcorpora); ca 175 million words; TEI
Rights:
Not specified
Type:
corpus
Language:
English and Estonian
Description:
written EU legislation; 5 mio words Est, 7.8 mio words Eng; Sentence-aligned
Rights:
Not specified