Skip to search
Skip to main content
Skip to first result
Search
Search Results
Type:
corpus
Language:
Bulgarian and English
Description:
Alignment – TMX, structural – XCES, morphosyntactic – XCES, MTE tagset
Rights:
Not specified
Publisher:
Center for Dutch Language and Speech, University of Antwerp
Type:
corpus
Language:
English
Description:
Bible. Word-alligned corpus
Rights:
Not specified
Publisher:
University of Tartu
Format:
application/octet-stream
Type:
corpus
Language:
Estonian
Description:
Recordings of different Estonian dialects, 900000 words, transcribed and partly (400000 words) morphologically annotated
Rights:
Not specified
Publisher:
University of Tartu
Format:
application/tei+xml
Type:
corpus
Language:
Estonian
Description:
Collection of Estonian texts (divided into subcorpora); ca 175 million words; TEI
Rights:
Not specified
Type:
corpus
Language:
English and Estonian
Description:
written EU legislation; 5 mio words Est, 7.8 mio words Eng; Sentence-aligned
Rights:
Not specified
Type:
corpus
Language:
Portuguese
Description:
Parallel corpus
Rights:
Not specified
Publisher:
Centre for Speech Technology Research, University of Edinburgh
Type:
corpus
Language:
English
Description:
Speech corpus comprising 4608 spoken sentences recorded for speech timing research. The complete archive, available for downloading, includes a structured list of the sentences, the speech recordings and the label files, plus full documentation.
Rights:
Not specified
Type:
corpus
Language:
Slovenian
Description:
reference corpus; 300 mil. words; XML / morphosyntactic tags
Rights:
Not specified
Publisher:
ATILF
Type:
corpus
Language:
French
Description:
mainly literature (17th to 20th century)
Rights:
Not specified
Publisher:
University of Glasgow
Type:
corpus
Language:
French
Description:
French emblem books (27 in total) of the 16th century, together with Latin versions where appropriate. Transcribed and facsimile versions, and extensive search functionality.
Rights:
Not specified