Type:: corpus
Language:: Czech, Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Polish, Portuguese, Russian, Spanish, Swedish, Turkish, Chinese, Hebrew, Japanese, Korean, and Thai
Description:: 28 speech databases containing broadband recordings from 550 adults and 50 children per language. Includes phonetically rich material. All recordings are orthographically transcribed. Speaker information covers gender, age, and accent. A pronunciation lexicon is included.
Rights:: Not specified

Publisher:: Centre for Applied Language Studies, University of Jyväskylä
Type:: corpus
Language:: English, Finnish, French, German, Italian, Russian, Spanish, and Swedish
Description:: The NC test results, background information, and speaking and writing performances in 9 foreign / second languages. A web-based database (HTML files).
Rights:: Not specified

Publisher:: Centro de Tecnologías y Aplicaciones del Lenguaje y del Habla (TALP)
Type:: corpus
Subject:: trilingual corpus
Language:: Catalan, English, and Spanish
Description:: Trilingual corpus (Catalan, Spanish, English) that contains large portions of Wikipedia (based on a 2006 dump) and has been automatically enriched with linguistic information. In its present version, it contains over 750 million words.
Rights:: Not specified

Publisher:: University of Leipzig
Type:: corpus
Language:: Afrikaans, Albanian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, German, Hungarian, Icelandic, Indonesian, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Malay (macrolanguage), Norwegian, Occitan (post 1500), Romanian, Russian, Slovak, Slovenian, Spanish, Sundanese, Swedish, Tagalog, Turkish, Vietnamese, and Welsh
Description:: Collected from newspaper texts, web crawling, etc.: words (+frequency), co-occurrences (+graph), left/right neighbours, example sentences.
Rights:: Not specified

Publisher:: University of York
Type:: corpus
Language:: English
Description:: A selection of poetic texts (71,490 words) from the Old English Section of the Helsinki Corpus of English Texts, syntactically and morphologically annotated.
Rights:: Not specified

Publisher:: University of York
Type:: corpus
Language:: English
Description:: A 1.5-million-word syntactically annotated corpus of Old English prose texts.
Rights:: Not specified
