Search Results
- Type: corpus
- Language: Arabic, Danish, Dutch, English, German, Modern Greek (1453-), Italian, Japanese, Korean, Portuguese, Russian, Spanish, and Turkish
- Description: Large set of subtitles available for download in multiple languages. Can be used as a parallel corpus.
- Rights: Not specified
- Publisher: Radboud University Nijmegen, Max Planck Institute for Psycholinguistics, Meertens Institute KNAW The Netherlands, and Babylon Centre for Studies of Multilingualism in the Multicultural Society
- Type: corpus
- Language: Arabic, Dutch, and Turkish
- Description: Audio recordings, transcripts,
- Rights: Not specified
- Publisher: Max Planck Institute for Psycholinguistics
- Type: corpus
- Language: Croatian, German, Russian, and Turkish
- Description: Language Acquisition corpus
- Rights: Not specified
- Publisher: University of Leipzig
- Type: corpus
- Language: Afrikaans, Albanian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, German, Hungarian, Icelandic, Indonesian, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Malay (macrolanguage), Norwegian, Occitan (post 1500), Romanian, Russian, Slovak, Slovenian, Spanish, Sundanese, Swedish, Tagalog, Turkish, Vietnamese, and Welsh
- Description: Collected from newspaper texts, web crawling, etc. Provides words (with frequencies), co-occurrences (with graph), left and right neighbours, and example sentences.
- Rights: Not specified