Search

Search Constraints

Start Over You searched for: Subject corpus Remove constraint Subject: corpus Harvested from LINDAT/CLARIAH-CZ repository Remove constraint Harvested from: LINDAT/CLARIAH-CZ repository

Search Results

1. Balaxan Corpus of Kurmanji

3. CEHugeWebCorpus

4. CORMAP - Corpus for Moroccan Arabic Processing

6. Corpus of contemporary blogs

8. CsEnVi Pairwise Parallel Corpora

9. CWC2011

10. Czech Court Decisions Dataset

11. Czech Legal Text Treebank

12. Czech Malach Cross-lingual Speech Retrieval Test Collection

13. Czech Named Entity Corpus 1.0

14. Czech Named Entity Corpus 1.1

15. Czech Text Document Corpus v 2.0

16. Czech-English Parallel Corpus 1.0 (CzEng 1.0)

17. Diakorp v6: diachronic corpus of Czech

18. EngVallex - English Valency Lexicon 2.0

19. HetWiK: Heterogene Widerstandskulturen

20. Hindi Visual Genome 1.0

21. HindMonoCorp 0.5

22. HWC2023 –Hamburg.de Website Corpus 2023

23. Individual Textual Profiles of Hillary Clinton and Donald Trump

24. Indonesian web corpus (idWac)

26. KAMOKO-Digitalizer

27. KAMOKO: KAsseler MOrgenstern KOrpus

28. KAMOKO: KAsseler MOrgenstern KOrpus (2021-02-09)

29. Khresmoi Query Translation Test Data 1.0

30. Khresmoi Query Translation Test Data 2.0

31. Khresmoi Summary Translation Test Data 1.1

32. Khresmoi Summary Translation Test Data 2.0

33. KonText Web Demo

34. Large-Scale Colloquial Persian 0.5

35. LiFR-Law. Corpus of Paraphrased Czech Administrative Texts with Reading Comprehension for Readability Studies

36. LiFR-Law. Corpus of Paraphrased Czech Administrative Texts with Reading Comprehension for Readability Studies (2023-10-08)

37. Migrant Stories

38. NAFIS Arabic Stemming Gold Standard Corpus

39. OdiEnCorp 2.0

40. onion

41. OpenLegalData (2022 - Corpus)

42. ORAL2006: Corpus of informal spoken Czech

43. Prague Arabic Dependency Treebank 1.0

44. Prague Dependency Treebank 2.0 (PDT 2.0)

45. Prague Dependency Treebank of Spoken Language (PDTSL) 0.5

46. Preamble 1.0

48. SYN v4: large corpus of written Czech

49. SYN v9: large corpus of written Czech

50. SYN2006PUB: corpus of Czech newspapers

51. SYN2009PUB: corpus of Czech newspapers

52. SYN2013PUB: corpus of written Czech newspapers

53. Tamil Dependency Treebank v0.1

55. UFAL Parallel Corpus of North Levantine 1.0

56. Urdu Monolingual Corpus