Search Constraints

You searched for: Rights: PUB; Date: Unknown

Search Results

1. A Gold Standard Word Alignment for English-Swedish (2015-10-12)

2. A Human-Annotated Dataset for Language Modeling and Named Entity Recognition in Medieval Documents

3. A Human-Annotated Dataset for Language Modeling and Named Entity Recognition in Medieval Documents (2023-01-05)

4. A Human-Annotated Dataset of Scanned Images and OCR Texts from Medieval Documents

5. A Human-Annotated Dataset of Scanned Images and OCR Texts from Medieval Documents: Supplementary Materials

6. A morphological layer for the German part of the SMULTRON corpus

7. A Small Dataset for English-to-Czech Speech Translation in the Travel Domain

8. A Speech Test Set of Practice Business Presentations with Additional Relevant Texts

9. Additional German-Czech reference translations of the WMT'11 test set

10. Addressed Arabic Phonetic Rules

11. AdjDeriNet: Words Derived from Adjectives in Czech

12. Air Traffic Control Communication

14. AKCES 2

15. AKCES 2 ver. 2

16. AKCES 3

17. AKCES 4

18. AKCES 5 (CzeSL-SGT)

19. AKCES 5 (CzeSL-SGT) Release 2

20. AKCES-GEC Grammatical Error Correction Dataset for Czech

21. AlbMoRe Movie Reviews in Albanian

22. AlbNER Named Entity Recognition in Albanian

23. AlbNews Albanian Topic Modeling

24. Alex Context NLG Dataset

25. Annotate

26. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0)

27. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)

28. Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)

29. Annotated Corpus of Czech Case Law for Segmentation Tasks

30. APE Shared Task WMT17: Human Post-edits Test Data DE-EN

31. APE Shared Task WMT17: Human Post-edits Test Data EN-DE

32. APE Shared Task WMT18: Human Post-edits and References Test Data EN-DE PBSMT

33. Arabic Phonetic Rules

34. Arabic Triliteral roots Lexicon

35. Arabic WordNet ontology

36. Artificial Treebank with Ellipsis

37. Aspect-Term Annotated Customer Reviews in Czech

38. ATCC: Pronunciation lexicon and n-gram counts for ASR module

39. AudioPSP 24.01: Audio recordings of proceedings of the Chamber of Deputies of the Parliament of the Czech Republic

40. Automatic Paraphrases of Czech Reference Sentences for WMT11, 13 and 14

41. Automatically generated spelling correction corpus for Czech (Czech-SEC-AG)

42. Balaxan Corpus of Kurmanji

43. Bosworth-Toller’s Anglo-Saxon Dictionary online

44. Broken plural list

46. C4Corpus (CC BY-NC part)

47. C4Corpus (CC BY-NC-ND part)

48. C4Corpus (CC BY-NC-SA part)

49. C4Corpus (CC BY-ND part)

50. C4Corpus (CC BY-SA part)

51. C4Corpus (CC-BY part)

52. C4Corpus (publicdomain part)

53. CALEM (Comprehensive Arabic LEMmas)

54. CEHugeWebCorpus

55. Česílko

57. CoCzeFLA Chroma 2022.07

58. CoCzeFLA Chroma 2023.04

59. CoCzeFLA Chroma 2023.07

60. CoNLL 2009 Shared Task - Czech Data

61. CoNLL 2009 Shared Task Czech Trial Set

62. CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data

63. CoNLL 2017 Shared Task - Automatically Annotated Raw Texts and Word Embeddings

64. CoNLL 2017 Shared Task - UDPipe Baseline Models and Supplementary Materials

65. CoNLL 2017 Shared Task System Outputs

66. CoNLL 2018 Shared Task - UDPipe Baseline Models and Supplementary Materials

67. CoNLL 2018 Shared Task System Outputs

68. Continuous Rating; Supplementary materials

69. Coreference in Universal Dependencies 0.1 (CorefUD 0.1)

70. Coreference in Universal Dependencies 0.2 (CorefUD 0.2)

71. Coreference in Universal Dependencies 1.0 (CorefUD 1.0)

72. Coreference in Universal Dependencies 1.1 (CorefUD 1.1)

73. Coreference in Universal Dependencies 1.2 (CorefUD 1.2)

74. CORMAP - Corpus for Moroccan Arabic Processing

75. CorPipe 23 multilingual CorefUD 1.1 model (corpipe23-corefud1.1-231206)

76. Corpus for training and evaluating diacritics restoration systems

77. Corpus of precisely articulated Czech speech

78. COSTRA 1.0: A Dataset of Complex Sentence Transformations

79. COSTRA 1.1: A Dataset of Complex Sentence Transformations and Comparisons

80. Covid-19 Thesaurus

81. CsEnVi Pairwise Parallel Corpora

82. CUBBITT Translation Models (en-cs) (v1.0)

83. CUBBITT Translation Models (en-fr) (v1.0)

84. CUBBITT Translation Models (en-pl) (v1.0)

85. CWC2011

86. Czech and English abstracts of ÚFAL papers

87. Czech and English abstracts of ÚFAL papers (2022-11-11)

88. Czech and English Reflective Dataset (CEReD)

89. Czech Court Decisions Corpus (CzCDC 1.0)

90. Czech Grammar Agreement Dataset for Evaluation of Language Models

91. Czech HS Contracts Dataset (CHSC) 1.0

92. Czech image captioning, machine translation, and sentiment analysis (Neural Monkey models)

93. Czech image captioning, machine translation, sentiment analysis and summarization (Neural Monkey models)

94. Czech Legal Text Treebank

95. Czech Legal Text Treebank 2.0

96. Czech Malach Cross-lingual Speech Retrieval Test Collection

97. Czech Models (CNEC) for NameTag

98. Czech Models (MorfFlex CZ + PDT) for MorphoDiTa

99. Czech Models (MorfFlex CZ 160310 + PDT 3.0) for MorphoDiTa 160310

100. Czech Models (MorfFlex CZ 161115 + PDT 3.0) for MorphoDiTa 161115