Search

Search Constraints

Start Over You searched for: Type corpus Remove constraint Type: corpus

Search Results

2. A Human-Annotated Dataset for Language Modeling and Named Entity Recognition in Medieval Documents

3. A Human-Annotated Dataset for Language Modeling and Named Entity Recognition in Medieval Documents (2023-01-05)

4. A Human-Annotated Dataset of Scanned Images and OCR Texts from Medieval Documents

5. A Human-Annotated Dataset of Scanned Images and OCR Texts from Medieval Documents: Supplementary Materials

6. A morphological layer for the German part of the SMULTRON corpus

7. A Small Dataset for English-to-Czech Speech Translation in the Travel Domain

8. A Speech Test Set of Practice Business Presentations with Additional Relevant Texts

9. Additional German-Czech reference translations of the WMT'11 test set

10. Aging effects in an evolving phonological network

11. Air Traffic Control Communication

13. AKCES 2

14. AKCES 2 ver. 2

15. AKCES 3

16. AKCES 4

17. AKCES 5 (CzeSL-SGT)

18. AKCES 5 (CzeSL-SGT) Release 2

19. AKCES-GEC Grammatical Error Correction Dataset for Czech

20. AlbMoRe Movie Reviews in Albanian

21. AlbNER Named Entity Recognition in Albanian

22. AlbNews Albanian Topic Modeling

23. Alex Context NLG Dataset

27. Amharic Web Corpus

30. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0)

31. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)

32. Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)

33. Annotated Corpus of Czech Case Law for Reference Recognition Tasks

34. Annotated Corpus of Czech Case Law for Reference Recognition Tasks (2019-06-25)

35. Annotated Corpus of Czech Case Law for Segmentation Tasks

36. Annotation of Dramatic Situations in Theater Play Scripts

37. Annotation of Dramatic Situations in Theater Play Scripts (2023)

38. APE Shared Task WMT17: Human Post-edits Test Data DE-EN

39. APE Shared Task WMT17: Human Post-edits Test Data EN-DE

40. APE Shared Task WMT18: Human Post-edits and References Test Data EN-DE PBSMT

41. Arabic ACL corpus

43. Artificial Treebank with Ellipsis

45. Aspect-Term Annotated Customer Reviews in Czech

46. Audio and video database of Latvian folklore

47. Audio Recordings Archive

48. AudioPSP 24.01: Audio recordings of proceedings of the Chamber of Deputies of the Parliament of the Czech Republic

49. Automatic Paraphrases of Czech Reference Sentences for WMT11, 13 and 14

50. Automatically generated spelling correction corpus for Czech (Czech-SEC-AG)