Search Results
2. Szeged Corpus 2.0
- Publisher:
- Department of Informatics, Human Language Technology Group, University of Szeged
- Format:
- application/xml
- Type:
- corpus
- Subject:
- monolingual corpus, annotated corpus, and POS annotation
- Language:
- Hungarian
- Description:
- written, monolingual, general, manually POS annotated reference corpus; 1,459,288 tokens; MSD tagset, XML (TEI P4) files
- Rights:
- Not specified
3. The Diorisis Ancient Greek Corpus
- Creator:
- Vatri, Alessandro and McGillivray, Barbara
- Publisher:
- Figshare
- Type:
- text and corpus
- Subject:
- annotated corpus, ancient world, lemmatization, and part of speech
- Language:
- Ancient Greek (to 1453)
- Description:
- An annotated corpus of literary Ancient Greek sourced from the Perseus Canonical Greek Lit repository (https://github.com/PerseusDL/canonical-greekLit), “The Little Sailing” digital library (http://www.mikrosapoplous.gr/en/texts1en.html), and the Bibliotheca Augustana digital library (http://www.hs-augsburg.de/~harsch/augustana.html#gr). The corpus consists of 820 texts spanning from the beginnings of the Ancient Greek literary tradition (Homer) to the fifth century AD, and it contains 10,206,421 words. In addition to referring to this resource, please use the following citation when citing the corpus: Vatri, A., & McGillivray, B. (2018). The Diorisis Ancient Greek Corpus, Research Data Journal for the Humanities and Social Sciences, 3(1), 55-65. doi: https://doi.org/10.1163/24523666-01000013
- Rights:
- Not specified