Original context has metadata only: false / Publisher: Jazykovedný ústav Ľ. Štúra Slovenskej akadémie vied / Rights: PUB / Subject: morphology

Creator:: Gajdošová, Katarína, Šimková, Mária, and et al.
Publisher:: Jazykovedný ústav Ľ. Štúra Slovenskej akadémie vied
Type:: text and corpus
Subject:: dependency, treebank, syntax, and morphology
Language:: Slovak
Description:: Slovak Dependency Treebank (Slovenský závislostný korpus) was created as part of the Slovak National Corpus at the Ľ. Štúr Institute of the Slovak Academy of Sciences. The annotation follows the guidelines of the Prague Dependency Treebank (Czech), slightly modified in the spirit of Slovak grammatical tradition. Morphological tags, lemmas and dependency relations have been assigned manually to every word. The present dataset is a subset of the original treebank. We automatically selected the sentences where the two human annotators 100% agreed on the analysis. This increases the quality and trustworthiness of the data but it also results in selecting short sentences most of the time. An extended version may be published in the future when manually merged and checked annotation is available. The selected sentences have been converted to the CoNLL-X file format (original token IDs are preserved in the FEATS column). This PDT-style annotation will serve as the source for the first Slovak dataset in the Universal Dependencies (to be published separately).
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

Search