DeriNet 2.1
- Title:
- DeriNet 2.1
- Creator:
- Vidra, Jonáš, Žabokrtský, Zdeněk, Kyjánek, Lukáš, Ševčíková, Magda, Dohnalová, Šárka, Svoboda, Emil, and Bodnár, Jan
- Contributor:
- Ministerstvo školství, mládeže a tělovýchovy České republiky@@LM2015071@@LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat@@nationalFunds@@, Grantová agentura České Republiky@@19-14534S@@Popis slovotvorné struktury českých slov na základě jazykových dat@@nationalFunds@@, Charles University Grant Agency@@1176219@@Developing derivational networks for multiple languages@@nationalFunds@@, and Charles University@@START/HUM/010@@A data-based approach to competition in word-formation: selected semantic categories across seven languages@@nationalFunds@@
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Identifier:
- http://hdl.handle.net/11234/1-3765
- Subject:
- DeriNet, derivation, derivational morphology, lexical network, and MorfFlex
- Type:
- wordnet, text, and lexicalConceptualResource
- Description:
- DeriNet is a lexical network which models derivational relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes, while edges represent word-formational relations between a derived word and its base word / words. The present version, DeriNet 2.1, contains 1,039,012 lexemes (sampled from the MorfFlex CZ 2.0 dictionary) connected by 782,814 derivational, 50,533 orthographic variant, 1,952 compounding, 295 univerbation and 144 conversion relations. Compared to the previous version, version 2.1 contains annotations of orthographic variants, full automatically generated annotation of affix morpheme boundaries (in addition to the roots annotated in 2.0), 202 affixoid lexemes serving as bases for compounding, annotation of corpus frequency of lexemes, annotation of verbal conjugation classes and a pilot annotation of univerbation. The set of part-of-speech tags was converted to Universal POS from the Universal Dependencies project.
- Language:
- Czech
- Rights:
- Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
PUB
http://creativecommons.org/licenses/by-nc-sa/3.0/ - Relation:
- https://quest.ms.mff.cuni.cz/derisearch2/v2/databases/
http://hdl.handle.net/11234/1-2995 - Source:
- https://ufal.mff.cuni.cz/derinet
- Harvested from:
- LINDAT/CLARIAH-CZ repository
- Metadata only:
- false
- Date:
- 2021-07-25
The item or associated files might be "in copyright"; review the provided rights metadata:
- Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
- PUB
- http://creativecommons.org/licenses/by-nc-sa/3.0/