Zobrazit minimální záznam

 
dc.contributor.author Urešová, Zdeňka
dc.contributor.author Bémová, Alevtina
dc.contributor.author Fučíková, Eva
dc.contributor.author Hajič, Jan
dc.contributor.author Kolářová, Veronika
dc.contributor.author Mikulová, Marie
dc.contributor.author Pajas, Petr
dc.contributor.author Panevová, Jarmila
dc.contributor.author Štěpánek, Jan
dc.date.accessioned 2021-01-22T12:57:05Z
dc.date.available 2021-01-22T12:57:05Z
dc.date.issued 2021-01-20
dc.identifier.uri http://hdl.handle.net/11234/1-3499
dc.description The valency lexicon PDT-Vallex 4.0 has been built in close connection with the annotation of the Prague Dependency Treebank project (PDT) and its successors (mainly the Prague Czech-English Dependency Treebank project, PCEDT, the spoken language corpus (PDTSC) and corpus of user-generated texts in the project Faust). It contains over 14500 valency frames for almost 8500 verbs which occurred in the PDT, PCEDT, PDTSC and Faust corpora. In addition, there are nouns, adjectives and adverbs, linked from the PDT part only, increasing the total to over 17000 valency frames for 13000 words. All the corpora have been published in 2020 as the PDT-C 1.0 corpus with the PDT-Vallex 4.0 dictionary included; this is a copy of the dictionary published as a separate item for those not interested in the corpora themselves. It is available in electronically processable format (XML), and also in more human readable form including corpus examples (see the WEBSITE link below, and the links to its main publications elsewhere in this metadata). The main feature of the lexicon is its linking to the annotated corpora - each occurrence of each verb is linked to the appropriate valency frame with additional (generalized) information about its usage and surface morphosyntactic form alternatives. It replaces the previously published unversioned edition of PDT-Vallex from 2014.
dc.language.iso ces
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation.isreferencedby http://ufal.mff.cuni.cz/~uresova/web.pdf/2003-PDT-VALLEX-Creating%20a%20Large-coverage%20Valency%20Lexicon.pdf
dc.relation.isreferencedby https://www.aclweb.org/anthology/2020.lrec-1.641.pdf
dc.relation.isreferencedby https://ufal.mff.cuni.cz/books/2011-uresova-slovnik
dc.relation.isreferencedby https://ufal.mff.cuni.cz/books/2011-uresova
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.uri https://ufal.mff.cuni.cz/pdt-vallex-valency-lexicon-linked-czech-corpora
dc.subject verbal valency
dc.subject valency
dc.subject annotation
dc.subject linguistic data
dc.subject lexicon
dc.subject lexical semantics
dc.subject PDT
dc.title PDT-Vallex: Czech Valency lexicon linked to treebanks 4.0 (PDT-Vallex 4.0)
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#ContentInfo.detailedType computationalLexicon
dc.rights.label PUB
hidden false
hasMetadata false
has.files yes
branding LINDAT / CLARIAH-CZ
demo.uri http://lindat.mff.cuni.cz/services/PDT-Vallex/
contact.person Jan Hajič hajic@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2015071 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2018101 LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky CZ.02.1.01/0.0/0.0/16_013/0001781 LINDAT/CLARIN - Výzkumná infrastruktura pro jazykové technologie - rozšíření repozitáře a výpočetní kapacity nationalFunds
sponsor Grantová agentura České republiky GA17-07313S Contextually-based synonymy and valency of verbs in a bilingual setting nationalFunds
size.info 13027 words
size.info 17341 entries
files.size 1689947
files.count 1


 Soubory tohoto záznamu

Icon
Název
PDT-Vallex-4.0.zip
Velikost
1.61 MB
Formát
application/zip
Popis
PDT-Vallex 4.0 XML file and docs
MD5
3f62c72054115ae5070c5a458b2eb71b
 Stáhnout soubor  Náhled
 Náhled souboru  
  • PDT-Vallex-4.0
    • pics
      • default-project-ufal.png4 kB
      • licla.ico99 kB
      • logo_ufal_110u.png2 kB
      • LINDAT-CLARIAH-cz.png272 kB
      • index.ico99 kB
    • credits.html3 kB
    • data
      • pdtvallex-4.0.xml21 MB
    • acknowledgements.html2 kB
    • rest_api.html5 kB
    • publications.html4 kB
    • documentation.html3 kB
    • index.html9 kB
    • data.html2 kB
    • styles
      • main.css1 kB
    • licence.html5 kB

Zobrazit minimální záznam