dc.contributor.author Straka, Milan
dc.date.accessioned 2024-10-07T15:30:49Z
dc.date.available 2024-10-07T15:30:49Z
dc.date.issued 2024-09-06
dc.identifier.uri http://hdl.handle.net/11234/1-5672
dc.description The `corpipe24-corefud1.2-240906` is a `mT5-large`-based multilingual model for coreference resolution, usable in CorPipe 24 (https://github.com/ufal/crac2024-corpipe). It is released under the CC BY-NC-SA 4.0 license. The model is language agnostic (it takes no corpus id on input), so in theory it can be used to predict coreference in any language covered by `mT5`. The model also jointly predicts the empty nodes needed for zero coreference. The paper introducing this model also presents an alternative two-stage approach, which first predicts empty nodes (via https://www.kaggle.com/models/ufal-mff/crac2024_zero_nodes_baseline/) and then performs coreference resolution (via http://hdl.handle.net/11234/1-5673); it is roughly twice as slow but performs slightly better.
dc.language.iso cat
dc.language.iso ces
dc.language.iso deu
dc.language.iso eng
dc.language.iso spa
dc.language.iso fra
dc.language.iso hun
dc.language.iso lit
dc.language.iso nob
dc.language.iso nno
dc.language.iso pol
dc.language.iso rus
dc.language.iso tur
dc.language.iso chu
dc.language.iso grc
dc.language.iso hbo
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation.isreferencedby https://arxiv.org/abs/2410.02756
dc.relation.replaces http://hdl.handle.net/11234/1-5369
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.uri https://github.com/ufal/crac2024-corpipe
dc.subject coreference resolution
dc.subject CorPipe
dc.subject CorefUD
dc.title CorPipe 24 Multilingual CorefUD 1.2 Model (corpipe24-corefud1.2-240906)
dc.type toolService
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
metashare.ResourceInfo#ContentInfo.detailedType tool
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
contact.person Milan Straka straka@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
sponsor Grantová agentura České republiky GX20-16819X LUSyD – Language Understanding: from Syntax to Discourse nationalFunds
files.size 2059183527
files.count 1


Files in this item

Name
corpipe24-corefud1.2-240906.zip
Size
1.92 GB
Format
application/zip
Description
A multilingual coreference resolution model based on `mT5-large`, trained on CorefUD 1.2 and usable in CorPipe 24 <https://github.com/ufal/crac2024-corpipe>.
MD5
9525437e590b36187c0e5d095ffdfd69
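
The size and MD5 values listed in this record can be used to check a downloaded copy of the archive. The following is a minimal sketch, not part of the record or of CorPipe: the function names `md5_of_file` and `verify_download` are hypothetical, and the local path in the usage note is an assumption about where the file was saved.

```python
import hashlib
import os

# Values copied from this record (files.size and the MD5 field).
EXPECTED_MD5 = "9525437e590b36187c0e5d095ffdfd69"
EXPECTED_SIZE = 2059183527  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Compare the file size first (cheap), then the MD5 digest."""
    if os.path.getsize(path) != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()
```

Assuming the archive was saved in the current directory, `verify_download("corpipe24-corefud1.2-240906.zip")` should return `True` for an intact download. The listed size of 1.92 GB appears to be the byte count divided by 1024^3.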