dc.contributor.author | Straka, Milan |
dc.date.accessioned | 2024-10-07T15:30:49Z |
dc.date.available | 2024-10-07T15:30:49Z |
dc.date.issued | 2024-09-06 |
dc.identifier.uri | http://hdl.handle.net/11234/1-5672 |
dc.description | The `corpipe24-corefud1.2-240906` is a `mT5-large`-based multilingual model for coreference resolution usable in CorPipe 24 (https://github.com/ufal/crac2024-corpipe). It is released under the CC BY-NC-SA 4.0 license. The model is language agnostic (no corpus id on input), so in theory it can be used to predict coreference in any language supported by `mT5`. The model also jointly predicts the empty nodes needed for zero coreference. The paper introducing this model also presents an alternative two-stage approach that first predicts empty nodes (via https://www.kaggle.com/models/ufal-mff/crac2024_zero_nodes_baseline/) and then performs coreference resolution (via http://hdl.handle.net/11234/1-5673); that approach is roughly twice as slow but slightly better. |
dc.language.iso | cat |
dc.language.iso | ces |
dc.language.iso | deu |
dc.language.iso | eng |
dc.language.iso | spa |
dc.language.iso | fra |
dc.language.iso | hun |
dc.language.iso | lit |
dc.language.iso | nob |
dc.language.iso | nno |
dc.language.iso | pol |
dc.language.iso | rus |
dc.language.iso | tur |
dc.language.iso | chu |
dc.language.iso | grc |
dc.language.iso | hbo |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation.isreferencedby | https://arxiv.org/abs/2410.02756 |
dc.relation.replaces | http://hdl.handle.net/11234/1-5369 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.source.uri | https://github.com/ufal/crac2024-corpipe |
dc.subject | coreference resolution |
dc.subject | CorPipe |
dc.subject | CorefUD |
dc.title | CorPipe 24 Multilingual CorefUD 1.2 Model (corpipe24-corefud1.2-240906) |
dc.type | toolService |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Milan Straka straka@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
sponsor | Grantová agentura České republiky GX20-16819X LUSyD – Language Understanding: from Syntax to Discourse nationalFunds |
files.size | 2059183527 |
files.count | 1 |
This item is Publicly Available and licensed under: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Files in this item
- Name: corpipe24-corefud1.2-240906.zip
- Size: 1.92 GB
- Format: application/zip
- Description: A multilingual coreference resolution model trained on CorefUD 1.2 based on `mT5-large` usable in CorPipe 24 <https://github.com/ufal/crac2024-corpipe>.
- MD5: 9525437e590b36187c0e5d095ffdfd69
- corpipe24-corefud1.2-240906
  - zdeprels.txt (250 B)
  - LICENSE (20 kB)
  - README.md (4 kB)
  - options.json (2 kB)
  - model.h5 (2 GB)
  - tags.txt (1 kB)
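The record lists an MD5 checksum and a byte size for the archive, so a download can be checked before unpacking. The following is a minimal sketch of such a check, not part of CorPipe itself. It assumes that `files.size` (2059183527 bytes) refers to the single zip file and that the archive was saved under its listed name. The helper names `md5_of_file` and `verify_download` are hypothetical.

```python
import hashlib
import os

# Values copied from the record above.
EXPECTED_MD5 = "9525437e590b36187c0e5d095ffdfd69"
EXPECTED_SIZE = 2059183527  # bytes; assumed to be the size of the zip file


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True if the file has the expected size and MD5 digest."""
    # The size comparison is cheap, so do it before hashing ~2 GB of data.
    if os.path.getsize(path) != expected_size:
        return False
    return md5_of_file(path) == expected_md5
```

Calling `verify_download("corpipe24-corefud1.2-240906.zip")` should return `True` for an intact download. Reading in chunks keeps memory use constant regardless of the archive size.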