Search Results
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bengali , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Gujarati , Hebrew , Hindi , Croatian , Hungarian , Indonesian , Italian , Japanese , Korean , Latvian , Lithuanian , Malayalam , Marathi , Macedonian , Nepali (macrolanguage) , Dutch , Norwegian , Polish , Portuguese , Romanian , Russian , Slovak , Slovenian , Somali , Spanish , Albanian , Swahili (macrolanguage) , Swedish , Tamil , Telugu , Tagalog , Thai , Turkish , Ukrainian , Undetermined , Urdu , Vietnamese , and Chinese
Description:
A large web corpus (over 10 billion tokens) in 50+ languages, licensed under the Creative Commons license family. It was extracted from CommonCrawl, the largest publicly available general Web crawl to date, with about 2 billion crawled URLs.
Rights:
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) , http://creativecommons.org/licenses/by-nc-sa/4.0/ , and PUB
Creator:
Stanislav Brouček
Publisher:
Ústav pro etnografii a folkloristiku ČSAV
Format:
print and 274 pp. : photographic supplements, maps
Type:
model:monograph and TEXT
Subject:
Ethnology. Ethnography. Folklore , 19th century , history , ethnography , folklore studies , Czech identity , Czechia , 39 , 94(437.3) , 398 , 316.344.8(=162.3) , (048.8) , and 1
Language:
Czech , German , and Russian
Description:
Stanislav Brouček., Russian and German summaries, and Joint Czech and German title
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
Format:
print
Type:
model:internalpart and TEXT
Language:
Czech , German , French , and Russian
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
Format:
print
Type:
model:internalpart and TEXT
Language:
Czech , German , French , and Russian
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
Format:
print
Type:
model:internalpart and TEXT
Language:
Czech , German , French , and Russian
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
Format:
print
Type:
model:internalpart and TEXT
Language:
Czech , German , and Russian
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
Format:
print
Type:
model:internalpart and TEXT
Language:
Czech , German , French , and Russian
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
Creator:
Straka, Milan
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
tool and toolService
Subject:
coreference resolution , CorPipe , and CorefUD
Language:
Catalan , Czech , German , English , Spanish , French , Hungarian , Lithuanian , Norwegian Bokmål , Norwegian Nynorsk , Polish , Russian , and Turkish
Description:
The `corpipe23-corefud1.1-231206` is a `mT5-large`-based multilingual model for coreference resolution usable in CorPipe 23 (https://github.com/ufal/crac2023-corpipe). It is released under the CC BY-NC-SA 4.0 license.
The model is language agnostic (no _corpus id_ on input), so it can be used to predict coreference in any `mT5` language (for zero-shot evaluation, see the paper). Note, however, that empty nodes must already be present in the input; they are not predicted (the same setting as in the CRAC23 shared task).
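Because empty nodes are not predicted, it can help to check whether an input file carries any before running the model. The following is a minimal sketch, assuming the input is in CoNLL-U format (as CorefUD data is), where empty nodes are token lines whose ID has a decimal part such as `2.1`. The names `count_empty_nodes` and `row` are hypothetical helpers for illustration and are not part of CorPipe.

```python
def count_empty_nodes(conllu_text: str) -> int:
    """Count CoNLL-U empty nodes, i.e. token lines whose ID looks like '2.1'."""
    count = 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        # The ID is the first tab-separated column; only empty nodes contain a dot.
        token_id = line.split("\t", 1)[0]
        if "." in token_id:
            count += 1
    return count


def row(token_id: str, form: str) -> str:
    """Build a 10-column CoNLL-U token line with unspecified remaining fields."""
    return "\t".join([token_id, form] + ["_"] * 8)


# Toy sentence (assumed data): one empty node with ID 2.1.
sample = "\n".join([
    "# sent_id = 1",
    row("1", "Sue"),
    row("2", "likes"),
    row("2.1", "likes"),
    row("3", "."),
])

print(count_empty_nodes(sample))
```

A count of zero does not by itself mean the input is wrong, since a text may contain no empty nodes; the check only shows whether any are present.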
Rights:
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) , http://creativecommons.org/licenses/by-nc-sa/4.0/ , and PUB
Format:
print
Type:
model:internalpart and TEXT
Language:
Czech , Russian , and English
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
Format:
print
Type:
model:internalpart and TEXT
Language:
Czech , Russian , and English
Rights:
http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public