dc.contributor.author | Macková, Kateřina |
dc.contributor.author | Straka, Milan |
dc.date.accessioned | 2020-08-03T11:07:20Z |
dc.date.available | 2020-08-03T11:07:20Z |
dc.date.issued | 2020-07-01 |
dc.identifier.uri | http://hdl.handle.net/11234/1-3249 |
dc.description | Czech translations of the SQuAD 2.0 and SQuAD 1.1 datasets, containing automatically translated texts, questions, and answers from the training and development sets of each dataset. The test set is not included because it is not publicly available. The data is released under the CC BY-NC-SA 4.0 license. If you use the dataset, please cite the following paper (the exact citation format was not available when the dataset was submitted): Kateřina Macková and Milan Straka: Reading Comprehension in Czech via Machine Translation and Cross-lingual Transfer, presented at TSD 2020, Brno, Czech Republic, September 8-11, 2020. |
dc.language.iso | ces |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation.isreferencedby | https://arxiv.org/abs/2007.01667 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.subject | SQuAD |
dc.subject | reading comprehension |
dc.title | Czech Translation of SQuAD 2.0 and 1.1 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Milan Straka straka@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
sponsor | Grantová agentura České republiky GX20-16819X LUSyD – Language Understanding: from Syntax to Discourse nationalFunds |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky LM2018101 LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy nationalFunds |
sponsor | Univerzita Karlova (mimo GAUK) SVV 260 575 Specifický vysokoškolský výzkum nationalFunds |
size.info | 117933 items |
files.size | 20509180 |
files.count | 1 |
Files in this item
License category:
License: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
- Name: czech-squad.zip
- Size: 19.56 MB
- Format: application/zip
- Description: Czech Translation of SQuAD 2.0 and 1.1
- MD5: 32d7b5ae6daf4856a6c3924ba0610e05
- squad-1.1-cs
  - train-v1.1.json (27 MB)
  - dev-v1.1.json (4 MB)
- squad-2.0-cs
  - train-v2.0.json (37 MB)
  - dev-v2.0.json (4 MB)
- LICENSE (20 kB)
- README.md (1 kB)
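
The record lists an MD5 checksum and a byte size (files.size) for czech-squad.zip, so a download can be checked before it is unpacked. The following is a minimal sketch of such a check, not part of the original record: the function names `md5_of_file` and `verify_download` are hypothetical, and the local path `czech-squad.zip` is an assumption about where the archive was saved.

```python
import hashlib
from pathlib import Path

# Values copied from this record (MD5 and files.size).
EXPECTED_MD5 = "32d7b5ae6daf4856a6c3924ba0610e05"
EXPECTED_SIZE = 20509180  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True if the file matches both the expected size and MD5."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5


if __name__ == "__main__":
    # Assumed local download location; adjust as needed.
    archive = Path("czech-squad.zip")
    if archive.exists():
        print("OK" if verify_download(archive) else "MISMATCH")
```

Checking the size first is a cheap way to reject a truncated download before hashing. MD5 here only guards against transfer corruption; it is not a defense against deliberate tampering.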