dc.contributor.author | Variš, Dušan |
dc.date.accessioned | 2022-03-17T16:24:43Z |
dc.date.available | 2022-03-17T16:24:43Z |
dc.date.issued | 2022-03-15 |
dc.identifier.uri | http://hdl.handle.net/11234/1-4681 |
dc.description | English-Russian (En-Ru) translation models, exported via TensorFlow Serving and available in the Lindat translation service (https://lindat.mff.cuni.cz/services/translation/). The models were trained on the MCSQ social surveys dataset (available at https://repo.clarino.uib.no/xmlui/bitstream/handle/11509/142/mcsq_v3.zip). Their main intended use is in-domain translation of social surveys. The models are compatible with Tensor2tensor version 1.6.6. For details about model training (data, model hyper-parameters), please contact the archive maintainer. Evaluation on the MCSQ test set (BLEU): en->ru: 64.3 (trained on genuine in-domain MCSQ data); ru->en: 74.7 (trained with additional backtranslated in-domain MCSQ data). Evaluated using multeval: https://github.com/jhclark/multeval |
dc.language.iso | eng |
dc.language.iso | rus |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/823782 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.subject | machine translation |
dc.subject | neural machine translation |
dc.subject | transformer |
dc.title | MCSQ Translation Models (en-ru) (v1.0) |
dc.type | toolService |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Dušan Variš varis@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
sponsor | European Union EC/H2020/823782 SSHOC - Social Sciences & Humanities Open Cloud euFunds info:eu-repo/grantAgreement/EC/H2020/823782 |
files.size | 1385582705 |
files.count | 2 |
Files in this item
License category: Publicly Available
License: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Name: mcsq.en-ru.zip
- Size: 660.12 MB
- Format: application/zip
- Description: English-to-Russian
- MD5: 221c01740843f327162953932678135a
- mcsq.en-ru
  - vocab.enru.32768 (156 kB)
  - export
    - Servo
      - 1647265123
        - saved_model.pbtxt (7 MB)
        - variables
          - variables.index (10 kB)
          - variables.data-00000-of-00001 (711 MB)
- Name: mcsq.ru-en.zip
- Size: 661.27 MB
- Format: application/zip
- Description: Russian-to-English
- MD5: 5bcec1e0a11e6b797d559984722b2557
- mcsq.ru-en
  - vocab.enru.32768 (213 kB)
  - export
    - Servo
      - 1647265298
        - saved_model.pbtxt (7 MB)
        - variables
          - variables.index (10 kB)
          - variables.data-00000-of-00001 (713 MB)
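
The MD5 values listed for the two archives can be used to check a download before unpacking it. The following is a minimal sketch, not part of the record itself: the helper names `md5_of_file` and `verify_download` are hypothetical, and it assumes the archives were saved locally under their original file names.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing in this record.
EXPECTED_MD5 = {
    "mcsq.en-ru.zip": "221c01740843f327162953932678135a",
    "mcsq.ru-en.zip": "5bcec1e0a11e6b797d559984722b2557",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Compare a local archive against the digest listed for its file name.

    Hypothetical helper: raises KeyError if the file name is not one of the
    two archives in this record.
    """
    path = Path(path)
    expected = EXPECTED_MD5[path.name]
    return md5_of_file(path) == expected
```

For example, `verify_download("mcsq.en-ru.zip")` returns `True` only if the local file's digest matches the listed value; a `False` result suggests an incomplete or corrupted download.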