Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Name: en2cs_model.tgz
- Size: 37.56 GB
- Format: application/x-gzip
- Description: Contains the lexical models (lex.e2f and lex.f2e), the phrase table (phrase-table.gz), the word-based reordering model (reordering-table.wbe-msd-bidirectional-fe.gz), the hierarchical reordering model (reordering-table.hier-msd-bidirectional-fe.gz), and the moses.ini file
- MD5: bed88bbeef3afc454c3f02845ab72769
- Name: wmt16.czeng.blm.cs.tgz
- Size: 9.28 GB
- Format: application/x-gzip
- Description: KenLM 5-gram language model (binarized), trained only on the Czech side of the CzEng parallel data used for training
- MD5: de338fd4ba04b82631aab9488c468cd6
- Name: wmt16.mono.blm.cs.tgz
- Size: 18.98 GB
- Format: application/x-gzip
- Description: KenLM 5-gram language model (binarized), trained on all Czech monolingual data available for WMT except Common Crawl (see the Makefile for details of the monolingual data used)
- MD5: 6347aa8e420db60142cb1384ea1cab0d
- Name: Makefile
- Size: 16.96 KB
- Format: Unknown
- Description: You can recreate the models using this Makefile
- MD5: 5f56434491ccb9591c35d8fe20fb8aa9
- Name: moses.ini
- Size: 1.29 KB
- Format: Unknown
- Description: The moses.ini file used for tuning
- MD5: 8c60e67f303419ad03fee1fe00aef8cf
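
Because the archives are large, it may be worth checking each download against the MD5 values listed above before unpacking. The following is a minimal sketch, not part of the original item: the function names `md5_of_file` and `verify_downloads` are hypothetical, and the files are assumed to sit in one local directory under the names shown in the listing.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing above.
EXPECTED_MD5 = {
    "en2cs_model.tgz": "bed88bbeef3afc454c3f02845ab72769",
    "wmt16.czeng.blm.cs.tgz": "de338fd4ba04b82631aab9488c468cd6",
    "wmt16.mono.blm.cs.tgz": "6347aa8e420db60142cb1384ea1cab0d",
    "Makefile": "5f56434491ccb9591c35d8fe20fb8aa9",
    "moses.ini": "8c60e67f303419ad03fee1fe00aef8cf",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks so that
    multi-gigabyte archives never have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed file name to 'ok', 'mismatch', or 'missing'.

    Assumption: files are stored in `directory` under their listed names.
    """
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == expected:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    for name, status in verify_downloads().items():
        print(f"{name}: {status}")
```

Run from the download directory, the script prints one status line per file; anything reported as a mismatch is probably a truncated or corrupted download and would need to be fetched again.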