Files in this item
This item is
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Name
- cs2en_model.tgz
- Size
- 36.56 GB
- Format
- application/x-gzip
- Description
- Contains Lexical Models (lex.e2f and lex.f2e), Phrase Table (phrase-table.gz), word based reordering model (reordering-table.wbe-msd-bidirectional-fe.gz), hierarchical reordering model (reordering-table.hier-msd-bidirectional-fe.gz) and the moses.ini file
- MD5
- 9f97c40bab9bbc8844b362437ead3c71
- Name
- wmt16.czeng.blm.en.tgz
- Size
- 7.79 GB
- Format
- application/x-gzip
- Description
- kenlm 5-gram language model (binarized) trained only on the English side of CzEng parallel data used
- MD5
- dd910814d89f3bb41261ead0a95930dc
- Name
- wmt16.mono.blm.en.tgz
- Size
- 60.04 GB
- Format
- application/x-gzip
- Description
- kenlm 5-gram language model (binarized) trained on all English mono data available for WMT except Common Crawl (see the Makefile for the details of mono data used)
- MD5
- a57e4fd4f43c05f826cda33cfe257eed
- Name
- Makefile
- Size
- 16.96 KB
- Format
- Unknown
- Description
- You can recreate the models using this Makefile
- MD5
- 5f56434491ccb9591c35d8fe20fb8aa9
- Name
- moses.ini
- Size
- 1.29 KB
- Format
- Unknown
- Description
- The moses.ini file for tuning
- MD5
- 52551ca476c84dbaf9409ca3083b02c2