dc.contributor.author | Straka, Milan |
dc.contributor.author | Straková, Jana |
dc.date.accessioned | 2022-07-24T08:51:49Z |
dc.date.available | 2022-07-24T08:51:49Z |
dc.date.issued | 2022-07-10 |
dc.identifier.uri | http://hdl.handle.net/11234/1-4794 |
dc.description | Czech models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex CZ 2.0, DeriNet 2.1 and the PoS tagger is trained on Prague Dependency Treebank - Consolidated 1.0. |
dc.description.sponsorship | This work has been using language resources developed and/or stored and/or distributed by the LINDAT/CLARIN project of the Ministry of Education of the Czech Republic (project LM2010013). The Czech morphologic system was devised by Jan Hajič. The MorfFlex CZ dictionary was created by Jan Hajič and Jaroslava Hlaváčová. The morphologic guesser research was supported by the projects 1ET101120503 and 1ET101120413 of Academy of Sciences of the Czech Republic and 100008/2008 of Charles University Grant Agency. The research was performed by Jan Hajič, Jaroslava Hlaváčová and David Kolovratník. The tagger algorithm and feature set research was supported by the projects MSM0021620838 and LC536 of Ministry of Education, Youth and Sports of the Czech Republic, GA405/09/0278 of the Grant Agency of the Czech Republic and 1ET101120503 of Academy of Sciences of the Czech Republic. The research was performed by Drahomíra "johanka" Spoustová, Jan Hajič, Jan Raab and Miroslav Spousta. The tagger is trained on morphological layer of Prague Dependency Treebank PDT 2.5, which was supported by the projects LM2010013, LC536, LN00A063 and MSM0021620838 of Ministry of Education, Youth and Sports of the Czech Republic, and developed by Martin Buben, Jan Hajič, Jiří Hana, Hana Hanová, Barbora Hladká, Emil Jeřábek, Lenka Kebortová, Kristýna Kupková, Pavel Květoň, Jiří Mírovský, Andrea Pfimpfrová, Jan Štěpánek and Daniel Zeman. |
dc.language.iso | ces |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation.replaces | http://hdl.handle.net/11234/1-1836 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.source.uri | http://ufal.mff.cuni.cz/morphodita/users-manual#czech-morfflex2-pdtc |
dc.subject | MorphoDiTa |
dc.subject | Czech |
dc.subject | morphological analysis |
dc.subject | morphological generation |
dc.subject | PoS tagging |
dc.title | Czech Models (MorfFlex CZ 2.0 + PDT-C 1.0) for MorphoDiTa 220710 |
dc.type | languageDescription |
metashare.ResourceInfo#ContactInfo#PersonInfo.surname | Straka |
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName | Milan |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName | Charles University in Prague, UFAL |
metashare.ResourceInfo#DistributionInfo.availability | unrestrictedUse |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | academic-nonCommercialUse |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | attribution |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | shareAlike |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#TextInfo#SizeInfo.size | 68 |
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit | mb |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email | straka@ufal.mff.cuni.cz |
metashare.ResourceInfo#ContentInfo.detailedType | mlmodel |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
demo.uri | http://lindat.mff.cuni.cz/services/morphodita/ |
contact.person | Milan Straka straka@ufal.mff.cuni.cz Charles University in Prague, UFAL |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky LM2018101 LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy nationalFunds |
size.info | 95 mb |
files.size | 98838696 |
files.count | 1 |
Soubory tohoto záznamu
Licenční kategorie:
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Název
- czech-morfflex2.0-pdtc1.0-220710.zip
- Velikost
- 94.26 MB
- Formát
- application/zip
- Popis
- Czech Models (MorfFlex CZ 2.0 + PDT-C 1.0) for MorphoDiTa 220710
- MD5
- 819fb1d6a5a827bee8f3a9384aa3273a
- czech-morfflex2.0-pdtc1.0-220710
- czech-morfflex2.0-pdtc1.0-220710-no_dia-pos_only.tagger14 MB
- README.html9 kB
- czech-morfflex2.0-pdtc1.0-220710-pos_only.tagger9 MB
- README6 kB
- czech-morfflex2.0-220710-no_dia.dict4 MB
- czech-morfflex2.0-pdtc1.0-220710.tagger24 MB
- czech-morfflex2.0-220710.dict3 MB
- czech-morfflex2.0-220710-no_dia-pos_only.dict4 MB
- czech-morfflex2.0-pdtc1.0-220710-no_dia.tagger30 MB
- czech-morfflex2.0-220710-pos_only.dict3 MB
- LICENSE20 kB