Search Results
Publisher:
Institute of Mathematics and Computer Science, University of Latvia
Type:
toolService
Subject:
morphological analyzer
Language:
Latvian
Description:
A simplified front-end (in the form of a RESTful web service) to the SemTi-Kamols morphological analyzer, intended mainly for demonstration purposes.
Rights:
Not specified
Publisher:
Tilde
Type:
toolService
Language:
Latvian
Description:
Web service
Rights:
Not specified
Publisher:
Tilde
Type:
toolService
Language:
Latvian
Description:
Morphological analyser and form generation tool.
Rights:
Not specified
Publisher:
Institute of Mathematics and Computer Science, University of Latvia
Type:
toolService
Language:
Latvian
Description:
HMM-based tagger for Latvian texts. The tagger uses information from the SemTi-Kamols morphological analyser; the tagset is derived from the MULTEXT-East project.
Rights:
Not specified
Publisher:
Institute of Mathematics and Computer Science, University of Latvia
Type:
toolService
Language:
Latvian
Description:
Latvian Text-to-Speech Synthesizer: a RESTful web service.
Rights:
Not specified
Publisher:
Institute of Mathematics and Computer Science, University of Latvia
Type:
toolService
Language:
Latvian
Description:
A standards-compliant RESTful web service, based on the lexicon of the Dictionary of the Standard Latvian Language. The morphological database contains 57 613 lemmas (1 332 889 word forms).
Rights:
Not specified
Creator:
Paikens, Pēteris; Borodkins, Imants; and Poikāns, Ilmārs
Publisher:
Institute of Mathematics and Computer Science, University of Latvia
Type:
toolService
Language:
Latvian
Description:
Semi-automatic corpus annotation tool for Latvian. Incorporates the SemTi-Kamols morphological analyzer and dependency chunker.
Rights:
Not specified
Type:
toolService
Subject:
morphological analyzer
Language:
Latvian
Description:
A Java library for morphological analysis of Latvian. The lexicon covers ~50 000 lemmas. A set of robust derivation rules is also used.
Rights:
Not specified
Creator:
Kondratyuk, Dan and Straka, Milan
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
tool and toolService
Subject:
syntax, dependency parser, and universal dependencies
Language:
Ancient Greek (to 1453), Arabic, Basque, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Gothic, Modern Greek (1453-), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Norwegian, Church Slavic, Persian, Polish, Portuguese, Romanian, Slovenian, Spanish, Swedish, Tamil, Catalan, Chinese, Galician, Kazakh, Latvian, Russian, Turkish, Coptic, Sanskrit, Slovak, Ukrainian, Uighur, Vietnamese, Belarusian, Korean, Lithuanian, Urdu, Russia Buriat, Northern Kurdish, Northern Sami, Upper Sorbian, Afrikaans, Yue Chinese, Marathi, Serbian, Swedish Sign Language, Telugu, Amharic, Armenian, Breton, Faroese, Komi-Zyrian, Nigerian Pidgin, Old French (842-ca. 1400), Tagalog, Thai, Warlpiri, Yoruba, Akkadian, Bambara, Erzya, and Maltese
Description:
Pretrained model weights for the UDify model, and extracted BERT weights in pytorch-transformers format. Note that these weights differ slightly from those used in the paper.
Rights:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) , http://creativecommons.org/licenses/by-sa/4.0/ , and PUB
Creator:
Straka, Milan
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
tool and toolService
Subject:
tokenizer, POS tagger, lemmatization, tagger, parser, and dependency parser
Language:
Afrikaans, Arabic, Belarusian, Bulgarian, Catalan, Czech, Church Slavic, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Scottish Gaelic, Irish, Galician, Gothic, Ancient Greek (to 1453), Ancient Hebrew, Hebrew, Hindi, Croatian, Hungarian, Armenian, Western Armenian, Indonesian, Icelandic, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Maltese, Dutch, Norwegian Nynorsk, Norwegian Bokmål, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Telugu, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, Gambian Wolof, Wolof, and Chinese
Description:
Tokenizer, POS Tagger, Lemmatizer and Parser models for 123 treebanks of 69 languages of the Universal Dependencies 2.10 Treebanks, created solely using UD 2.10 data (https://hdl.handle.net/11234/1-4758). The model documentation, including performance, can be found at https://ufal.mff.cuni.cz/udpipe/2/models#universal_dependencies_210_models .
To use these models, you need UDPipe version 2.0, which you can download from https://ufal.mff.cuni.cz/udpipe/2 .
Rights:
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) , http://creativecommons.org/licenses/by-nc-sa/4.0/ , and PUB