Number of results to display per page
Search Results
22. Retrograde Morphemic Dictionary of Czech - verbs
- Creator:
- Slavíčková, Eleonora, Hlaváčová, Jaroslava, and Pognan, Patrice
- Publisher:
- Academia
- Type:
- text, lexicon, and lexicalConceptualResource
- Subject:
- morphemes, morphology, prefix, and root
- Language:
- Czech
- Description:
- The file contains all Czech verbs included in the Retrograde Morphemic Dictionary of Czech Language (Slavíčková Eleonora, Academia 1975). The data was obtained by scanning a portion of the dictionary that contains words ending in -ci and -ti. Among them, there were 18 non-verbs, which were removed. Using OCR, the data was converted into the plain text format and the result was checked by two independent readers. However, if a user encounters a forgotten error, please report.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
23. Semantically annotated sample of Czech and English conversion pairs of verbs and nouns
- Creator:
- Hledíková, Hana
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text, wordList, and lexicalConceptualResource
- Subject:
- word-formation, morphology, conversion, semantics, and cognitive
- Language:
- English and Czech
- Description:
- Supplementary files for a comparative study of word-formation without the addition of derivational affixes (conversion) in English and Czech. The two .csv files contain 300 verb-noun conversion pairs in English and 300 verb-noun conversion pairs in Czech, i.e. pairs where either the noun is created from the verb or the verb is created from the noun without the use of derivational affixes. In English, the noun and verb in the conversion pair have the same form. In Czech, the noun and verb in the conversion pair differ in inflectional affixes. The pairs are supplied with manual semantic annotation based on cognitive event schemata. A file with the Appendix includes a list of dictionary definition phrases used as a basis for the semantic annotation.
- Rights:
- Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
24. Slovak Dependency Treebank
- Creator:
- Gajdošová, Katarína, Šimková, Mária, and et al.
- Publisher:
- Jazykovedný ústav Ľ. Štúra Slovenskej akadémie vied
- Type:
- text and corpus
- Subject:
- dependency, treebank, syntax, and morphology
- Language:
- Slovak
- Description:
- Slovak Dependency Treebank (Slovenský závislostný korpus) was created as part of the Slovak National Corpus at the Ľ. Štúr Institute of the Slovak Academy of Sciences. The annotation follows the guidelines of the Prague Dependency Treebank (Czech), slightly modified in the spirit of Slovak grammatical tradition. Morphological tags, lemmas and dependency relations have been assigned manually to every word. The present dataset is a subset of the original treebank. We automatically selected the sentences where the two human annotators 100% agreed on the analysis. This increases the quality and trustworthiness of the data but it also results in selecting short sentences most of the time. An extended version may be published in the future when manually merged and checked annotation is available. The selected sentences have been converted to the CoNLL-X file format (original token IDs are preserved in the FEATS column). This PDT-style annotation will serve as the source for the first Slovak dataset in the Universal Dependencies (to be published separately).
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
25. STYX
- Creator:
- Kučera, Ondřej
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- toolService
- Subject:
- education, morphology, and syntax
- Language:
- Czech
- Description:
- The STYX system is an electronic exercise book for practising Czech morphology and syntax consisting of more than 11, 000 sentences.
- Rights:
- GNU General Public Licence, version 3, http://opensource.org/licenses/GPL-3.0, and PUB
26. Universal Dependencies 1.0
- Creator:
- Nivre, Joakim, Bosco, Cristina, Choi, Jinho, de Marneffe, Marie-Catherine, Dozat, Timothy, Farkas, Richárd, Foster, Jennifer, Ginter, Filip, Goldberg, Yoav, Hajič, Jan, Kanerva, Jenna, Laippala, Veronika, Lenci, Alessandro, Lynn, Teresa, Manning, Christopher, McDonald, Ryan, Missilä, Anna, Montemagni, Simonetta, Petrov, Slav, Pyysalo, Sampo, Silveira, Natalia, Simi, Maria, Smith, Aaron, Tsarfaty, Reut, Vincze, Veronika, and Zeman, Daniel
- Publisher:
- Universal Dependencies Consortium
- Type:
- text and corpus
- Subject:
- treebank, dependency, syntax, morphology, harmonized annotation, interset, universal tagset, and stanford dependencies
- Language:
- Czech, German, English, Spanish, Finnish, French, Irish, Italian, Swedish, and Hungarian
- Description:
- Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
- Rights:
- Universal Dependencies 1.0 License Set, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-1.0, and PUB
27. Universal Dependencies 1.1
- Creator:
- Agić, Željko, Aranzabe, Maria Jesus, Atutxa, Aitziber, Bosco, Cristina, Choi, Jinho, de Marneffe, Marie-Catherine, Dozat, Timothy, Farkas, Richárd, Foster, Jennifer, Ginter, Filip, Goenaga, Iakes, Gojenola, Koldo, Goldberg, Yoav, Hajič, Jan, Johannsen, Anders Trærup, Kanerva, Jenna, Kuokkala, Juha, Laippala, Veronika, Lenci, Alessandro, Lindén, Krister, Ljubešić, Nikola, Lynn, Teresa, Manning, Christopher, Martínez, Héctor Alonso, McDonald, Ryan, Missilä, Anna, Montemagni, Simonetta, Nivre, Joakim, Nurmi, Hanna, Osenova, Petya, Petrov, Slav, Piitulainen, Jussi, Plank, Barbara, Prokopidis, Prokopis, Pyysalo, Sampo, Seeker, Wolfgang, Seraji, Mojgan, Silveira, Natalia, Simi, Maria, Simov, Kiril, Smith, Aaron, Tsarfaty, Reut, Vincze, Veronika, and Zeman, Daniel
- Publisher:
- Universal Dependencies Consortium
- Type:
- text and corpus
- Subject:
- treebank, dependency syntax, morphology, harmonized annotation, interset, universal tagset, stanford dependencies, and universal dependencies
- Language:
- Basque, Bulgarian, Croatian, Czech, Danish, English, Finnish, French, German, Modern Greek (1453-), Hebrew, Hungarian, Indonesian, Irish, Italian, Persian, Spanish, and Swedish
- Description:
- Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008). This is the second release of UD Treebanks, Version 1.1.
- Rights:
- Licence Universal Dependencies v1.1, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-1.1, and PUB
28. Universal Dependencies 1.2
- Creator:
- Nivre, Joakim, Agić, Željko, Aranzabe, Maria Jesus, Asahara, Masayuki, Atutxa, Aitziber, Ballesteros, Miguel, Bauer, John, Bengoetxea, Kepa, Bhat, Riyaz Ahmad, Bosco, Cristina, Bowman, Sam, Celano, Giuseppe G. A., Connor, Miriam, de Marneffe, Marie-Catherine, Diaz de Ilarraza, Arantza, Dobrovoljc, Kaja, Dozat, Timothy, Erjavec, Tomaž, Farkas, Richárd, Foster, Jennifer, Galbraith, Daniel, Ginter, Filip, Goenaga, Iakes, Gojenola, Koldo, Goldberg, Yoav, Gonzales, Berta, Guillaume, Bruno, Hajič, Jan, Haug, Dag, Ion, Radu, Irimia, Elena, Johannsen, Anders, Kanayama, Hiroshi, Kanerva, Jenna, Krek, Simon, Laippala, Veronika, Lenci, Alessandro, Ljubešić, Nikola, Lynn, Teresa, Manning, Christopher, Mărănduc, Cătălina, Mareček, David, Martínez Alonso, Héctor, Mašek, Jan, Matsumoto, Yuji, McDonald, Ryan, Missilä, Anna, Mititelu, Verginica, Miyao, Yusuke, Montemagni, Simonetta, Mori, Shunsuke, Nurmi, Hanna, Osenova, Petya, Øvrelid, Lilja, Pascual, Elena, Passarotti, Marco, Perez, Cenel-Augusto, Petrov, Slav, Piitulainen, Jussi, Plank, Barbara, Popel, Martin, Prokopidis, Prokopis, Pyysalo, Sampo, Ramasamy, Loganathan, Rosa, Rudolf, Saleh, Shadi, Schuster, Sebastian, Seeker, Wolfgang, Seraji, Mojgan, Silveira, Natalia, Simi, Maria, Simionescu, Radu, Simkó, Katalin, Simov, Kiril, Smith, Aaron, Štěpánek, Jan, Suhr, Alane, Szántó, Zsolt, Tanaka, Takaaki, Tsarfaty, Reut, Uematsu, Sumire, Uria, Larraitz, Varga, Viktor, Vincze, Veronika, Žabokrtský, Zdeněk, Zeman, Daniel, and Zhu, Hanzhi
- Publisher:
- Universal Dependencies Consortium
- Type:
- text and corpus
- Subject:
- treebank, dependency, syntax, morphology, harmonized annotation, interset, universal tagset, and stanford dependencies
- Language:
- Ancient Greek (to 1453), Arabic, Basque, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Gothic, Modern Greek (1453-), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Norwegian, Church Slavic, Persian, Polish, Portuguese, Romanian, Slovenian, Spanish, Swedish, and Tamil
- Description:
- Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
- Rights:
- Licence Universal Dependencies v1.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-1.2, and PUB
29. Universal Dependencies 1.3
- Creator:
- Nivre, Joakim, Agić, Željko, Ahrenberg, Lars, Aranzabe, Maria Jesus, Asahara, Masayuki, Atutxa, Aitziber, Ballesteros, Miguel, Bauer, John, Bengoetxea, Kepa, Berzak, Yevgeni, Bhat, Riyaz Ahmad, Bosco, Cristina, Bouma, Gosse, Bowman, Sam, Cebiroğlu Eryiğit, Gülşen, Celano, Giuseppe G. A., Çöltekin, Çağrı, Connor, Miriam, de Marneffe, Marie-Catherine, Diaz de Ilarraza, Arantza, Dobrovoljc, Kaja, Dozat, Timothy, Droganova, Kira, Erjavec, Tomaž, Farkas, Richárd, Foster, Jennifer, Galbraith, Daniel, Garza, Sebastian, Ginter, Filip, Goenaga, Iakes, Gojenola, Koldo, Gokirmak, Memduh, Goldberg, Yoav, Gómez Guinovart, Xavier, Gonzáles Saavedra, Berta, Grūzītis, Normunds, Guillaume, Bruno, Hajič, Jan, Haug, Dag, Hladká, Barbora, Ion, Radu, Irimia, Elena, Johannsen, Anders, Kaşıkara, Hüner, Kanayama, Hiroshi, Kanerva, Jenna, Katz, Boris, Kenney, Jessica, Krek, Simon, Laippala, Veronika, Lam, Lucia, Lenci, Alessandro, Ljubešić, Nikola, Lyashevskaya, Olga, Lynn, Teresa, Makazhanov, Aibek, Manning, Christopher, Mărănduc, Cătălina, Mareček, David, Martínez Alonso, Héctor, Mašek, Jan, Matsumoto, Yuji, McDonald, Ryan, Missilä, Anna, Mititelu, Verginica, Miyao, Yusuke, Montemagni, Simonetta, Mori, Keiko Sophie, Mori, Shunsuke, Muischnek, Kadri, Mustafina, Nina, Müürisep, Kaili, Nikolaev, Vitaly, Nurmi, Hanna, Osenova, Petya, Øvrelid, Lilja, Pascual, Elena, Passarotti, Marco, Perez, Cenel-Augusto, Petrov, Slav, Piitulainen, Jussi, Plank, Barbara, Popel, Martin, Pretkalniņa, Lauma, Prokopidis, Prokopis, Puolakainen, Tiina, Pyysalo, Sampo, Ramasamy, Loganathan, Rituma, Laura, Rosa, Rudolf, Saleh, Shadi, Saulīte, Baiba, Schuster, Sebastian, Seeker, Wolfgang, Seraji, Mojgan, Shakurova, Lena, Shen, Mo, Silveira, Natalia, Simi, Maria, Simionescu, Radu, Simkó, Katalin, Simov, Kiril, Smith, Aaron, Spadine, Carolyn, Suhr, Alane, Sulubacak, Umut, Szántó, Zsolt, Tanaka, Takaaki, Tsarfaty, Reut, Tyers, Francis, Uematsu, Sumire, Uria, Larraitz, van Noord, Gertjan, Varga, Viktor, Vincze, Veronika, Wang, Jing Xian, Washington, Jonathan North, Žabokrtský, Zdeněk, Zeman, Daniel, and Zhu, Hanzhi
- Publisher:
- Universal Dependencies Consortium
- Type:
- text and corpus
- Subject:
- treebank, dependency, syntax, morphology, harmonized annotation, interset, universal tagset, and stanford dependencies
- Language:
- Ancient Greek (to 1453), Arabic, Basque, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Gothic, Modern Greek (1453-), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Norwegian, Church Slavic, Persian, Polish, Portuguese, Romanian, Slovenian, Spanish, Swedish, Tamil, Catalan, Chinese, Galician, Kazakh, Latvian, Russian, and Turkish
- Description:
- Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
- Rights:
- Licence Universal Dependencies v1.3, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-1.3, and PUB
30. Universal Dependencies 1.4
- Creator:
- Nivre, Joakim, Agić, Željko, Ahrenberg, Lars, Aranzabe, Maria Jesus, Asahara, Masayuki, Atutxa, Aitziber, Ballesteros, Miguel, Bauer, John, Bengoetxea, Kepa, Berzak, Yevgeni, Bhat, Riyaz Ahmad, Bick, Eckhard, Börstell, Carl, Bosco, Cristina, Bouma, Gosse, Bowman, Sam, Cebiroğlu Eryiğit, Gülşen, Celano, Giuseppe G. A., Chalub, Fabricio, Çöltekin, Çağrı, Connor, Miriam, Davidson, Elizabeth, de Marneffe, Marie-Catherine, Diaz de Ilarraza, Arantza, Dobrovoljc, Kaja, Dozat, Timothy, Droganova, Kira, Dwivedi, Puneet, Eli, Marhaba, Erjavec, Tomaž, Farkas, Richárd, Foster, Jennifer, Freitas, Claudia, Gajdošová, Katarína, Galbraith, Daniel, Garcia, Marcos, Gärdenfors, Moa, Garza, Sebastian, Ginter, Filip, Goenaga, Iakes, Gojenola, Koldo, Gökırmak, Memduh, Goldberg, Yoav, Gómez Guinovart, Xavier, Gonzáles Saavedra, Berta, Grioni, Matias, Grūzītis, Normunds, Guillaume, Bruno, Hajič, Jan, Hà Mỹ, Linh, Haug, Dag, Hladká, Barbora, Ion, Radu, Irimia, Elena, Johannsen, Anders, Jørgensen, Fredrik, Kaşıkara, Hüner, Kanayama, Hiroshi, Kanerva, Jenna, Katz, Boris, Kenney, Jessica, Kotsyba, Natalia, Krek, Simon, Laippala, Veronika, Lam, Lucia, Lê Hồng, Phương, Lenci, Alessandro, Ljubešić, Nikola, Lyashevskaya, Olga, Lynn, Teresa, Makazhanov, Aibek, Manning, Christopher, Mărănduc, Cătălina, Mareček, David, Martínez Alonso, Héctor, Martins, André, Mašek, Jan, Matsumoto, Yuji, McDonald, Ryan, Missilä, Anna, Mititelu, Verginica, Miyao, Yusuke, Montemagni, Simonetta, Mori, Keiko Sophie, Mori, Shunsuke, Moskalevskyi, Bohdan, Muischnek, Kadri, Mustafina, Nina, Müürisep, Kaili, Nguyễn Thị, Lương, Nguyễn Thị Minh, Huyền, Nikolaev, Vitaly, Nurmi, Hanna, Osenova, Petya, Östling, Robert, Øvrelid, Lilja, Paiva, Valeria, Pascual, Elena, Passarotti, Marco, Perez, Cenel-Augusto, Petrov, Slav, Piitulainen, Jussi, Plank, Barbara, Popel, Martin, Pretkalniņa, Lauma, Prokopidis, Prokopis, Puolakainen, Tiina, Pyysalo, Sampo, Rademaker, Alexandre, Ramasamy, Loganathan, Real, Livy, Rituma, Laura, Rosa, Rudolf, Saleh, Shadi, Saulīte, Baiba, Schuster, Sebastian, Seeker, Wolfgang, Seraji, Mojgan, Shakurova, Lena, Shen, Mo, Silveira, Natalia, Simi, Maria, Simionescu, Radu, Simkó, Katalin, Šimková, Mária, Simov, Kiril, Smith, Aaron, Spadine, Carolyn, Suhr, Alane, Sulubacak, Umut, Szántó, Zsolt, Tanaka, Takaaki, Tsarfaty, Reut, Tyers, Francis, Uematsu, Sumire, Uria, Larraitz, van Noord, Gertjan, Varga, Viktor, Vincze, Veronika, Wallin, Lars, Wang, Jing Xian, Washington, Jonathan North, Wirén, Mats, Žabokrtský, Zdeněk, Zeldes, Amir, Zeman, Daniel, and Zhu, Hanzhi
- Publisher:
- Universal Dependencies Consortium
- Type:
- text and corpus
- Subject:
- treebank, dependency, syntax, morphology, harmonized annotation, interset, universal tagset, and stanford dependencies
- Language:
- Ancient Greek (to 1453), Arabic, Basque, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Gothic, Modern Greek (1453-), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Norwegian, Church Slavic, Persian, Polish, Portuguese, Romanian, Slovenian, Spanish, Swedish, Tamil, Catalan, Chinese, Galician, Kazakh, Latvian, Russian, Turkish, Coptic, Sanskrit, Slovak, Swedish Sign Language, Ukrainian, Uighur, and Vietnamese
- Description:
- Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
- Rights:
- Licence Universal Dependencies v1.4, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-1.4, and PUB