Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Kyjánek, Lukáš , Žabokrtský, Zdeněk , Vidra, Jonáš , and Ševčíková, Magda
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text , lexicon , and lexicalConceptualResource
Subject:
universal derivations , uder , word-formation , derivation , derivational morphology , lexical network , and harmonization
Language:
Czech , English , Estonian , Finnish , German , French , Latin , Persian , Polish , Portuguese , Spanish , Catalan , Turkish , Scottish Gaelic , Russian , Swedish , Serbo-Croatian , Italian , Dutch , and Croatian
Description:
Universal Derivations (UDer) is a collection of harmonized lexical networks capturing word-formation, especially derivational relations, in a cross-linguistically consistent annotation scheme for many languages. The annotation scheme is based on a rooted tree data structure, in which nodes correspond to lexemes, while edges represent derivational relations or compounding. The current version of the UDer collection contains twenty-seven harmonized resources covering twenty different languages.
Rights:
Universal Derivations v1.0 License Agreement , https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UDer-1.0 , and PUB
Creator:
Kyjánek, Lukáš , Žabokrtský, Zdeněk , Vidra, Jonáš , and Ševčíková, Magda
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
lexicon , text , and lexicalConceptualResource
Subject:
universal derivations , uder , word-formation , derivation , derivational morphology , lexical network , and harmonization
Language:
Czech , English , Estonian , Finnish , German , French , Latin , Persian , Polish , Portuguese , Spanish , Catalan , Turkish , Scottish Gaelic , Russian , Swedish , Serbo-Croatian , Italian , Dutch , Croatian , and Slovenian
Description:
Universal Derivations (UDer) is a collection of harmonized lexical networks capturing word-formation, especially derivational relations, in a cross-linguistically consistent annotation scheme for many languages. The annotation scheme is based on a rooted tree data structure, in which nodes correspond to lexemes, while edges represent derivational relations or compounding. The current version of the UDer collection contains thirty-one harmonized resources covering twenty-one different languages.
Rights:
Universal Derivations v1.1 License Agreement , PUB , and https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UDer-1.1
Creator:
Žabokrtský, Zdeněk , Bafna, Nyati , Bodnár, Jan , Kyjánek, Lukáš , Svoboda, Emil , Ševčíková, Magda , Vidra, Jonáš , Angle, Sachi , Ansari, Ebrahim , Arkhangelskiy, Timofey , Batsuren, Khuyagbaatar , Bella, Gábor , Bertinetto, Pier Marco , Bonami, Olivier , Celata, Chiara , Daniel, Michael , Fedorenko, Alexei , Filko, Matea , Giunchiglia, Fausto , Haghdoost, Hamid , Hathout, Nabil , Khomchenkova, Irina , Khurshudyan, Victoria , Levonian, Dmitri , Litta, Eleonora , Medvedeva, Maria , Muralikrishna, S. N. , Namer, Fiammetta , Nikravesh, Mahshid , Padó, Sebastian , Passarotti, Marco , Plungian, Vladimir , Polyakov, Alexey , Potapov, Mihail , Pruthwik, Mishra , Rao B, Ashwath , Rubakov, Sergei , Samar, Husain , Sharma, Dipti Misra , Šnajder, Jan , Šojat, Krešimir , Štefanec, Vanja , Talamo, Luigi , Tribout, Delphine , Vodolazsky, Daniil , Vydrin, Arseniy , Zakirova, Aigul , and Zeller, Britta
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text , lexicon , and lexicalConceptualResource
Subject:
universal segmentations , morphological segmentation , word segmentation , segmentation , morphology , morphemes , morphological dictionary , unisegments , morph , and multilingual
Language:
Czech , Catalan , German , English , Persian , Finnish , French , Serbo-Croatian , Croatian , Hungarian , Italian , Komi-Zyrian , Latin , Moksha , Mari (Russia) , Mongolian , Erzya , Polish , Portuguese , Russian , Spanish , Swedish , Tajik , Udmurt , Armenian , Bengali , Hindi , Malayalam , Marathi , and Kannada
Description:
Universal Segmentations (UniSegments) is a collection of lexical resources capturing morphological segmentations harmonised into a cross-linguistically consistent annotation scheme for many languages. The annotation scheme consists of simple tab-separated columns that stores a word and its morphological segmentations, including pieces of information about the word and the segmented units, e.g., part-of-speech categories, type of morphs/morphemes etc. The current public version of the collection contains 38 harmonised segmentation datasets covering 30 different languages.
Rights:
Universal Segmentations 1.0 License Terms , https://lindat.mff.cuni.cz/repository/xmlui/page/licence-unisegs-1.0 , and PUB
Creator:
Jan Patočka
Publisher:
Str. 204–237.
Type:
Text
Subject:
1975 , 1977/34 , 1979/25 , 1981/6 , 1981/7 , 1988/28 , 1988/31 , 1988/32 , 1988/34 , 1994/7 , 1996/4 , 1996/7 , 1998/3 , 1999/8 , 2 , 2001/9 , 2002/21 , 2006/1 , 2007/1 , 2008/3 , be , bg , cs , de , en , es , fr , fulltext , hu , it , lt , no , pl , ru , sr , SS-3/PD-III , sv , uk , and v
Language:
Czech , English , Bulgarian , French , Italian , Lithuanian , Hungarian , German , Norwegian , Polish , Russian , Belarusian , Serbian , Spanish , Swedish , and Ukrainian
Rights:
open access and Rights holder: Archiv Jana Patočky, z.s.
Creator:
Gustafsson, Lars,
Type:
text and monografie
Subject:
Dějiny skandinávských zemí , publicistika , Švédsko , právní vědy, politologie, právníci, politologové , světové dějiny 1492-1648 , and české země 1620-1740
Language:
Swedish
Rights:
unknown
Creator:
Macek, Jiří,
Type:
text and biografie
Subject:
Hudebníci, skladatelé a jiná hudební povolání , Kaprálová, Vítězslava, , skladatelé , hudba , Československo 1918-1945 , hudba, tanec, hudební nástroje , Francie , and světové dějiny 1918-1945
Language:
Swedish
Description:
Obsahuje skladatelovu tvorbu
Rights:
unknown
Creator:
Jan Patočka
Publisher:
Praha (samizdat) 1975, 19 s., Edice Kvart. Stať.
Type:
Text
Subject:
1975 , 1976/8 , 1979/25 , 1981/7 , 1988/19 , 1988/28 , 1988/31 , 1988/32 , 1990/8 , 1994/7 , 1996/4 , 1996/7 , 1996/8 , 1998/3 , 1999/8 , 2000/33 , 2002/1 , 2002/21 , 2006/1 , 2007/1 , 2007/10 , 2008/3 , 299/300 , 5/1981 , AS/PD-6 , bg , cs , de , en , es , fr , fulltext , hu , it , kt , lt , no , pl , ru , SS-3/PD-III , and sv
Language:
Czech , English , Bulgarian , French , Italian , Lithuanian , Hungarian , German , Norwegian , Polish , Russian , Spanish , and Swedish
Rights:
open access and Rights holder: Archiv Jana Patočky, z.s.
Type:
text and slovníky
Subject:
Historická věda. Pomocné vědy historické. Archivnictví , Lingvistika. Jazyky , sfragistika , and terminologie odborná
Language:
Slovak , Czech , Polish , Hungarian , Belarusian , German , Spanish , French , English , Italian , Lithuanian , Norwegian , Dutch , Portuguese , Romanian , Russian , Swedish , and Ukrainian
Description:
"Adaptovaný a ilustrovaný slovensko-česko-poľsko-maďarský preklad Medzinárodného sfragistického slovníka ... s pripojenými prekladmi názvov hesiel v bieloruštine, nemčine, španielčine, francúzštine, angličtine, taliančine, litovčine, nórčine, holandštine, portugalčine, rumunčine, ruštine, švédštine a ukrajinčine"--Strana 5, Přeloženo z francouzštiny?, and Obsahuje rejstříky
Rights:
unknown
Creator:
Majliš, Martin
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
multilingual corpora
Language:
Afrikaans , Tosk Albanian , Amharic , Arabic , Aragonese , Egyptian Arabic , Asturian , Azerbaijani , Belarusian , Bengali , Bosnian , Bishnupriya , Breton , Buginese , Bulgarian , Catalan , Cebuano , Czech , Chuvash , Corsican , Welsh , Danish , German , Dimli (individual language) , Modern Greek (1453-) , English , Esperanto , Estonian , Basque , Faroese , Persian , Finnish , French , Western Frisian , Gan Chinese , Scottish Gaelic , Irish , Galician , Gilaki , Gujarati , Haitian , Serbo-Croatian , Hebrew , Fiji Hindi , Hindi , Croatian , Upper Sorbian , Hungarian , Armenian , Ido , Interlingua (International Auxiliary Language Association) , Indonesian , Icelandic , Italian , Javanese , Japanese , Kannada , Georgian , Kazakh , Korean , Kurdish , Latin , Latvian , Limburgan , Lithuanian , Lombard , Luxembourgish , Malayalam , Marathi , Macedonian , Malagasy , Mongolian , Maori , Malay (macrolanguage) , Burmese , Neapolitan , Low German , Nepali (macrolanguage) , Newari , Dutch , Norwegian Nynorsk , Norwegian , Occitan (post 1500) , Ossetian , Pampanga , Piemontese , Polish , Portuguese , Quechua , Romanian , Russian , Yakut , Sicilian , Scots , Slovak , Slovenian , Spanish , Albanian , Serbian , Sundanese , Swahili (macrolanguage) , Swedish , Tamil , Tatar , Telugu , Tajik , Tagalog , Thai , Turkish , Ukrainian , Urdu , Uzbek , Venetian , Vietnamese , Volapük , Waray (Philippines) , Walloon , Yiddish , Yoruba , and Chinese
Description:
A set of corpora for 120 languages automatically collected from wikipedia and the web.
Collected using the W2C toolset: http://hdl.handle.net/11858/00-097C-0000-0022-60D6-1
Rights:
Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) , http://creativecommons.org/licenses/by-sa/3.0/ , and PUB
Publisher:
University of Leipzig
Type:
corpus
Language:
Afrikaans , Albanian , Bulgarian , Catalan , Chinese , Croatian , Czech , Danish , Dutch , English , Esperanto , Estonian , Finnish , French , German , Hungarian , Icelandic , Indonesian , Italian , Japanese , Korean , Latin , Latvian , Lithuanian , Malay (macrolanguage) , Norwegian , Occitan (post 1500) , Romanian , Russian , Slovak , Slovenian , Spanish , Sundanese , Swedish , Tagalog , Turkish , Vietnamese , and Welsh
Description:
Collected from newspaper texts, webcrawling, etc.: words (+frequency), cooccurrences (+graph), left/right neighbours, example sentences
Rights:
Not specified