Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Mareček, David , Yu, Zhiwei , Zeman, Daniel , and Žabokrtský, Zdeněk
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
part of speech , tagging , semi-supervised , and cross-language
Language:
Belarusian , Bosnian , Bulgarian , Czech , Serbo-Croatian , Croatian , Upper Sorbian , Macedonian , Polish , Russian , Slovak , Slovenian , Serbian , Ukrainian , Latvian , Lithuanian , Afrikaans , Danish , German , English , Faroese , Western Frisian , Swiss German , Icelandic , Limburgan , Luxembourgish , Low German , Dutch , Norwegian Nynorsk , Norwegian , Scots , Swedish , Yiddish , Aragonese , Asturian , Catalan , French , Galician , Haitian , Italian , Latin , Lombard , Neapolitan , Piemontese , Portuguese , Romanian , Spanish , Venetian , Walloon , Breton , Welsh , Scottish Gaelic , Irish , Modern Greek (1453-) , Armenian , Albanian , Dimli (individual language) , Persian , Gilaki , Kurdish , Tajik , Bengali , Bishnupriya , Gujarati , Fiji Hindi , Hindi , Marathi , Nepali (macrolanguage) , Urdu , Amharic , Arabic , Egyptian Arabic , Hebrew , Estonian , Finnish , Hungarian , Basque , Georgian , Chuvash , Azerbaijani , Turkish , Uzbek , Kazakh , Tatar , Yakut , Korean , Mongolian , Telugu , Kannada , Malayalam , Tamil , Newari , Vietnamese , Indonesian , Javanese , Malagasy , Maori , Malay (macrolanguage) , Pampanga , Sundanese , Tagalog , Waray (Philippines) , Swahili (macrolanguage) , Esperanto , Ido , Interlingua (International Auxiliary Language Association) , and Volapük
Description:
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
Rights:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) , http://creativecommons.org/licenses/by-sa/4.0/ , and PUB
Creator:
Mareček, David , Yu, Zhiwei , Zeman, Daniel , and Žabokrtský, Zdeněk
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
part of speech , tagging , semi-supervised , and cross-language
Language:
Belarusian , Bosnian , Bulgarian , Czech , Serbo-Croatian , Croatian , Upper Sorbian , Macedonian , Polish , Russian , Slovak , Slovenian , Serbian , Ukrainian , Latvian , Lithuanian , Afrikaans , Danish , German , English , Faroese , Western Frisian , Swiss German , Icelandic , Limburgan , Luxembourgish , Low German , Dutch , Norwegian Nynorsk , Norwegian , Scots , Swedish , Yiddish , Aragonese , Asturian , Catalan , French , Galician , Haitian , Italian , Latin , Lombard , Neapolitan , Piemontese , Portuguese , Romanian , Spanish , Venetian , Walloon , Breton , Welsh , Scottish Gaelic , Irish , Modern Greek (1453-) , Armenian , Albanian , Dimli (individual language) , Persian , Gilaki , Kurdish , Tajik , Bengali , Bishnupriya , Gujarati , Fiji Hindi , Hindi , Marathi , Nepali (macrolanguage) , Urdu , Amharic , Arabic , Egyptian Arabic , Hebrew , Estonian , Finnish , Hungarian , Basque , Georgian , Chuvash , Azerbaijani , Turkish , Uzbek , Kazakh , Tatar , Yakut , Korean , Mongolian , Telugu , Kannada , Malayalam , Tamil , Newari , Vietnamese , Indonesian , Javanese , Malagasy , Maori , Malay (macrolanguage) , Pampanga , Sundanese , Tagalog , Waray (Philippines) , Swahili (macrolanguage) , Esperanto , Ido , Interlingua (International Auxiliary Language Association) , and Volapük
Description:
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
Changes in version 1.1:
1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset.
2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0.
3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
Rights:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) , http://creativecommons.org/licenses/by-sa/4.0/ , and PUB
Publisher:
Marsigli
Format:
print and 8 pp + 3 obr. příl. ; 8°
Type:
model:monograph and TEXT
Subject:
století 19. and letectví
Language:
Italian
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Creator:
Gios, Pierantonio
Publisher:
Civis,
Subject:
vizitace biskupské , biskupství padovské , světové dějiny středověku (do r. 1492) , Itálie , and církevní správa a hospodářství
Language:
Italian
Rights:
unknown
Creator:
Amendola, Nadia
Type:
text and studie
Subject:
Muzikologie. Dějiny hudby , Ghedini, Giorgio Federico, , skladatelé , hudba , styly hudební , Itálie , světové dějiny 1918-1945 , and hudba, tanec, hudební nástroje
Language:
Italian
Description:
Příklady odklonu od minulosti ve vokální komorní hudbě G. F. Ghediniho: církevní, archaické a populární aspekty.
Rights:
unknown
Creator:
Meyerbeer, Giacomo and Schmidt, J. P.
Publisher:
in der Schlesingerschen Buch- und Musikhandlung
Format:
hudebnina and 1 klavírní výtah (183 stran) ; 29 x 33 cm
Type:
notated music , sheetmusic , model:sheetmusic , and TEXT
Subject:
opery , vzácné hudební tisky MKP , and klavírní výtahy vokálních děl s vokální linkou
Language:
German and Italian
Description:
componirt von Giacomo Meyerbeer and vollständiger Klavier-Auszug mit deutschem und italienischem Text von J. P. Schmidt
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
Type:
text and encyklopedie
Subject:
Italská literatura (o ní) , Alighieri, Dante, , básníci italští , dějiny literatury , encyklopedie oborové , Itálie , literatura, spisovatelé , světové dějiny středověku (do r. 1492) , and oborové slovníky
Language:
Italian
Rights:
unknown
Creator:
Pallottino, Massimo,
Type:
text and monografie
Subject:
Dějiny zemí starověkého světa , dějiny etnik , antika , říše římská , Etruskové , přehledná zpracování (tematicky) , and Etruskové, starověký Řím
Language:
Italian
Rights:
unknown
Type:
text and katalogy
Subject:
Dopravní prostředky , doprava železniční , technika železniční , české země 1848-1918 , dopravní technika , and Československo 1918-1992
Language:
Italian , French , and German
Description:
Název z obálky, Rok vyd. z katalogu NTK, and Reprint dobových katalogů, lišících se pouze obr. částí - otevřené a uzavřené nákladní vagony
Rights:
unknown
Creator:
Pardubský, Matěj
Publisher:
Pardubský, Matěj
Format:
print and [4] ff ; 4°
Type:
model:monograph and TEXT
Subject:
století 17. , poezie , Rambauzius, Václav, -1625 , and Truplová, Anna, -1614
Language:
Italian and Czech
Description:
K15777
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public