Number of results to display per page
Search Results
22. Deltacorpus
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
23. Deltacorpus 1.1
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia). Changes in version 1.1: 1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset. 2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0. 3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
24. Der Bolzanoprozess :
- Creator:
- Winter, Eduard,
- Type:
- text, monografie, and dokumenty
- Subject:
- Filozofie, Bolzano, Bernard,, filozofové, procesy soudní, univerzity, české země 1792-1847, and filozofie, filozofové
- Language:
- German, Italian, and Latin
- Rights:
- unknown
25. Documenta Bohemica bellum tricennale illustrantia.
- Type:
- text, prameny, and edice
- Subject:
- Dějiny Evropy, válka třicetiletá (1618-1648), dějiny vojenství, and české země 1526-1792
- Language:
- German, French, Italian, Latin, and Spanish
- Rights:
- unknown
26. Duce a kacíř :
- Creator:
- Helan, Pavel,
- Type:
- text, studie, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Biografie, Mussolini, Benito,, Hus, Jan,, vztahy česko-italské, vztahy italsko-české, politici italští, legie československé, činnost literární, edice, Itálie, světové dějiny 1918-1945, and politické dějiny, politici
- Language:
- Czech, Latin, and Italian
- Description:
- Část. přeloženo z italštiny
- Rights:
- unknown
27. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Speciano, Cesare,
- Type:
- text, prameny, edice, studie, and korespondence
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Speciano, Cesare,, nunciové, diplomacie, dvory panovnické, protireformace, rekatolizace, politika církevní, fondy archivní, české země 1526-1620, papežství, církevní politika, Habsburská monarchie, světové dějiny 1492-1648, and politické dějiny, politici
- Language:
- Italian, German, and Latin
- Description:
- Částečně přeloženo z latiny a italštiny?
- Rights:
- unknown
28. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Speciano, Cesare,
- Type:
- text, prameny, edice, studie, and korespondence
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Speciano, Cesare,, nunciové, diplomacie, dvory panovnické, protireformace, rekatolizace, politika církevní, fondy archivní, české země 1526-1620, papežství, církevní politika, Habsburská monarchie, světové dějiny 1492-1648, and politické dějiny, politici
- Language:
- Italian, German, and Latin
- Description:
- Částečně přeloženo z latiny a italštiny?
- Rights:
- unknown
29. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Speciano, Cesare,
- Type:
- text, prameny, edice, studie, and korespondence
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Speciano, Cesare,, nunciové, diplomacie, dvory panovnické, protireformace, rekatolizace, politika církevní, fondy archivní, české země 1526-1620, papežství, církevní politika, Habsburská monarchie, světové dějiny 1492-1648, and politické dějiny, politici
- Language:
- Italian, German, and Latin
- Description:
- Částečně přeloženo z latiny a italštiny?
- Rights:
- unknown
30. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Caetani, Antonio,
- Type:
- text, korespondence, prameny, studie, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Caetani, Antonio,, nunciové, dvory panovnické, diplomacie, papežství, politika církevní, české země 1526-1620, papežství, církevní politika, and zahraniční politika, mezinárodní vztahy
- Language:
- Italian, German, and Latin
- Rights:
- unknown