« Previous |
1 - 50 of 96
|
Next »
Number of results to display per page
Search Results
2. Basta György hadvezér Levelezèse és iratai. :
- Creator:
- Basta, Giorgio,
- Type:
- text, korespondence, and edice
- Subject:
- Dějiny zemí střední Evropy, Basta, Giorgio,, velitelé vojenští, šlechtici, Maďarsko, přehledná zpracování (tematicky), and světové dějiny středověku (do r. 1492)
- Language:
- Hungarian, Italian, and Latin
- Rights:
- unknown
3. Bibliografický přehled českých národních písní: seznam studií, starších sbírek rukopisných, sbírek tištěných, překladů s vybranými ukázkami a podrobný abecední ukazatel písní, v knize uvedených i vůbec písní tiskem uveřejněných
- Creator:
- Čeněk Zíbrt and Česká akademie císaře Františka Josefa pro vědy, slovesnost a umění
- Publisher:
- Nákladem České akademie císaře Františka Josefa pro vědy, slovesnost a umění
- Format:
- print, svazek, and 326 stran.
- Type:
- model:monograph and TEXT
- Subject:
- Vokální hudba, Bibliografie. Katalogy, české lidové písně, historické prameny, Česko, 784.4(=162.3), (016), (437.3), 9, 12, 784, and 01
- Language:
- Czech, English, French, German, Italian, Latin, Polish, and Russian
- Description:
- sestavil Čeněk Zíbrt., Obsahuje rejstříky., Částečně souběžný anglický, francouzský, německý, italský, latinský, polský a ruský text, and Vydává III. třída České akademie císaře Františka Josefa pro vědy, slovesnost a umění v Praze
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
4. Česká církev v dějinách /
- Creator:
- Polc, Jaroslav V.
- Type:
- text and sborníky jubilejní
- Subject:
- Dějiny křesťanské církve, Polc, Jaroslav V., sborníky, historici čeští, jubilea životní, dějiny církevní, bibliografie personální, přehledná zpracování dějin českých zemí (chronologicky), církevní a náboženské dějiny, historici (jubilea, nekrology apod.), personální bibliografie, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, Italian, French, Latin, and German
- Description:
- Vydavatel: Katolická teologická fakulta Univerzity Karlovy
- Rights:
- unknown
5. Československo a Svatý stolec.
- Type:
- text, dokumenty, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, církev římskokatolická, správa církevní, politika církevní, vztahy československo-vatikánské, vztahy stát-církev, vztahy diplomatické, Československo 1918-1938, Vatikán, světové dějiny 1918-1945, zahraniční politika, mezinárodní vztahy, and papežství, církevní politika
- Language:
- Czech, Italian, and Latin
- Description:
- Czechoslovakia and the Holy See III. The diplomatic correspondence and other documents 1917-1928.
- Rights:
- unknown
6. Československo a Svatý stolec.
- Publisher:
- Masarykův ústav a Archiv Akademie věd ČR,
- Type:
- dokumenty and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, církev římskokatolická, správa církevní, politika církevní, vztahy československo-vatikánské, vztahy stát-církev, vztahy diplomatické, Československo 1918-1938, Vatikán, světové dějiny 1918-1945, zahraniční politika, mezinárodní vztahy, and papežství, církevní politika
- Language:
- Czech, Italian, and Latin
- Rights:
- unknown
7. Československo a Svatý stolec.
- Type:
- text, dokumenty, prameny, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, církev katolická, politika církevní, vztahy stát-církev, vztahy československo-vatikánské, papežství, vztahy diplomatické, Československo 1918-1938, zahraniční politika, mezinárodní vztahy, Vatikán, světové dějiny 1918-1945, and papežství, církevní politika
- Language:
- Czech, French, Italian, and Latin
- Description:
- Czechslovakia and the Holy See II/2.2. The Sacred Congregation for Extraordinary Ecclesiastical Affairs 1926-1927.
- Rights:
- unknown
8. Československo a Svatý stolec.
- Type:
- text, dokumenty, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, vztahy československo-vatikánské, politika církevní, vztahy stát-církev, diplomacie, Československo 1918-1938, Vatikán, světové dějiny 1918-1945, zahraniční politika, mezinárodní vztahy, and papežství, církevní politika
- Language:
- Czech, French, Italian, and Latin
- Description:
- Czechoslovakia and the Holy See II/3. The Sacred Congregation for Extraordinary Ecclesiastical Affairs 1929-1931.
- Rights:
- unknown
9. Československo a Svatý stolec.
- Type:
- text, dokumenty, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, vztahy československo-vatikánské, vztahy stát-církev, církev římskokatolická, politika církevní, diplomacie, korespondence diplomatická, Československo 1918-1938, Vatikán, světové dějiny 1918-1945, zahraniční politika, mezinárodní vztahy, and papežství, církevní politika
- Language:
- Czech, French, Italian, Latin, and Slovak
- Rights:
- unknown
10. Christian Gottfried Krause: O hudební poezii /
- Creator:
- Krause, Christian Gottfried,
- Type:
- text, spisy, and překlady
- Subject:
- Vokální hudba, Krause, Christian Gottfried,, hudba, literatura, estetika, muzikologie, písně, Německo, světové dějiny 1648-1789, and hudba, tanec, hudební nástroje
- Language:
- Czech, French, German, Italian, and Latin
- Description:
- Převážně přeloženo z němčiny and Christian Gottfried Krause : Von der musiklalischen Poesie - An Annotated Translation.
- Rights:
- unknown
11. Cölestin V. (1294), (Peter vom Morrone), der Engelpapst :
- Creator:
- Herde, Peter,
- Type:
- text and monografie
- Subject:
- Dějiny křesťanské církve, Celestin, kněží, papeži, papežství, poustevníci, světové dějiny středověku (do r. 1492), and papežství, církevní politika
- Language:
- German, Italian, and Latin
- Rights:
- unknown
12. CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data
- Creator:
- Zeman, Daniel and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- tokenization, word segmentation, morphology, tagging, syntax, parsing, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies This package contains the test data in the form in which they ware presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) Universal Dependencies 2.2 (CoNLL 2018) See the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ http://universaldependencies.org/conll18/ Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data conll18-ud-test-2018-05-06 ... CoNLL 2018 test data conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
13. CoNLL 2017 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Straka, Milan, Popel, Martin, Dozat, Timothy, Qi, Peng, Manning, Christopher, Shi, Tianze, Wu, Felix G., Chen, Xilun, Cheng, Yao, Björkelund, Anders, Falenska, Agnieszka, Yu, Xiang, Kuhn, Jonas, Che, Wanxiang, Guo, Jiang, Wang, Yuxuan, Zheng, Bo, Zhao, Huaipeng, Liu, Yang, Teng, Dechuan, Liu, Ting, Lim, Kyungtae, Poibeau, Thierry, Sato, Motoki, Manabe, Hitoshi, Noji, Hiroshi, Matsumoto, Yuji, Kırnap, Ömer, Önder, Berkay Furkan, Yuret, Deniz, Straková, Jana, Vania, Clara, Zhang, Xingxing, Lopez, Adam, Heinecke, Johannes, Asadullah, Munshi, Kanerva, Jenna, Luotolahti, Juhani, Ginter, Filip, Kuan, Yu, Sofroniev, Pavel, Schill, Erik, Hinrichs, Erhard, Nguyen, Dat Quoc, Dras, Mark, Johnson, Mark, Qian, Xian, Vilares, David, Gómez-Rodríguez, Carlos, Aufrant, Lauriane, Wisniewski, Guillaume, Yvon, François, Dumitrescu, Stefan Daniel, Boroş, Tiberiu, Tufiş, Dan, Das, Ayan, Zaffar, Affan, Sarkar, Sudeshna, Wang, Hao, Zhao, Hai, Zhang, Zhisong, Hornby, Ryan, Taylor, Clark, Park, Jungyeul, de Lhoneux, Miryam, Shao, Yan, Basirat, Ali, Kiperwasser, Eliyahu, Stymne, Sara, Goldberg, Yoav, Nivre, Joakim, Akkuş, Burak Kerim, Azizoglu, Heval, Cakici, Ruket, Moor, Christophe, Merlo, Paola, Henderson, James, Wang, Haozhou, Ji, Tao, Wu, Yuanbin, Lan, Man, de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, More, Amir, Tsarfaty, Reut, Kanayama, Hiroshi, Muraoka, Masayasu, Yoshikawa, Katsumasa, Garcia, Marcos, and Gamallo, Pablo
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency parser and parsebank
- Language:
- Arabic, Bulgarian, Russia Buriat, Czech, Catalan, Church Slavic, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, French, Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Swedish, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- This package contains the system outputs from the CoNLL 2017 Shared Task in Multilingual Parsing from Raw Text to Universal Dependencies.
- Rights:
- Licence Universal Dependencies v2.0, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.0, and PUB
14. CoNLL 2018 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Duthoo, Elie, Mesnard, Olivier, Rybak, Piotr, Wróblewska, Alina, Che, Wanxiang, Liu, Yijia, Wang, Yuxuan, Zheng, Bo, Liu, Ting, Li, Zuchao, He, Shexia, Zhang, Zhuosheng, Zhao, Hai, Wu, Yingting, Tong, Jia-Jun, Nguyen, Dat Quoc, Verspoor, Karin, Wan, Hui, Naseem, Tahira, Lee, Young-Suk, Castelli, Vittorio, Ballesteros, Miguel, Hershcovich, Daniel, Abend, Omri, Rappoport, Ari, Smith, Aaron, Bohnet, Bernd, de Lhoneux, Miryam, Nivre, Joakim, Shao, Yan, Stymne, Sara, Kırnap, Ömer, Dayanık, Erenay, Yuret, Deniz, Kanerva, Jenna, Ginter, Filip, Miekka, Niko, Leino, Akseli, Salakoski, Tapio, Lim, KyungTae, Park, Cheoneum, Lee, Changki, Poibeau, Thierry, Bhat, Riyaz Ahmad, Bhat, Irshad, Bangalore, Srinivas, Qi, Peng, Dozat, Timothy, Zhang, Yuhao, Manning, Christopher, Boroș, Tiberiu, Dumitrescu, Stefan Daniel, Burtica, Ruxandra, Arakelyan, Gor, Hambardzumyan, Karen, Khachatrian, Hrant, Rosa, Rudolf, Mareček, David, Straka, Milan, Seker, Amit, More, Amir, Tsarfaty, Reut, Önder, Berkay Furkan, Gümeli, Can, Jawahar, Ganesh, Muller, Benjamin, Fethi, Amal, Martin, Louis, Villemonte de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, Özateş, Şaziye Betül, Özgür, Arzucan, Gungor, Tunga, Öztürk, Balkız, Ji, Tao, Liu, Yufang, Wang, Yijun, Wu, Yuanbin, Lan, Man, Chen, Danlu, Lin, Mengxiao, Hu, Zhifeng, and Qiu, Xipeng
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- parsed data, conllu, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- Test data parsed by systems submitted to the CoNLL 2018 UD parsing shared task.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
15. DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking
- Creator:
- Kubeša, David and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- entity linking, NEL, NER, dataset, and knowledge base
- Language:
- Afrikaans, Arabic, Armenian, Basque, Belarusian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Maltese, Marathi, Modern Greek (1453-), Northern Sami, Norwegian Nynorsk, Persian, Polish, Portuguese, Romanian, Russian, Scottish Gaelic, Serbian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, Uighur, Ukrainian, Urdu, Vietnamese, and Wolof
- Description:
- We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains language-agnostic information about entities, including their claims from Wikidata and named entity types (PER, ORG, LOC, EVENT, BRAND, WORK_OF_ART, MANUFACTURED); and Wikipedia texts with entity mentions linked to the knowledge base, along with language-specific text from Wikidata such as labels, aliases, and descriptions, stored separately for each language. The Wikidata QID is used as a persistent, language-agnostic identifier, enabling the combination of the knowledge base with language-specific texts and information for each entity. Wikipedia documents deliberately annotate only a single mention for every entity present; we further automatically detect all mentions of named entities linked from each document. The dataset contains 27.9M named entities in the knowledge base and 12.3G tokens from Wikipedia texts. The dataset is published under the CC BY-SA licence.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
16. Deep Universal Dependencies 2.4
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, and Galician
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-2988). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.4, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.4, and PUB
17. Deep Universal Dependencies 2.5
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, and Skolt Sami
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3105). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.5, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.5, and PUB
18. Deep Universal Dependencies 2.6
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, and Persian
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3226). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.6, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.6, and PUB
19. Deep Universal Dependencies 2.7
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, and Tupinambá
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3424). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.7, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.7, and PUB
20. Deep Universal Dependencies 2.8
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, Tupinambá, Beja, Western Frisian, Urubú-Kaapor, Kangri, K'iche', Low German, Makuráp, Western Armenian, and Central Siberian Yupik
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3687). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.8, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.8, and PUB
21. Dějiny loutkového divadla v Evropě /
- Creator:
- Magnin, Charles,
- Type:
- text and monografie
- Subject:
- Divadlo. Divadelní představení, divadlo loutkové, přehledná zpracování světových dějin (chronologicky), and divadlo, film, fotografie
- Language:
- Czech, English, French, German, Italian, and Latin
- Description:
- Poznámky
- Rights:
- unknown
22. Deltacorpus
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
23. Deltacorpus 1.1
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia). Changes in version 1.1: 1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset. 2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0. 3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
24. Der Bolzanoprozess :
- Creator:
- Winter, Eduard,
- Type:
- text, monografie, and dokumenty
- Subject:
- Filozofie, Bolzano, Bernard,, filozofové, procesy soudní, univerzity, české země 1792-1847, and filozofie, filozofové
- Language:
- German, Italian, and Latin
- Rights:
- unknown
25. Documenta Bohemica bellum tricennale illustrantia.
- Type:
- text, prameny, and edice
- Subject:
- Dějiny Evropy, válka třicetiletá (1618-1648), dějiny vojenství, and české země 1526-1792
- Language:
- German, French, Italian, Latin, and Spanish
- Rights:
- unknown
26. Duce a kacíř :
- Creator:
- Helan, Pavel,
- Type:
- text, studie, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Biografie, Mussolini, Benito,, Hus, Jan,, vztahy česko-italské, vztahy italsko-české, politici italští, legie československé, činnost literární, edice, Itálie, světové dějiny 1918-1945, and politické dějiny, politici
- Language:
- Czech, Latin, and Italian
- Description:
- Část. přeloženo z italštiny
- Rights:
- unknown
27. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Speciano, Cesare,
- Type:
- text, prameny, edice, studie, and korespondence
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Speciano, Cesare,, nunciové, diplomacie, dvory panovnické, protireformace, rekatolizace, politika církevní, fondy archivní, české země 1526-1620, papežství, církevní politika, Habsburská monarchie, světové dějiny 1492-1648, and politické dějiny, politici
- Language:
- Italian, German, and Latin
- Description:
- Částečně přeloženo z latiny a italštiny?
- Rights:
- unknown
28. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Speciano, Cesare,
- Type:
- text, prameny, edice, studie, and korespondence
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Speciano, Cesare,, nunciové, diplomacie, dvory panovnické, protireformace, rekatolizace, politika církevní, fondy archivní, české země 1526-1620, papežství, církevní politika, Habsburská monarchie, světové dějiny 1492-1648, and politické dějiny, politici
- Language:
- Italian, German, and Latin
- Description:
- Částečně přeloženo z latiny a italštiny?
- Rights:
- unknown
29. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Speciano, Cesare,
- Type:
- text, prameny, edice, studie, and korespondence
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Speciano, Cesare,, nunciové, diplomacie, dvory panovnické, protireformace, rekatolizace, politika církevní, fondy archivní, české země 1526-1620, papežství, církevní politika, Habsburská monarchie, světové dějiny 1492-1648, and politické dějiny, politici
- Language:
- Italian, German, and Latin
- Description:
- Částečně přeloženo z latiny a italštiny?
- Rights:
- unknown
30. Epistulae et acta nuntiorum apostolicorum apud imperatorem 1592-1628.
- Creator:
- Caetani, Antonio,
- Type:
- text, korespondence, prameny, studie, and edice
- Subject:
- Politika a náboženství. Vztahy mezi církví a státem, Caetani, Antonio,, nunciové, dvory panovnické, diplomacie, papežství, politika církevní, české země 1526-1620, papežství, církevní politika, and zahraniční politika, mezinárodní vztahy
- Language:
- Italian, German, and Latin
- Rights:
- unknown
31. Feuer=Lösch=Ordnung Der königl: Residentz Kleinern Stadt Prag. ...
- Creator:
- Weingarten, Jan Jakub
- Format:
- print and [8] ff ; 4°
- Type:
- model:monograph and TEXT
- Subject:
- právo, století 17., and požáry - ochrana
- Language:
- German, Latin, and Italian
- Description:
- Jiné vydání, koncová viněta. and BCBT41548
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
32. Habsburg und Siebenbürgen, 1600-1605 :
- Creator:
- Arens, Meinolf,
- Type:
- text and monografie
- Subject:
- Dějiny států a území na Balkánském poloostrově, Dějiny Evropy, Habsburkové (rod), politika zahraniční, spory územní, operace vojenské, Habsburská monarchie, zahraniční politika, mezinárodní vztahy, and světové dějiny 1492-1648
- Language:
- German, Latin, and Italian
- Description:
- Revision of the author's thesis (doctoral-- Westfalische Wilhelms-Universitat Munster, Wintersemester 2000/2001).
- Rights:
- unknown
33. HamleDT 2.0
- Creator:
- Zeman, Daniel, Mareček, David, Mašek, Jan, Popel, Martin, Ramasamy, Loganathan, Rosa, Rudolf, Štěpánek, Jan, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- treebank, Stanford dependencies, Prague dependencies, harmonization, common annotation style, and Interset
- Language:
- Arabic, Bulgarian, Bengali, Catalan, Czech, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, Ancient Greek (to 1453), Hindi, Hungarian, Italian, Japanese, Latin, Dutch, Portuguese, Romanian, Russian, Slovak, Slovenian, Swedish, Tamil, Telugu, and Turkish
- Description:
- HamleDT 2.0 is a collection of 30 existing treebanks harmonized into a common annotation style, the Prague Dependencies, and further transformed into Stanford Dependencies, a treebank annotation style that became popular recently. We use the newest basic Universal Stanford Dependencies, without added language-specific subtypes.
- Rights:
- HamleDT 2.0 Licence Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-hamledt-2.0, and ACA
34. HamleDT 3.0
- Creator:
- Zeman, Daniel, Mareček, David, Mašek, Jan, Popel, Martin, Ramasamy, Loganathan, Rosa, Rudolf, Štěpánek, Jan, and Žabokrtský, Zdeněk
- Publisher:
- Charles University
- Type:
- text and corpus
- Subject:
- annotated corpus, morphology, syntax, dependency, treebank, harmonized annotation, and common annotation style
- Language:
- Arabic, Basque, Bengali, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Modern Greek (1453-), Ancient Greek (to 1453), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, and Turkish
- Description:
- HamleDT (HArmonized Multi-LanguagE Dependency Treebank) is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. This version uses Universal Dependencies as the common annotation style. Update (November 1017): for a current collection of harmonized dependency treebanks, we recommend using the Universal Dependencies (UD). All of the corpora that are distributed in HamleDT in full are also part of the UD project; only some corpora from the Patch group (where HamleDT provides only the harmonizing scripts but not the full corpus data) are available in HamleDT but not in UD.
- Rights:
- HamleDT 3.0 License Terms, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-hamledt-3.0, and PUB
35. Historický vývoj geometrických transformací /
- Creator:
- Trkovská, Dana
- Type:
- text and monografie
- Subject:
- Geometrie, matematika, geometrie, dějiny matematiky, světové dějiny 1789-1918, světové dějiny od r. 1918 do současnosti, and matematika, kybernetika
- Language:
- Czech, English, French, German, Ancient Greek (to 1453), Italian, and Latin
- Description:
- Nad názvem: katedra didaktiky matematiky MFF UK
- Rights:
- unknown
36. Ius exclusivae :
- Creator:
- Suchánek, Drahomír,
- Publisher:
- Aleš Skřivan ml.,
- Type:
- monografie
- Subject:
- Právo, právo církevní, právo kanonické, papežství, volby papežské, přehledná zpracování světových dějin (chronologicky), and papežství, církevní politika
- Language:
- Czech, Italian, and Latin
- Rights:
- unknown
37. Kaiser Maximilian I. (1459-1519) und die Hofkultur seiner Zeit /
- Type:
- text and biografie
- Subject:
- Dějiny zemí střední Evropy, Maxmilián, sborníky, panovníci habsburské monarchie, kultura dvorská, panovníci čeští, dějiny vědy, umění, kultury a techniky, kulturní vztahy, světové dějiny 1648-1789, světové dějiny 1789-1918, panovníci, panovnické rody, dvory, and české země 1471-1526
- Language:
- German, English, Italian, and Latin
- Rights:
- unknown
38. Le compagnon de tous ou dictionnaire polyglotte pour les écoles, et pour ceux qui s'occupent de lnfues étrangères et aux Arabes qui étudient les Langues Occidentales: enrichi des termes nouveaux de sciences et arts , choisis ou approuvés dans une réunion de sceïkhs. par Louis Calligaris
- Creator:
- Calligaris, Luigi
- Type:
- model:monograph and TEXT
- Language:
- French, Arabic, Latin, Italian, Spanish, Portuguese, German, English, and Modern Greek (1453-)
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
39. Literarische Reise nach Italien im Jahre 1837 zur Aufsuchung von Quellen der böhmischen und mährischen Geschichte /
- Creator:
- Palacký, František,
- Type:
- text and zprávy výzkumné
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, historiografie, vztahy česko-italské, české země 1792-1847, dějepisectví, historické vědy, historici, Itálie, and přehledná zpracování (tematicky)
- Language:
- German, Italian, and Latin
- Description:
- Název na doplňkové titulní stránce: Palacky's italienische Reise im Jahre 1837
- Rights:
- unknown
40. Miscellanea Francesco Ehrle :
- Type:
- text and sborníky jubilejní
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, Ehrle, Franz,, historiografie, and zahraniční periodika a sborníky
- Language:
- Italian, German, and Latin
- Rights:
- unknown
41. Na památku třístého výročí smrti Karla st. z Žerotína :
- Creator:
- <<ze >>Žerotína, Karel,
- Type:
- text, tisky pamětní, korespondence, and edice
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, <<ze >>Žerotína, Karel,, šlechtici, myšlení politické, české země 1526-1620, and šlechta, buržoazie, měšťanstvo, podnikatelé
- Language:
- Czech, French, German, Italian, and Latin
- Description:
- Bibliofilie, Sáňka 5495, 600 výtisků a 10 číslovaných výtisků na měditiskovém papíře Sanders, and Obálkový název: Deset listů Karla st. z Žerotína
- Rights:
- unknown
42. Obsah - Forma :
- Type:
- text and sborníky konferenční
- Subject:
- Výtvarné umění, umění výtvarné, náměty umělecké, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Italian, Latin, and Slovak
- Rights:
- unknown
43. Petrarca v Provence :
- Creator:
- Špička, Jiří,
- Type:
- text and monografie
- Subject:
- Italská literatura (o ní), Petrarca, Francesco,, literatura italská, literatura francouzská, recepce literatury, Itálie, světové dějiny středověku (do r. 1492), literatura, spisovatelé, and Francie
- Language:
- Czech, Italian, and Latin
- Description:
- Částečně obsahuje Petrarcovy verše v italském nebo latinském originálu se souběžným českým překladem (převážně v překladu Jiřího Špičky)
- Rights:
- unknown
44. Plaintext Wikipedia dump 2018
- Creator:
- Rosa, Rudolf
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- Wikipedia, text corpora, and monolingual corpus
- Language:
- Abkhazian, Achinese, Adyghe, Afrikaans, Akan, Tosk Albanian, Amharic, Old English (ca. 450-1100), Arabic, Official Aramaic (700-300 BCE), Aragonese, Egyptian Arabic, Assamese, Asturian, Atikamekw, Avaric, Aymara, South Azerbaijani, Azerbaijani, Bashkir, Bambara, Bavarian, Central Bikol, Belarusian, Bengali, Bislama, Banjar, Tibetan, Bosnian, Bishnupriya, Breton, Buginese, Bulgarian, Russia Buriat, Catalan, Min Dong Chinese, Cebuano, Czech, Chamorro, Chechen, Cherokee, Church Slavic, Chuvash, Cheyenne, Central Kurdish, Cornish, Corsican, Cree, Crimean Tatar, Kashubian, Welsh, Danish, German, Dinka, Dimli (individual language), Dhivehi, Lower Sorbian, Dzongkha, Modern Greek (1453-), English, Esperanto, Estonian, Basque, Ewe, Extremaduran, Faroese, Persian, Fijian, Finnish, French, Arpitan, Northern Frisian, Western Frisian, Fulah, Friulian, Gagauz, Gan Chinese, Scottish Gaelic, Irish, Galician, Gilaki, Manx, Goan Konkani, Gothic, Guarani, Gujarati, Hakka Chinese, Haitian, Hausa, Hawaiian, Serbo-Croatian, Hebrew, Herero, Fiji Hindi, Hindi, Hiri Motu, Croatian, Upper Sorbian, Hungarian, Armenian, Igbo, Ido, Inuktitut, Interlingue, Iloko, Interlingua (International Auxiliary Language Association), Indonesian, Inupiaq, Icelandic, Italian, Jamaican Creole English, Javanese, Lojban, Japanese, Kara-Kalpak, Kabyle, Kalaallisut, Kannada, Kashmiri, Georgian, Kanuri, Kazakh, Kabardian, Kabiyè, Khmer, Kikuyu, Kinyarwanda, Kirghiz, Komi-Permyak, Komi, Kongo, Korean, Karachay-Balkar, Kölsch, Kurdish, Ladino, Lao, Latin, Latvian, Lak, Lezghian, Ligurian, Limburgan, Lingala, Lithuanian, Lombard, Northern Luri, Latgalian, Luxembourgish, Ganda, Literary Chinese, Marshallese, Maithili, Malayalam, Marathi, Moksha, Eastern Mari, Minangkabau, Macedonian, Malagasy, Maltese, Mongolian, Maori, Western Mari, Malay (macrolanguage), Creek, Mirandese, Burmese, Erzya, Mazanderani, Min Nan Chinese, Neapolitan, Nauru, Navajo, Ndonga, Low German, Nepali (macrolanguage), Newari, Dutch, Norwegian Nynorsk, Norwegian, Novial, Pedi, Nyanja, Occitan (post 1500), Livvi, Oriya (macrolanguage), Oromo, Ossetian, Pangasinan, Pampanga, Panjabi, Papiamento, Picard, Pennsylvania German, Pfaelzisch, Pitcairn-Norfolk, Pali, Piemontese, Western Panjabi, Pontic, Polish, Portuguese, Pushto, Quechua, Vlax Romani, Romansh, Romanian, Rusyn, Rundi, Macedo-Romanian, Russian, Sango, Yakut, Sanskrit, Sicilian, Scots, Samogitian, Sinhala, Slovak, Slovenian, Northern Sami, Samoan, Shona, Sindhi, Somali, Southern Sotho, Spanish, Albanian, Sardinian, Sranan Tongo, Serbian, Swati, Saterfriesisch, Sundanese, Swahili (macrolanguage), Swedish, Silesian, Tahitian, Tamil, Tatar, Tulu, Telugu, Tama (Colombia), Tetum, Tajik, Tagalog, Thai, Tigrinya, Tonga (Tonga Islands), Tok Pisin, Tswana, Tsonga, Turkmen, Tumbuka, Turkish, Twi, Tuvinian, Udmurt, Uighur, Ukrainian, Urdu, Uzbek, Venetian, Venda, Veps, Vietnamese, Vlaams, Volapük, Võro, Waray (Philippines), Walloon, Wolof, Wu Chinese, Kalmyk, Xhosa, Mingrelian, Yiddish, Yoruba, Yue Chinese, Zeeuws, Zhuang, Chinese, Zulu, and Dotyali
- Description:
- Wikipedia plain text data obtained from Wikipedia dumps with WikiExtractor in February 2018. The data come from all Wikipedias for which dumps could be downloaded at [https://dumps.wikimedia.org/]. This amounts to 297 Wikipedias, usually corresponding to individual languages and identified by their ISO codes. Several special Wikipedias are included, most notably "simple" (Simple English Wikipedia) and "incubator" (tiny hatching Wikipedias in various languages). For a list of all the Wikipedias, see [https://meta.wikimedia.org/wiki/List_of_Wikipedias]. The script which can be used to get new version of the data is included, but note that Wikipedia limits the download speed for downloading a lot of the dumps, so it takes a few days to download all of them (but one or a few can be downloaded fast). Also, the format of the dumps changes time to time, so the script will probably eventually stop working one day. The WikiExtractor tool [http://medialab.di.unipi.it/wiki/Wikipedia_Extractor] used to extract text from the Wikipedia dumps is not mine, I only modified it slightly to produce plaintext outputs [https://github.com/ptakopysk/wikiextractor].
- Rights:
- Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB
45. Pohřeb Jeho Eminence Miloslava kardinála Vlka, emeritního arcibiskupa pražského :
- Type:
- text and tisky pamětní
- Subject:
- Liturgie. Křesťanské umění a symbolika. Duchovní život, Vlk, Miloslav,, arcibiskupové pražští, kardinálové, pohřby, Československo 1918-1992, and jednotlivci (církevní dějiny)
- Language:
- Czech, Italian, and Latin
- Rights:
- unknown
46. Poselství republiky dubrovnické k císařovně Kateřině II. v l. 1771-1775 :
- Creator:
- Jireček, Konstantin Josef,
- Type:
- text and monografie
- Subject:
- Dějiny států a území na Balkánském poloostrově, Kateřina, vztahy mezinárodní, Rusko, světové dějiny 1648-1789, zahraniční politika, mezinárodní vztahy, and Chorvatsko
- Language:
- Czech, Croatian, Italian, Latin, and Ukrainian
- Rights:
- unknown
47. Project Gutenberg
- Type:
- corpus
- Language:
- Danish, Dutch, English, Finnish, French, German, Italian, Latin, Portuguese, Russian, Spanish, Swedish, and Telugu
- Description:
- Possibility to download or to browse free electronic books; Angebot: Download von und Online-Zugang zu frei verfügbaren E-Books; deutschsprachige Literatur stellt nur einen Teilbereich der verfügbaren E-Books dar
- Rights:
- Not specified
48. Raccolta praghese di scritti di Luca Fieschi /
- Creator:
- Hledíková, Zdeňka,
- Type:
- text, korespondence, and edice
- Subject:
- Křesťanské církve, sekty, denominace, Biografie, Fieschi, Luca,, kardinálové italští, papežství, české země 1306-1419, and jednotlivci (církevní dějiny)
- Language:
- Italian and Latin
- Rights:
- unknown
49. Roma - Praga, Praha - Řím :
- Type:
- text and sborníky jubilejní
- Subject:
- Dějiny Česka a Slovenska, Hledíková, Zdeňka,, historici čeští, jubilea životní, and české (československé) sborníky a kolektivní monografie
- Language:
- Italian, English, German, French, Latin, and Czech
- Description:
- 400 výt.
- Rights:
- unknown
50. Rukopisy palácové knihovny hrabat Czerninů z Chudenic v Praze na Hradčanech dochované ve fondu pražské lobkowiczké knihovny v Národní knihovně České republiky.
- Creator:
- Svobodová, Milada,
- Type:
- text and katalogy
- Subject:
- Rukopisy, prvotisky, staré tisky. Vzácná a pozoruhodná díla, Czerninové z Chudenic (rod), Lobkowiczové (rod), rukopisy, knihovny soukromé, and knihovny šlechtické
- Language:
- Czech, English, French, German, Italian, and Latin
- Description:
- Název z titulní obrazovky
- Rights:
- unknown