Language: Hebrew - LINDAT/CLARIAH-CZ Catalog Search Results

Start Over Language Hebrew

1. "A vypravuj synu svému--" :

Type:: text
Subject:: Judaismus, katalogy výstav, muzea židovská, Židé, hagady, sbírky muzejní, přehledná zpracování světových dějin (chronologicky), židovská věda, kultura a školství, and česká a československá muzea, galerie, expozice
Language:: Czech, English, and Hebrew
Description:: 1000 výt.
Rights:: unknown

2. "Yayin nesech" (Gentile's wine) in Mahara's Teachings, Rabbi Yehuda Loeb son of Bezalel of Prague Reading his Sermon: Sermon on the Precepts /

Creator:: Dushinsky, Michael
Type:: studie
Subject:: Judaismus, Jehuda Leva ben Becalel,, judaismus, české země 1526-1620, and židovské náboženství, filozofie
Language:: Hebrew
Description:: Text v hebrejštině
Rights:: unknown

3. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0)

Creator:: Savary, Agata, Ramisch, Carlos, Cordeiro, Silvio Ricardo, Sangati, Federico, Vincze, Veronika, QasemiZadeh, Behrang, Candito, Marie, Cap, Fabienne, Giouli, Voula, Stoyanova, Ivelina, Doucet, Antoine, Adalı, Kübra, Barbu Mititelu, Verginica, Bejček, Eduard, El Maarouf, Ismail, Eryiğit, Gülşen, Galea, Luke, Ha-Cohen Kerner, Yaakov, Liebeskind, Chaya, Monti, Johanna, Parra Escartín, Carla, Kovalevskaitė, Jolanta, Krek, Simon, van der Plas, Lonneke, Aceta, Cristina, Aduriz, Itziar, Antoine, Jean-Yves, Attard, Greta, Azzopardi, Kirsty, Boizou, Loic, Bonnici, Janice, Boz, Mert, Bumbulienė, Ieva, Busuttil, Jael, Caruso, Valeria, Cherchi, Manuela, Constant, Matthieu, Czerepowicka, Monika, De Santis, Anna, Dimitrova, Tsvetana, Dinç, Tutkum, Elyovich, Hevi, Fabri, Ray, Farrugia, Alison, Findlay, Jamie, Fotopoulou, Aggeliki, Foufi, Vassiliki, Galea, Sara Anne, Gantar, Polona, Gatt, Albert, Gatt, Anabelle, Herrero, Carlos, Iñurrieta, Uxoa, Jagfeld, Glorianna, Hnátková, Milena, Ionescu, Mihaela, Klyueva, Natalia, Koeva, Svetla, Kovács, Viktória, Kuzman, Taja, Leseva, Svetlozara, Louisou, Sevi, Lynn, Teresa, Malka, Ruth, Martínez Alonso, Héctor, McCrae, John, de Medeiros Caseli, Helena, Miral, Ayşenur, Muscat, Amanda, Nivre, Joakim, Oakes, Michael, Onofrei, Mihaela, Parmentier, Yannick, Pasquer, Caroline, Pia di Buono, Maria, Priego Sanchez, Belem, Raffone, Annalisa, Ramisch, Renata, Rimkutė, Erika, Rizea, Monica-Mihaela, Simkó, Katalin, Spagnol, Michael, Stefanova, Valentina, Stymne, Sara, Sulubacak, Umut, Tabone, Nicole, Tanti, Marc, Todorova, Maria, Urešová, Zdenka, Villavicencio, Aline, and Zilio, Leonardo
Publisher:: PARSEME
Type:: text and corpus
Subject:: Multiword expressions, verbal multiword expressions, idioms, light-verb constructions, verb-particle constructions, and inherently reflexive verbs
Language:: Bulgarian, Czech, German, Modern Greek (1453-), Spanish, Persian, French, Hebrew, Hungarian, Italian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovenian, Swedish, and Turkish
Description:: The PARSEME shared task aims at identifying verbal MWEs in running texts. Verbal MWEs include idioms (let the cat out of the bag), light verb constructions (make a decision), verb-particle constructions (give up), and inherently reflexive verbs (se suicider 'to suicide' in French). VMWEs were annotated according to the universal guidelines in 18 languages. The corpora are provided in the parsemetsv format, inspired by the CONLL-U format. For most languages, paired files in the CONLL-U format - not necessarily using UD tagsets - containing parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training and test data, tools and the universal guidelines file.
Rights:: PARSEME Shared Task Data (v. 1.0) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.0, and PUB

4. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)

Creator:: Ramisch, Carlos, Cordeiro, Silvio Ricardo, Savary, Agata, Vincze, Veronika, Barbu Mititelu, Verginica, Bhatia, Archna, Buljan, Maja, Candito, Marie, Gantar, Polona, Giouli, Voula, Güngör, Tunga, Hawwari, Abdelati, Iñurrieta, Uxoa, Kovalevskaitė, Jolanta, Krek, Simon, Lichte, Timm, Liebeskind, Chaya, Monti, Johanna, Parra Escartín, Carla, QasemiZadeh, Behrang, Ramisch, Renata, Schneider, Nathan, Stoyanova, Ivelina, Vaidya, Ashwini, Walsh, Abigail, Aceta, Cristina, Aduriz, Itziar, Antoine, Jean-Yves, Arhar Holdt, Špela, Berk, Gözde, Bielinskienė, Agnė, Blagus, Goranka, Boizou, Loic, Bonial, Claire, Caruso, Valeria, Čibej, Jaka, Constant, Matthieu, Cook, Paul, Diab, Mona, Dimitrova, Tsvetana, Ehren, Rafael, Elbadrashiny, Mohamed, Elyovich, Hevi, Erden, Berna, Estarrona, Ainara, Fotopoulou, Aggeliki, Foufi, Vassiliki, Geeraert, Kristina, van Gompel, Maarten, Gonzalez, Itziar, Gurrutxaga, Antton, Ha-Cohen Kerner, Yaakov, Ibrahim, Rehab, Ionescu, Mihaela, Jain, Kanishka, Jazbec, Ivo-Pavao, Kavčič, Teja, Klyueva, Natalia, Kocijan, Kristina, Kovács, Viktória, Kuzman, Taja, Leseva, Svetlozara, Ljubešić, Nikola, Malka, Ruth, Markantonatou, Stella, Martínez Alonso, Héctor, Matas, Ivana, McCrae, John, de Medeiros Caseli, Helena, Onofrei, Mihaela, Palka-Binkiewicz, Emilia, Papadelli, Stella, Parmentier, Yannick, Pascucci, Antonio, Pasquer, Caroline, Pia di Buono, Maria, Puri, Vandana, Raffone, Annalisa, Ratori, Shraddha, Riccio, Anna, Sangati, Federico, Shukla, Vishakha, Simkó, Katalin, Šnajder, Jan, Somers, Clarissa, Srivastava, Shubham, Stefanova, Valentina, Taslimipoor, Shiva, Theoxari, Natasa, Todorova, Maria, Urizar, Ruben, Villavicencio, Aline, and Zilio, Leonardo
Publisher:: PARSEME
Type:: text and corpus
Subject:: Multiword expressions, verbal multiword expressions, light-verb constructions, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
Language:: Bulgarian, German, Modern Greek (1453-), Spanish, Persian, French, Hebrew, Hungarian, Italian, Lithuanian, Polish, Portuguese, Romanian, Slovenian, Turkish, Hindi, Basque, English, and Croatian
Description:: This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). VMWEs were annotated according to the universal guidelines in 19 languages. The corpora are provided in the cupt format, inspired by the CONLL-U format. The corpora were used in the 1.1 edition of the PARSEME Shared Task (2018). For most languages, morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.1 (2018). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1
Rights:: PARSEME Shared Task Data (v. 1.1) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.1, and PUB

5. Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)

Creator:: Ramisch, Carlos, Guillaume, Bruno, Savary, Agata, Waszczuk, Jakub, Candito, Marie, Vaidya, Ashwini, Barbu Mititelu, Verginica, Bhatia, Archna, Iñurrieta, Uxoa, Giouli, Voula, Güngör, Tunga, Jiang, Menghan, Lichte, Timm, Liebeskind, Chaya, Monti, Johanna, Ramisch, Renata, Stymme, Sara, Walsh, Abigail, Xu, Hongzhi, Palka-Binkiewicz, Emilia, Ehren, Rafael, Stymne, Sara, Constant, Matthieu, Pasquer, Caroline, Parmentier, Yannick, Antoine, Jean-Yves, Carlino, Carola, Caruso, Valeria, Di Buono, Maria Pia, Pascucci, Antonio, Raffone, Annalisa, Riccio, Anna, Sangati, Federico, Speranza, Giulia, Cordeiro, Silvio Ricardo, de Medeiros Caseli, Helena, Miranda, Isaac, Rademaker, Alexandre, Vale, Oto, Villavicencio, Aline, Wick Pedro, Gabriela, Wilkens, Rodrigo, Zilio, Leonardo, Rizea, Monica-Mihaela, Ionescu, Mihaela, Onofrei, Mihaela, Chen, Jia, Ge, Xiaomin, Hu, Fangyuan, Hu, Sha, Li, Minli, Liu, Siyuan, Qin, Zhenzhen, Sun, Ruilong, Wang, Chenweng, Xiao, Huangyang, Yan, Peiyi, Yih, Tsy, Yu, Ke, Yu, Songping, Zeng, Si, Zhang, Yongchen, Zhao, Yun, Foufi, Vassiliki, Fotopoulou, Aggeliki, Markantonatou, Stella, Papadelli, Stella, Louizou, Sevasti, Aduriz, Itziar, Estarrona, Ainara, Gonzalez, Itziar, Gurrutxaga, Antton, Uria, Larraitz, Urizar, Ruben, Foster, Jennifer, Lynn, Teresa, Elyovitch, Hevi, Ha-Cohen Kerner, Yaakov, Malka, Ruth, Jain, Kanishka, Puri, Vandana, Ratori, Shraddha, Shukla, Vishakha, Srivastava, Shubham, Berk, Gozde, Erden, Berna, and Yirmibeşoğlu, Zeynep
Publisher:: PARSEME
Type:: text and corpus
Subject:: multiword expressions, verbal multiword expressions, light verb construction, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
Language:: German, Modern Greek (1453-), Basque, French, Irish, Hebrew, Hindi, Italian, Polish, Portuguese, Romanian, Swedish, Turkish, and Chinese
Description:: This multilingual resource contains corpora in which verbal MWEs have been manually annotated, gathered at the occasion of the 1.2 edition of the PARSEME Shared Task on semi-supervised Identification of Verbal MWEs (2020). VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). For the 1.2 shared task edition, the data covers 14 languages, for which VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CONLL-U format. Morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.2 (2020). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.2
Rights:: PARSEME Shared Task Data (v. 1.2) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.2, and PUB

6. Biblí svatá, aneb, Všecka Svatá písma Starého i Nového zákona: podle posledního vydání kralického z roku 1613

Publisher:: nákladem Biblické společnosti britické a zahraniční
Format:: print, text, regular print, bez média, svazek, and 831, 270 stran, 8 nečíslovaných stran obrazových příloh : mapy ; 17 cm
Type:: model:monograph and TEXT
Subject:: Bible. Biblistika, biblické texty, 27-232/-236, (0:82-9), 5, and 2-23/-27
Language:: Czech, Official Aramaic (700-300 BCE), Ancient Greek (to 1453), and Hebrew
Description:: Přeloženo z aramejštiny, hebrejštiny a řečtiny, Část Nový zákon má vlastní titulní stránku včetně stejných nakladatelských údajů, and Obsahuje bibliografické odkazy
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

16. Čítanka ranně novověké aškánázské paleografie. 1 Obrazová část. 2 Transkripce, překlady a komentáře /

Creator:: Sixtová, Olga,
Type:: text and monografie
Subject:: Historická věda. Pomocné vědy historické. Archivnictví, paleografie, písmo hebrejské, jazyk hebrejský, čítanky, novověk raný, rukopisy hebrejské, české země 1306-1526, české země 1526-1792, and židovské náboženství, filozofie
Language:: Czech and Hebrew
Rights:: unknown

17. CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data

Creator:: Zeman, Daniel and Straka, Milan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: tokenization, word segmentation, morphology, tagging, syntax, parsing, and universal dependencies
Language:: Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
Description:: CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies This package contains the test data in the form in which they ware presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) Universal Dependencies 2.2 (CoNLL 2018) See the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ http://universaldependencies.org/conll18/ Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data conll18-ud-test-2018-05-06 ... CoNLL 2018 test data conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems.
Rights:: Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB

18. CoNLL 2017 Shared Task System Outputs

Creator:: Zeman, Daniel, Potthast, Martin, Straka, Milan, Popel, Martin, Dozat, Timothy, Qi, Peng, Manning, Christopher, Shi, Tianze, Wu, Felix G., Chen, Xilun, Cheng, Yao, Björkelund, Anders, Falenska, Agnieszka, Yu, Xiang, Kuhn, Jonas, Che, Wanxiang, Guo, Jiang, Wang, Yuxuan, Zheng, Bo, Zhao, Huaipeng, Liu, Yang, Teng, Dechuan, Liu, Ting, Lim, Kyungtae, Poibeau, Thierry, Sato, Motoki, Manabe, Hitoshi, Noji, Hiroshi, Matsumoto, Yuji, Kırnap, Ömer, Önder, Berkay Furkan, Yuret, Deniz, Straková, Jana, Vania, Clara, Zhang, Xingxing, Lopez, Adam, Heinecke, Johannes, Asadullah, Munshi, Kanerva, Jenna, Luotolahti, Juhani, Ginter, Filip, Kuan, Yu, Sofroniev, Pavel, Schill, Erik, Hinrichs, Erhard, Nguyen, Dat Quoc, Dras, Mark, Johnson, Mark, Qian, Xian, Vilares, David, Gómez-Rodríguez, Carlos, Aufrant, Lauriane, Wisniewski, Guillaume, Yvon, François, Dumitrescu, Stefan Daniel, Boroş, Tiberiu, Tufiş, Dan, Das, Ayan, Zaffar, Affan, Sarkar, Sudeshna, Wang, Hao, Zhao, Hai, Zhang, Zhisong, Hornby, Ryan, Taylor, Clark, Park, Jungyeul, de Lhoneux, Miryam, Shao, Yan, Basirat, Ali, Kiperwasser, Eliyahu, Stymne, Sara, Goldberg, Yoav, Nivre, Joakim, Akkuş, Burak Kerim, Azizoglu, Heval, Cakici, Ruket, Moor, Christophe, Merlo, Paola, Henderson, James, Wang, Haozhou, Ji, Tao, Wu, Yuanbin, Lan, Man, de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, More, Amir, Tsarfaty, Reut, Kanayama, Hiroshi, Muraoka, Masayasu, Yoshikawa, Katsumasa, Garcia, Marcos, and Gamallo, Pablo
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: dependency parser and parsebank
Language:: Arabic, Bulgarian, Russia Buriat, Czech, Catalan, Church Slavic, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, French, Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Swedish, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
Description:: This package contains the system outputs from the CoNLL 2017 Shared Task in Multilingual Parsing from Raw Text to Universal Dependencies.
Rights:: Licence Universal Dependencies v2.0, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.0, and PUB

19. CoNLL 2018 Shared Task System Outputs

Creator:: Zeman, Daniel, Potthast, Martin, Duthoo, Elie, Mesnard, Olivier, Rybak, Piotr, Wróblewska, Alina, Che, Wanxiang, Liu, Yijia, Wang, Yuxuan, Zheng, Bo, Liu, Ting, Li, Zuchao, He, Shexia, Zhang, Zhuosheng, Zhao, Hai, Wu, Yingting, Tong, Jia-Jun, Nguyen, Dat Quoc, Verspoor, Karin, Wan, Hui, Naseem, Tahira, Lee, Young-Suk, Castelli, Vittorio, Ballesteros, Miguel, Hershcovich, Daniel, Abend, Omri, Rappoport, Ari, Smith, Aaron, Bohnet, Bernd, de Lhoneux, Miryam, Nivre, Joakim, Shao, Yan, Stymne, Sara, Kırnap, Ömer, Dayanık, Erenay, Yuret, Deniz, Kanerva, Jenna, Ginter, Filip, Miekka, Niko, Leino, Akseli, Salakoski, Tapio, Lim, KyungTae, Park, Cheoneum, Lee, Changki, Poibeau, Thierry, Bhat, Riyaz Ahmad, Bhat, Irshad, Bangalore, Srinivas, Qi, Peng, Dozat, Timothy, Zhang, Yuhao, Manning, Christopher, Boroș, Tiberiu, Dumitrescu, Stefan Daniel, Burtica, Ruxandra, Arakelyan, Gor, Hambardzumyan, Karen, Khachatrian, Hrant, Rosa, Rudolf, Mareček, David, Straka, Milan, Seker, Amit, More, Amir, Tsarfaty, Reut, Önder, Berkay Furkan, Gümeli, Can, Jawahar, Ganesh, Muller, Benjamin, Fethi, Amal, Martin, Louis, Villemonte de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, Özateş, Şaziye Betül, Özgür, Arzucan, Gungor, Tunga, Öztürk, Balkız, Ji, Tao, Liu, Yufang, Wang, Yijun, Wu, Yuanbin, Lan, Man, Chen, Danlu, Lin, Mengxiao, Hu, Zhifeng, and Qiu, Xipeng
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: parsed data, conllu, and universal dependencies
Language:: Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
Description:: Test data parsed by systems submitted to the CoNLL 2018 UD parsing shared task.
Rights:: Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB

20. Da'at tvunot :

Creator:: Luzato, Moše Chajim,
Type:: text, monografie, prameny, and edice
Subject:: Judaismus, Luzato, Moše Chajim,, rabíni, filozofie židovská, Itálie, světové dějiny 1648-1789, and židovské náboženství, filozofie
Language:: Czech and Hebrew
Description:: Částečně přeloženo z angličtiny a hebrejštiny
Rights:: unknown

21. DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking

Creator:: Kubeša, David and Straka, Milan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: entity linking, NEL, NER, dataset, and knowledge base
Language:: Afrikaans, Arabic, Armenian, Basque, Belarusian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Maltese, Marathi, Modern Greek (1453-), Northern Sami, Norwegian Nynorsk, Persian, Polish, Portuguese, Romanian, Russian, Scottish Gaelic, Serbian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, Uighur, Ukrainian, Urdu, Vietnamese, and Wolof
Description:: We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains language-agnostic information about entities, including their claims from Wikidata and named entity types (PER, ORG, LOC, EVENT, BRAND, WORK_OF_ART, MANUFACTURED); and Wikipedia texts with entity mentions linked to the knowledge base, along with language-specific text from Wikidata such as labels, aliases, and descriptions, stored separately for each language. The Wikidata QID is used as a persistent, language-agnostic identifier, enabling the combination of the knowledge base with language-specific texts and information for each entity. Wikipedia documents deliberately annotate only a single mention for every entity present; we further automatically detect all mentions of named entities linked from each document. The dataset contains 27.9M named entities in the knowledge base and 12.3G tokens from Wikipedia texts. The dataset is published under the CC BY-SA licence.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

22. Deep Universal Dependencies 2.4

Creator:: Zeman, Daniel and Droganova, Kira
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: semantic dependency and universal dependencies
Language:: Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, and Galician
Description:: Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-2988). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
Rights:: Licence Universal Dependencies v2.4, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.4, and PUB

23. Deep Universal Dependencies 2.5

Creator:: Zeman, Daniel and Droganova, Kira
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: semantic dependency and universal dependencies
Language:: Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, and Skolt Sami
Description:: Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3105). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
Rights:: Licence Universal Dependencies v2.5, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.5, and PUB

24. Deep Universal Dependencies 2.6

Creator:: Zeman, Daniel and Droganova, Kira
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: semantic dependency and universal dependencies
Language:: Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, and Persian
Description:: Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3226). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
Rights:: Licence Universal Dependencies v2.6, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.6, and PUB

25. Deep Universal Dependencies 2.7

Creator:: Zeman, Daniel and Droganova, Kira
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: semantic dependency and universal dependencies
Language:: Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, and Tupinambá
Description:: Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3424). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
Rights:: Licence Universal Dependencies v2.7, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.7, and PUB

26. Deep Universal Dependencies 2.8

Creator:: Zeman, Daniel and Droganova, Kira
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: semantic dependency and universal dependencies
Language:: Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, Tupinambá, Beja, Western Frisian, Urubú-Kaapor, Kangri, K'iche', Low German, Makuráp, Western Armenian, and Central Siberian Yupik
Description:: Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3687). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
Rights:: Licence Universal Dependencies v2.8, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.8, and PUB

27. Deltacorpus

Creator:: Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: part of speech, tagging, semi-supervised, and cross-language
Language:: Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
Description:: Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

28. Deltacorpus 1.1

Creator:: Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: part of speech, tagging, semi-supervised, and cross-language
Language:: Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
Description:: Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia). Changes in version 1.1: 1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset. 2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0. 3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

29. Erich Kulka 1911-1995 :

Publisher:: Židovské muzeum v Praze : and Hebrejská univerzita v Jeruzalémě,
Type:: biografie
Subject:: Biografie, Kulka, Erich,, Židé, holocaust, spisovatelé, tábory koncentrační, Československo 1938-1945, and antisemitismus, perzekuce, pogromy
Language:: Czech, Hebrew, and English
Rights:: unknown

30. Fase pražských židovských rodin z let 1748-1749 (1751) :

Publisher:: Židovské muzeum v Praze,
Type:: prameny and edice
Subject:: Dějiny Česka a Slovenska, Židé, prameny písemné, soupisy, české země 1620-1740, and dějiny židů
Language:: Czech, German, and Hebrew
Rights:: unknown

31. Gizela Lipovská - posledná Židovka z Uličskej doliny :

Creator:: Marcineková, Jaroslava
Publisher:: Elinor,
Type:: vzpomínky and autobiografie
Subject:: Vnitropolitický vývoj, politický život, Biografie, Lipovská, Gizela,, Židé, holocaust, válka druhá světová (1939-1945), Slovensko 1939-1945, and antisemitismus, perzekuce, pogromy
Language:: Slovak and Hebrew
Rights:: unknown

32. Ha-Ḥaḳiḳah neged ha-Yehudim ṿe-nishulam min ha-kalkalah bi-Medinat Slovaḳyah :

Creator:: Steiner, Jan
Type:: text and monografie
Subject:: Dějiny zemí střední Evropy, Židé, perzekuce, zákonodárství, nacismus, Slovensko 1939-1945, and antisemitismus, perzekuce, pogromy
Language:: Hebrew
Description:: Title on leaf following series t.p.: Anti-Jewish legislation and elimination of the Jews from the economic life of the Slovakian state.
Rights:: unknown

33. Ha-Ro'e - Židovský snář :

Creator:: Holubová, Markéta,
Type:: text, monografie, prameny, and edice
Subject:: Judaismus, judaismus, literatura židovská, překlady, sny, přehledná zpracování světových dějin (chronologicky), and židovské náboženství, filozofie
Language:: Czech, Official Aramaic (700-300 BCE), English, and Hebrew
Description:: Část. přeloženo z aramejštiny a hebrejštiny
Rights:: unknown

34. HamleDT 3.0

Creator:: Zeman, Daniel, Mareček, David, Mašek, Jan, Popel, Martin, Ramasamy, Loganathan, Rosa, Rudolf, Štěpánek, Jan, and Žabokrtský, Zdeněk
Publisher:: Charles University
Type:: text and corpus
Subject:: annotated corpus, morphology, syntax, dependency, treebank, harmonized annotation, and common annotation style
Language:: Arabic, Basque, Bengali, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Modern Greek (1453-), Ancient Greek (to 1453), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, and Turkish
Description:: HamleDT (HArmonized Multi-LanguagE Dependency Treebank) is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. This version uses Universal Dependencies as the common annotation style. Update (November 1017): for a current collection of harmonized dependency treebanks, we recommend using the Universal Dependencies (UD). All of the corpora that are distributed in HamleDT in full are also part of the UD project; only some corpora from the Patch group (where HamleDT provides only the harmonizing scripts but not the full corpus data) are available in HamleDT but not in UD.
Rights:: HamleDT 3.0 License Terms, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-hamledt-3.0, and PUB

35. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Friedrich Hofmeister
Format:: hudebnina and xxvii, 211 stran ; 33 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: jidiš písně, židovská hudba, židovská kultura, židovské lidové písně, vokální partitury, komentovaná vydání, vokální hudba, text, zápis hudby, etnická hudba, and hudební edice
Language:: Hebrew
Description:: Band IX, Der Volksgesang der osteuropäischen Juden, gesammelt, geordnet und erläutert von A.Z. Idelsohn, Úvod německy, Textové podložení německy a hebrejsky ve fonetickém přepisu, and Kritické edice skladeb, komentovaná vydání.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

36. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Benjamin Harz Verlag
Format:: print and VIII, 51, 68 stran ; 34 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: chrámová hudba, liturgické zpěvy, židovská hudba, židovská kultura, komentovaná vydání, vokální partitury, vokální hudba, etnická hudba, starověká hudba, and hudební edice
Language:: Hebrew
Description:: Band III, Gesänge der persischen, bucharischen und daghestanischen Juden, zum ersten Male gesammelt, erläutert und herausgegeben von A.Z. Idelsohn, Komentář německy, Textové podložení hebrejsky ve fonetickém přepisu, and Kritické edice skladeb, komentovaná vydání
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

37. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Benjamin Harz Verlag
Format:: hudebnina and xv, 280 stran ; 34 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: chrámová hudba, liturgické zpěvy, židovská hudba, židovská kultura, vokální partitury, komentovaná vydání, vokální hudba, zápis hudby, etnická hudba, and hudební edice
Language:: Hebrew and Spanish
Description:: Band IV, Gesänge der orientalischen Sefardim, zum ersten Male gesammelt, erläutert und herausgegeben von A.Z. Idelsohn, Komentář německy, Textové podložení hebrejsky a španělsky ve fonetickém přepisu, Kritické edice skladeb, komentovaná vydání., and Obsahuje: I. Prayers; II. Religious songs; III. Spanisch songs;
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

38. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Harz
Format:: hudebnina and 119 stran ; 33 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: chrámová hudba, liturgické zpěvy, židovská hudba, židovská kultura, vokální partitury, komentovaná vydání, vokální hudba, zápis hudby, and hudební edice
Language:: Hebrew
Description:: Band V, Gesänge der marokkanischen Juden, zum ersten Male gesammelt, erläutert und herausgegeben von A.Z. Idelsohn, Textové podloženi hebrejsky ve fonetickém přepisu, and Kritické edice skladeb, komentovaná vydání.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

39. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Friedrich Hofmeister
Format:: hudebnina and xxiv, 72 stran ; 34 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: chrámová hudba, liturgické zpěvy, židovská hudba, židovská kultura, vokální partitury, komentovaná vydání, vokální hudba, text, zápis hudby, etnická hudba, and hudební edice
Language:: Hebrew and Yiddish
Description:: Band X, Gesänge der Chassidim, gesammelt, geordnet und erläutert von A.Z. Idelsohn, Komentář německy a ukázky jidiš, Textové podložení německy a jidiš, and Kritické edice skladeb, komentovaná vydání.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

40. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Benjamin Harz Verlag
Format:: hudebnina and ix, 140 stran ; 34 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: chrámová hudba, liturgické zpěvy, židovská hudba, židovská kultura, komentovaná vydání, vokální partitury, vokální hudba, starověká hudba, and hudební edice
Language:: Hebrew
Description:: Band II, Gesänge der babylonischen Juden, zum ersten Male gesammelt, erläutert und herausgegeben von A.Z. Idelsohn, Komentář německy, and Textové podložení hebrejsky ve fonetickém přepisu
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

41. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Friedrich Hofmeister
Format:: hudebnina, svazek, and xxxiv, 143 stran ; 34 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: chrámová hudba, liturgické zpěvy, židovská hudba, židovská kultura, vokální partitury, komentovaná vydání, vokální hudba, text, zápis hudby, etnická hudba, and hudební edice
Language:: Hebrew
Description:: Band VIII, Der Synagogengesang der osteuropäischen Juden, gesammelt, erläutert und herausgegeben von A.Z. Idelsohn, Úvod německy, Textové podložení hebrejsky ve fonetickém přepisu, Obsahuje: Abteilung I: Der traditionelle Gesang; Abteilung II: Ausgewählte Kompositionen der Osteuropäischen Chasanim;, and Kritické edice skladeb, komentovaná vydání.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

42. Hebräisch-orientalischer Melodienschatz

Creator:: Idelsohn, Abraham Zwi
Publisher:: Druck und Verlag von Breitkopf & Härtel
Format:: hudebnina and xi, 158 stran ; 30 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: chrámová hudba, liturgické zpěvy, židovská hudba, židovská kultura, vokální partitury, komentovaná vydání, vokální hudba, text, zápis hudby, and hudební edice
Language:: Hebrew
Description:: I. Band, Gesänge der jemenischen Juden, zum ersten Male gesammelt, erläutert und herausgegeben von A.Z. Idelsohn, Předmluva a komentář německy, Textové podložení hebrejsky ve fonetickém přepisu, Obsahuje: Abteilung I: Synagogengesänge; Abteilung II: Außersynagogale Gesänge;, and Kritické edice skladeb, komentovaná vydání.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

43. Hebrew printing in Bohemia and Moravia /

Publisher:: Academia : and Jewish Museum in Prague,
Type:: monografie kolektivní
Subject:: Polygrafie. Vydavatelství a knižní obchod, tisky staré, hebraika, knihtisk, knihtiskaři, kultura knižní, kultura židovská, české země 1526-1792, židovská věda, kultura a školství, české země 1792-1918, dějiny knihy, knihtisk, nakladatelství, and staré tisky
Language:: English, German, Hebrew, and Latin
Description:: Přeloženo z češtiny
Rights:: unknown

44. Ideály humanitní /

Creator:: Masaryk, Tomáš Garrigue,
Type:: text and eseje
Subject:: Filozofie, demokracie, humanismus, české země 1848-1918, and filozofie, filozofové
Language:: Hebrew
Description:: Vydáno za podpory ministerstva školství a národní osvěty
Rights:: unknown

45. Juden in der mittelalterlichen Stadt :

Creator:: Doležalová, Eva,
Type:: text and sborníky
Subject:: Dějiny Česka a Slovenska, Židé, dějiny Židů, města středověká, and české (československé) sborníky a kolektivní monografie
Language:: German, English, Hebrew, and Latin
Description:: Na obálce nad názvem: cms - Centre for Medieval Studies
Rights:: unknown

46. Juden in der mittelalterlichen Stadt :

Creator:: Doležalová, Eva,
Type:: text and sborníky
Subject:: Dějiny Česka a Slovenska, Židé, dějiny Židů, města středověká, and české (československé) sborníky a kolektivní monografie
Language:: German, English, Hebrew, and Latin
Description:: Na obálce nad názvem: cms - Centre for Medieval Studies
Rights:: unknown

47. Kenaanské glosy ve středověkých hebrejských rukopisech s vazbou na české země /

Creator:: Bláha, Ondřej,
Type:: text and monografie kolektivní
Subject:: Filologie, Jakobson, Roman,, rukopisy středověké, jazyk staročeský, kultura židovská, české země 1306-1526, jazyk, písmo, české země od příchodu Slovanů do roku 1306, židovská věda, kultura a školství, světové dějiny středověku (do r. 1492), and rukopisy
Language:: Czech, English, and Hebrew
Description:: Canaanite glosses in Medieval Hebrew Manuscripts Related to the Czech Lands.
Rights:: unknown

48. Kindersuite

Creator:: Achron, Joseph
Format:: print
Type:: supplement, model:supplement, and TEXT
Language:: English, French, Russian, German, and Hebrew
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

49. Kindersuite

Creator:: Achron, Joseph
Format:: print
Type:: supplement, model:supplement, and TEXT
Language:: English, French, German, Russian, and Hebrew
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

50. Kindersuite

Creator:: Achron, Joseph
Format:: print
Type:: supplement, model:supplement, and TEXT
Language:: English, French, German, Russian, and Hebrew
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

51. Kindersuite

Creator:: Achron, Joseph
Format:: print
Type:: supplement, model:supplement, and TEXT
Language:: English, French, German, Russian, and Hebrew
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

52. Kindersuite

Creator:: Achron, Joseph
Publisher:: Universal Edition
Format:: hudebnina and 1 partitura (51 s.) + 5 hlasů (12, 12, 12, 10, 10 s.) ; 34 cm
Type:: notated music, sheetmusic, model:sheetmusic, and TEXT
Subject:: sextety, viola (1), violoncello (1), klavír (1), klarinet in C (1), klavírní kvintety, partitury a hlasy, housle (2), hlasy, hudební úpravy, and kvintety
Language:: English, French, German, Hebrew, and Russian
Description:: Skladby pro klavír a 5 a více různých nástrojů.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

53. Kindersuite

Creator:: Achron, Joseph
Format:: print
Type:: supplement, model:supplement, and TEXT
Language:: English, French, German, Russian, and Hebrew
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

54. Knaanic Language: Structure and Historical Background :

Publisher:: Academia,
Type:: sborníky konferenční
Subject:: Filologie, hebraistika, Židé, jazyk staročeský, vztahy křesťansko-židovské, vztahy jazykové, rukopisy, české (československé) sborníky a kolektivní monografie, české země 1306-1526, and dějiny židů
Language:: English, Czech, and Hebrew
Rights:: unknown

55. Kompendium gramatiky hebrejského jazyka /

Creator:: Spinoza, Benedictus de,
Type:: text, pojednání, and edice
Subject:: Afroasijské (hamitosemitské) jazyky, jazyk hebrejský, mluvnice, světové dějiny 1492-1648, and židovská věda, kultura a školství
Language:: Czech and Hebrew
Description:: Přeloženo z latiny
Rights:: unknown

56. Lingua::Interset 2.026

Creator:: Zeman, Daniel
Publisher:: Charles University, Faculty of Mathematics and Physics
Type:: tool and toolService
Subject:: morphology, part of speech, conversion, and tagset
Language:: Arabic, Bulgarian, Bengali, Catalan, Czech, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Japanese, Multiple languages, and Portuguese
Description:: Lingua::Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped. Version 2.026 covers 37 different tagsets of 21 languages. Limited support of the older drivers for other languages (which are not included in this package but are available for download elsewhere) is also available; these will be fully ported to Interset 2 in future. Interset is implemented as Perl libraries. It is also available via CPAN.
Rights:: Artistic License (Perl) 1.0, http://opensource.org/licenses/Artistic-Perl-1.0, and PUB

57. Maimonides' Views on the Authority of a Written Word /

Creator:: Harel, Avi
Type:: studie
Subject:: Judaismus, Maimonides,, judaismus, filozofie středověká, světové dějiny středověku (do r. 1492), and židovské náboženství, filozofie
Language:: Hebrew
Description:: Text v hebrejštině
Rights:: unknown

58. Morpho-syntactically annotated corpora provided for the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)

Creator:: Guillaume, Bruno, Ramisch, Carlos, Waszczuk, Jakub, Monti, Johanna, Di Buono, Maria Pia, Sangati, Federico, Speranza, Giulia, Carlino, Carola, Güngör, Tunga, Yirmibeşoğlu, Zeynep, Sak, Haşim, Saraçlar, Murat, Giouli, Voula, Foufi, Vassiliki, Ramisch, Renata, Rademaker, Alexandre, Vale, Oto, Wilkens, Rodrigo, Candito, Marie, Crabbé, Benoît, Segonne, Vincent, Liebeskind, Chaya, Stymne, Sara, Hajič, Jan, Ginter, Filip, Luotolahti, Juhani, Straka, Milan, Zeman, Daniel, Barbu Mititelu, Verginica, Cristescu, Mihaela, Vaidya, Ashwini, Bhatia, Archna, Lichte, Timm, Ehren, Rafael, Jiang, Menghan, Xu, Hongzhi, Walsh, Abigail, Irimia, Elena, and Dowling, Meghan
Publisher:: PARSEME
Type:: text and corpus
Subject:: morphosyntactic annotation, dependency trees, and morphological analysis
Language:: German, Modern Greek (1453-), Basque, French, Irish, Hebrew, Hindi, Italian, Polish, Portuguese, Romanian, Swedish, Turkish, and Chinese
Description:: This multilingual resource contains corpora for 14 languages, gathered at the occasion of the 1.2 edition of the PARSEME Shared Task on semi-supervised Identification of Verbal MWEs (2020). These corpora were meant to serve as additional "raw" corpora, to help discovering unseen verbal MWEs. The corpora are provided in CONLL-U (https://universaldependencies.org/format.html) format. They contain morphosyntactic annotations (parts of speech, lemmas, morphological features, and syntactic dependencies). Depending on the language, the information comes from treebanks (mostly Universal Dependencies v2.x) or from automatic parsers trained on UD v2.x treebanks (e.g., UDPipe). VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). For the 1.2 shared task edition, the data covers 14 languages, for which VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CONLL-U format. Morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.2 (2020). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.2
Rights:: PARSEME Shared Task Raw Corpus Data (v. 1.2) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.2-raw, and PUB

59. OmegaWiki

Publisher:: Universität Bamberg, World Language Documentation Centre
Format:: application/octet-stream
Type:: lexicalConceptualResource
Language:: Afrikaans, Arabic, Basque, Bulgarian, Catalan, Chinese, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, Georgian, Modern Greek (1453-), Hebrew, Hungarian, Icelandic, Indonesian, Interlingua (International Auxiliary Language Association), Irish, Italian, Japanese, Khmer, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Spanish, Swedish, Turkish, Ukrainian, and Welsh
Rights:: GFDL or CC and http://www.omegawiki.org/Licensing

60. OrienTel Telephone databases

Type:: corpus
Subject:: Multilingual access to interactive communication services for the Mediterranean and the Middle East
Language:: Modern Greek (1453-), Turkish, Arabic, and Hebrew
Description:: Collection of telephone databases from mediterranean region, incl. (variants of) Arabic. 500-1000 speakers per database, all orthographically transcribed. Speaker information regarding gender, age and accent. Phonetic lexicons included.
Rights:: Not specified

61. Otevřené dveře :

Type:: text and monografie kolektivní
Subject:: Bible. Biblistika, bible, teologie křesťanská, Starý zákon, teologie židovská, české (československé) sborníky a kolektivní monografie, and teologie, ikonografie, zbožnost, hagiografie
Language:: Czech and Hebrew
Description:: Část. přeloženo z němčiny
Rights:: unknown

62. Památník Židovského ústředního musea pro Moravsko-Slezsko =

Type:: text and publikace jubilejní
Subject:: Výtvarné umění, dějiny Židů, muzea židovská, české (československé) sborníky a kolektivní monografie, přehledná zpracování dějin českých zemí (chronologicky), and dějiny židů
Language:: Czech, German, and Hebrew
Rights:: unknown

63. PARSEME corpora annotated for verbal multiword expressions (version 1.3)

Creator:: Savary, Agata, Ramisch, Carlos, Guillaume, Bruno, Hawwari, Abdelati, Walsh, Abigail, Fotopoulou, Aggeliki, Bielinskienė, Agnė, Estarrona, Ainara, Gatt, Albert, Butler, Alexandra, Rademaker, Alexandre, Maldonado, Alfredo, Villavicencio, Aline, Farrugia, Alison, Muscat, Amanda, Gatt, Anabelle, Antić, Anđela, De Santis, Anna, Raffone, Annalisa, Riccio, Anna, Pascucci, Antonio, Gurrutxaga, Antton, Bhatia, Archna, Vaidya, Ashwini, Miral, Ayşenur, QasemiZadeh, Behrang, Priego Sanchez, Belem, Griciūtė, Bernadeta, Erden, Berna, Parra Escartín, Carla, Herrero, Carlos, Carlino, Carola, Pasquer, Caroline, Liebeskind, Chaya, Wang, Chenweng, Ben Khelil, Chérifa, Bonial, Claire, Somers, Clarissa, Aceta, Cristina, Krstev, Cvetana, Bejček, Eduard, Lindqvist, Ellinor, Erenmalm, Elsa, Palka-Binkiewicz, Emilia, Rimkute, Erika, Petterson, Eva, Cap, Fabienne, Hu, Fangyuan, Sangati, Federico, Wick Pedro, Gabriela, Speranza, Giulia, Jagfeld, Glorianna, Blagus, Goranka, Berk, Gözde, Attard, Greta, Eryiğit, Gülşen, Finnveden, Gustav, Martínez Alonso, Héctor, de Medeiros Caseli, Helena, Elyovich, Hevi, Xu, Hongzhi, Xiao, Huangyang, Miranda, Isaac, Jaknić, Isidora, El Maarouf, Ismail, Aduriz, Itziar, Gonzalez, Itziar, Matas, Ivana, Stoyanova, Ivelina, Jazbec, Ivo-Pavao, Busuttil, Jael, Waszczuk, Jakub, Findlay, Jamie, Bonnici, Janice, Šnajder, Jan, Antoine, Jean-Yves, Foster, Jennifer, Chen, Jia, Nivre, Joakim, Monti, Johanna, McCrae, John, Kovalevskaitė, Jolanta, Jain, Kanishka, Simkó, Katalin, Yu, Ke, Azzopardi, Kirsty, Adalı, Kübra, Uria, Larraitz, Zilio, Leonardo, Boizou, Loïc, van der Plas, Lonneke, Galea, Luke, Sarlak, Mahtab, Buljan, Maja, Cherchi, Manuela, Tanti, Marc, Di Buono, Maria Pia, Todorova, Maria, Candito, Marie, Constant, Matthieu, Shamsfard, Mehrnoush, Jiang, Menghan, Boz, Mert, Spagnol, Michael, Onofrei, Mihaela, Li, Minli, Elbadrashiny, Mohamed, Diab, Mona, Rizea, Monica-Mihaela, Hadj Mohamed, Najet, Theoxari, Natasa, Schneider, Nathan, Tabone, Nicole, Ljubešić, Nikola, Vale, Oto, Cook, Paul, Yan, Peiyi, Gantar, Polona, Ehren, Rafael, Fabri, Ray, Ibrahim, Rehab, Ramisch, Renata, Walles, Rinat, Wilkens, Rodrigo, Urizar, Ruben, Sun, Ruilong, Malka, Ruth, Galea, Sara Anne, Stymne, Sara, Louizou, Sevasti, Hu, Sha, Taslimipoor, Shiva, Ratori, Shraddha, Srivastava, Shubham, Cordeiro, Silvio Ricardo, Krek, Simon, Liu, Siyuan, Zeng, Si, Yu, Songping, Arhar Holdt, Špela, Markantonatou, Stella, Papadelli, Stella, Leseva, Svetlozara, Kuzman, Taja, Kavčič, Teja, Lynn, Teresa, Lichte, Timm, Pickard, Thomas, Dimitrova, Tsvetana, Yih, Tsy, Güngör, Tunga, Dinç, Tutkum, Iñurrieta, Uxoa, Tajalli, Vahide, Stefanova, Valentina, Caruso, Valeria, Puri, Vandana, Foufi, Vassiliki, Barbu Mititelu, Verginica, Vincze, Veronika, Kovács, Viktória, Shukla, Vishakha, Giouli, Voula, Ge, Xiaomin, Ha-Cohen Kerner, Yaakov, Öztürk, Yağmur, Yarandi, Yalda, Parmentier, Yannick, Zhang, Yongchen, Zhao, Yun, Urešová, Zdeňka, Yirmibeşoğlu, Zeynep, Qin, Zhenzhen, Stank, Cristescu, Mihaela, Zgreabăn, Bianca-Mădălina, Bărbulescu, Elena-Andreea, and Stanković, Ranka
Publisher:: PARSEME
Type:: text and corpus
Subject:: multiword expressions, verbal multiword expressions, light verb construction, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
Language:: Arabic, Bulgarian, Czech, German, Modern Greek (1453-), English, Spanish, Basque, Persian, French, Irish, Hebrew, Hindi, Croatian, Hungarian, Lithuanian, Italian, Maltese, Polish, Portuguese, Romanian, Slovenian, Serbian, Swedish, Turkish, and Chinese
Description:: This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). This is the first release of the corpora without an associated shared task. Previous version (1.2) was associated with the PARSEME Shared Task on semi-supervised Identification of Verbal MWEs (2020). The data covers 26 languages corresponding to the combination of the corpora for all previous three editions (1.0, 1.1 and 1.2) of the corpora. VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CONLL-U format. Morphological and syntactic information, including parts of speech, lemmas, morphological features and/or syntactic dependencies, are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). All corpora are split into training, development and test data, following the splitting strategy adopted for the PARSEME Shared Task 1.2. The annotation guidelines are available online: https://parsemefr.lis-lab.fr/parseme-st-guidelines/1.3 The .cupt format is detailed here: https://multiword.sourceforge.net/cupt-format/
Rights:: PARSEME Corpora v. 1.3 - Licence Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.3, and PUB

64. Perāqîm be-qôrôt šô'at yehûdê Slôvaqyā :

Type:: text and sborníky konferenční
Subject:: Dějiny Česka a Slovenska, Židé, antisemitismus, holocaust, Slovensko 1939-1945, antisemitismus, perzekuce, pogromy, and zahraniční periodika a sborníky
Language:: Hebrew
Description:: Givʿat Ḥavîvā 6.11.1984, Yad Wā-Šēm 7.11.1984
Rights:: unknown

65. Pinkasim a správa židovských obcí v českých zemích raného novověku :

Creator:: Sixtová, Olga,
Type:: text, monografie kolektivní, and prameny
Subject:: Organizace a řízení veřejné správy, Židé, obce židovské, správa místní, knihy úřední, české země 1620-1740, české země 1740-1792, and dějiny židů
Language:: Czech, German, Hebrew, and Yiddish
Rights:: unknown

66. Plaintext Wikipedia dump 2018

Creator:: Rosa, Rudolf
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: Wikipedia, text corpora, and monolingual corpus
Language:: Abkhazian, Achinese, Adyghe, Afrikaans, Akan, Tosk Albanian, Amharic, Old English (ca. 450-1100), Arabic, Official Aramaic (700-300 BCE), Aragonese, Egyptian Arabic, Assamese, Asturian, Atikamekw, Avaric, Aymara, South Azerbaijani, Azerbaijani, Bashkir, Bambara, Bavarian, Central Bikol, Belarusian, Bengali, Bislama, Banjar, Tibetan, Bosnian, Bishnupriya, Breton, Buginese, Bulgarian, Russia Buriat, Catalan, Min Dong Chinese, Cebuano, Czech, Chamorro, Chechen, Cherokee, Church Slavic, Chuvash, Cheyenne, Central Kurdish, Cornish, Corsican, Cree, Crimean Tatar, Kashubian, Welsh, Danish, German, Dinka, Dimli (individual language), Dhivehi, Lower Sorbian, Dzongkha, Modern Greek (1453-), English, Esperanto, Estonian, Basque, Ewe, Extremaduran, Faroese, Persian, Fijian, Finnish, French, Arpitan, Northern Frisian, Western Frisian, Fulah, Friulian, Gagauz, Gan Chinese, Scottish Gaelic, Irish, Galician, Gilaki, Manx, Goan Konkani, Gothic, Guarani, Gujarati, Hakka Chinese, Haitian, Hausa, Hawaiian, Serbo-Croatian, Hebrew, Herero, Fiji Hindi, Hindi, Hiri Motu, Croatian, Upper Sorbian, Hungarian, Armenian, Igbo, Ido, Inuktitut, Interlingue, Iloko, Interlingua (International Auxiliary Language Association), Indonesian, Inupiaq, Icelandic, Italian, Jamaican Creole English, Javanese, Lojban, Japanese, Kara-Kalpak, Kabyle, Kalaallisut, Kannada, Kashmiri, Georgian, Kanuri, Kazakh, Kabardian, Kabiyè, Khmer, Kikuyu, Kinyarwanda, Kirghiz, Komi-Permyak, Komi, Kongo, Korean, Karachay-Balkar, Kölsch, Kurdish, Ladino, Lao, Latin, Latvian, Lak, Lezghian, Ligurian, Limburgan, Lingala, Lithuanian, Lombard, Northern Luri, Latgalian, Luxembourgish, Ganda, Literary Chinese, Marshallese, Maithili, Malayalam, Marathi, Moksha, Eastern Mari, Minangkabau, Macedonian, Malagasy, Maltese, Mongolian, Maori, Western Mari, Malay (macrolanguage), Creek, Mirandese, Burmese, Erzya, Mazanderani, Min Nan Chinese, Neapolitan, Nauru, Navajo, Ndonga, Low German, Nepali (macrolanguage), Newari, Dutch, Norwegian Nynorsk, Norwegian, Novial, Pedi, Nyanja, Occitan (post 1500), Livvi, Oriya (macrolanguage), Oromo, Ossetian, Pangasinan, Pampanga, Panjabi, Papiamento, Picard, Pennsylvania German, Pfaelzisch, Pitcairn-Norfolk, Pali, Piemontese, Western Panjabi, Pontic, Polish, Portuguese, Pushto, Quechua, Vlax Romani, Romansh, Romanian, Rusyn, Rundi, Macedo-Romanian, Russian, Sango, Yakut, Sanskrit, Sicilian, Scots, Samogitian, Sinhala, Slovak, Slovenian, Northern Sami, Samoan, Shona, Sindhi, Somali, Southern Sotho, Spanish, Albanian, Sardinian, Sranan Tongo, Serbian, Swati, Saterfriesisch, Sundanese, Swahili (macrolanguage), Swedish, Silesian, Tahitian, Tamil, Tatar, Tulu, Telugu, Tama (Colombia), Tetum, Tajik, Tagalog, Thai, Tigrinya, Tonga (Tonga Islands), Tok Pisin, Tswana, Tsonga, Turkmen, Tumbuka, Turkish, Twi, Tuvinian, Udmurt, Uighur, Ukrainian, Urdu, Uzbek, Venetian, Venda, Veps, Vietnamese, Vlaams, Volapük, Võro, Waray (Philippines), Walloon, Wolof, Wu Chinese, Kalmyk, Xhosa, Mingrelian, Yiddish, Yoruba, Yue Chinese, Zeeuws, Zhuang, Chinese, Zulu, and Dotyali
Description:: Wikipedia plain text data obtained from Wikipedia dumps with WikiExtractor in February 2018. The data come from all Wikipedias for which dumps could be downloaded at [https://dumps.wikimedia.org/]. This amounts to 297 Wikipedias, usually corresponding to individual languages and identified by their ISO codes. Several special Wikipedias are included, most notably "simple" (Simple English Wikipedia) and "incubator" (tiny hatching Wikipedias in various languages). For a list of all the Wikipedias, see [https://meta.wikimedia.org/wiki/List_of_Wikipedias]. The script which can be used to get new version of the data is included, but note that Wikipedia limits the download speed for downloading a lot of the dumps, so it takes a few days to download all of them (but one or a few can be downloaded fast). Also, the format of the dumps changes time to time, so the script will probably eventually stop working one day. The WikiExtractor tool [http://medialab.di.unipi.it/wiki/Wikipedia_Extractor] used to extract text from the Wikipedia dumps is not mine, I only modified it slightly to produce plaintext outputs [https://github.com/ptakopysk/wikiextractor].
Rights:: Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB

67. Prameny k dějinám Židů v Čechách a na Moravě ve středověku :

Type:: text, prameny, and soupisy
Subject:: Historická věda. Pomocné vědy historické. Archivnictví, Židé, dějiny Židů, společnost středověká, české země od příchodu Slovanů do roku 1306, dějiny židů, and české země 1306-1419
Language:: Czech, English, German, Hebrew, and Latin
Description:: Obálkový název: Archiv český
Rights:: unknown

68. Příběhy tesané do kamene :

Creator:: Vladařová, Petra,
Type:: text and monografie
Subject:: Kulturní památky historického období obecně, hřbitovy židovské, náhrobky židovské, přehledná zpracování dějin českých zemí (chronologicky), and židovské památky, hřbitovy, synagogy
Language:: Czech and Hebrew
Description:: "Vydalo nakl. L. Marek pro HTF UK"--Rub titulní stránky
Rights:: unknown

69. Rabiho Moše ben Majmona Osm kapitol o lidské duši a mravním konání /

Creator:: Maimonides,
Type:: text, prameny, and edice
Subject:: Judaismus, Maimonides,, judaismus, filozofie náboženská, světové dějiny středověku (do r. 1492), and židovská věda, kultura a školství
Language:: Czech and Hebrew
Description:: Název edice je v knize vyjádřen znakem - prvním písmenem hebrejské abecedy, Část. souběžný hebrejský text, and Další variantní název: Osm kapitol
Rights:: unknown

70. Rekonstrukce knihovny Bohuslava Hasištejnského z Lobkovic :

Creator:: Boldan, Kamil,
Type:: text, bibliografie, katalogy, and monografie kolektivní
Subject:: Dějiny Česka a Slovenska, Dějiny civilizace. Kulturní dějiny, Hasištejnský z Lobkovic, Bohuslav,, rukopisy, prvotisky, paleotypy, tisky staré, knihovny soukromé, knihovny zámecké, fondy knihovní, and rukopisy a staré tisky
Language:: Czech and Hebrew
Description:: The Roudnice Incunabula and the Library of Bohuslav Hasištejnský of Lobkowicz.
Rights:: unknown

71. Rukopisy od Mrtvého moře :

Type:: text, prameny, and edice
Subject:: Bible. Biblistika, rukopisy hebrejské, judaismus, přehledná zpracování světových dějin (chronologicky), církevní a náboženské dějiny, and dějiny židů
Language:: Czech and Hebrew
Description:: Přeloženo z hebrejštiny
Rights:: unknown

72. Šalom :

Type:: text, publikace jubilejní, and monografie kolektivní
Subject:: Judaismus, Nosek, Bedřich,, sborníky jubilejní, judaisté, judaistika, biblistika, and české (československé) sborníky a kolektivní monografie
Language:: Czech, English, German, and Hebrew
Rights:: unknown

73. Šana tova! :

Type:: zápis hudby and zpěvníky
Subject:: Církevní hudba. Duchovní hudba. Náboženská hudba, písně duchovní, židovství, liturgie, světové dějiny 1789-1918, světové dějiny od r. 1918 do současnosti, and hudba, tanec, hudební nástroje
Language:: Hebrew
Description:: Průvodní texty česky and Sestavila Ráchel Polohová
Rights:: unknown

74. Speecon databases

Type:: corpus
Language:: Czech, Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Polish, Portuguese, Russian, Spanish, Swedish, Turkish, Chinese, Hebrew, Japanese, Korean, and Thai
Description:: 28 speech databases containing broadband recordings from 550 adults and 50 children per language. Contains interesting phonetically rich material. All orthographically transcribed. Speaker information included for gender, age, accent. Including pronunciation lexicon.
Rights:: Not specified

75. Starý židovský hřbitov v Praze /

Creator:: Muneles, Otto,
Type:: text, monografie, and publikace fotografické
Subject:: Judaismus, památky židovské, hřbitovy židovské, přehledná zpracování dějin českých zemí (chronologicky), židovské památky, hřbitovy, synagogy, and jednotlivé památky, památkové rezervace
Language:: Czech and Hebrew
Rights:: unknown

76. Stopy Židů v Pardubickém kraji =

Creator:: Langová, Alžběta,
Type:: text, statický obraz, and monografie
Subject:: Dějiny Česka a Slovenska, Židé, dějiny Židů, památky židovské, hřbitovy židovské, přehledná zpracování dějin českých zemí (chronologicky), and dějiny židů
Language:: Czech, English, and Hebrew
Rights:: unknown

77. Stopy Židů v Pardubickém kraji =

Creator:: Růžičková, Renáta,
Type:: text and monografie kolektivní
Subject:: Dějiny Česka a Slovenska, Židé, dějiny regionální, památky židovské, dějiny židů, and přehledná zpracování dějin českých zemí (chronologicky)
Language:: Czech, English, and Hebrew
Rights:: unknown

78. The Jewish prayer for the welfare of the country as the echo of political and historical changes in Central Europe /

Creator:: Damohorská, Pavla,
Publisher:: For Charles University in Prague, Hussite Theological Faculty publ. by Vodnář,
Type:: monografie
Subject:: Judaismus, modlitby, judaismus, politika, společnost, světové dějiny novověku (1492-1918), světové dějiny od r. 1918 do současnosti, and židovské náboženství, filozofie
Language:: English, Hebrew, German, and Czech
Description:: 130 výt. and Obsahuje bibliografii a bibliografické odkazy
Rights:: unknown

79. The Use of Eschatological Prophecy in an Ideological Setting S. Yizhar (1916-2007) :

Creator:: Riveline, Ephraïm
Type:: studie
Subject:: Judaismus, Yizhar, S.,, judaismus, eschatologie, literatura židovská, světové dějiny od r. 1945 do současnosti, and židovské náboženství, filozofie
Language:: Hebrew
Description:: Text v hebrejštině
Rights:: unknown

80. TITUS Hebrew

Format:: text/html
Type:: corpus
Language:: Hebrew
Description:: ca. 500.000 tokens; linked with relational database; XML-encoding in progress
Rights:: http://titus.uni-frankfurt.de/texte/texte2.htm#Estart