« Previous |
1 - 100 of 667
|
Next »
Number of results to display per page
Search Results
2. "Bogurodzica" wobec hymnografji łacińskiej /
- Creator:
- Birkenmajer, Józef,
- Type:
- text and studie
- Subject:
- Dějiny zemí střední Evropy, jazyk polský, dějiny literatury, Polsko, jazyk, písmo, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
3. 13. celostátní konference archivářů České republiky :
- Type:
- text and sborníky konferenční
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, sborníky konferenční, archivnictví, archivy, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech and Polish
- Description:
- Konference konaná při příležitosti 170. výročí založení Moravského zemského archivu v Brně, V tiráži označení sv. edice: č. 51/2010, Název v tiráži: 13. celostátní konference archivářů ČR: České archivy a zahraniční inspirace, Obálkový název: Zpravodaj pobočky České informační společnosti, and Hřbetní název: Zpravodaj 51
- Rights:
- unknown
4. 15 variations d'apres une suite de douze tons, op. 9
- Creator:
- Koffler, Józef
- Publisher:
- Senart
- Format:
- hudebnina and 1 partitura (23 s.) ; 28 cm
- Type:
- notated music, sheetmusic, model:sheetmusic, and TEXT
- Subject:
- variace, smyčcový orchestr, dodekafonie, and partitury
- Language:
- French, German, and Polish
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
5. 40°51° Lublin
- Publisher:
- K.u.k. militär-geographisches Institut
- Format:
- 1 mapa : barevná ; 57 x 36 cm and kartografický dokument
- Type:
- model:map and IMAGE
- Language:
- Polish
- Description:
- Název dodán katalogizátorem, mapa oříznuta, chybí popis, měřítko a vydavatel dle souvisejících map souboru.
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
6. [Posudek práce J. Pekaře Nejstarší kronika česká] /
- Creator:
- Brückner, Aleksander,
- Type:
- text and recenze
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, Kosmas,, Kristián,, Václav,, Ludmila,, Konstantin,, Metoděj,, Dalimil,, Dobner, Gelasius,, Dobrovský, Josef,, legendy, kroniky, světci čeští, falza rukopisná, historiografie česká, teologie, ikonografie, zbožnost, hagiografie, and české země 895/906-1197
- Language:
- Polish
- Rights:
- unknown
7. [Recenze] /
- Creator:
- Brückner, Aleksander,
- Type:
- text and studie
- Subject:
- Kristián,, legendy svatováclavské, diskuse vědecké, teologie, ikonografie, zbožnost, hagiografie, české země 1306-1526, and české země od příchodu Slovanů do roku 1306
- Language:
- Polish
- Description:
- Recenze na: Pekař, Josef: Die Wenzels und Ludmila-Legenden
- Rights:
- unknown
8. Acta capitulorum nec non iudiciorum ecclesiasticorum selecta.
- Type:
- text, prameny, and edice
- Subject:
- Dějiny Evropy, dějiny polské, Polsko, světové dějiny novověku (1492-1918), přehledná zpracování (tematicky), and diplomatika, edice
- Language:
- Polish
- Rights:
- unknown
9. Adam Mickiewicz /
- Creator:
- Kallenbach, Józef Henryk,
- Type:
- text and biografie
- Subject:
- Polská literatura (o ní), Mickiewicz, Adam,, básníci polští, literatura polská, Polsko, literatura, spisovatelé, and světové dějiny 1789-1918
- Language:
- Polish
- Rights:
- unknown
10. Akta grodzkie i ziemskie z czasów Rzeczypospolitej Polskiej z archiwum tak zwanego bernardyńskiego we Lwowie w skutek fundacyi śp. Aleksandra hr. Stadnickiego.
- Type:
- text, prameny, and edice
- Subject:
- Dějiny Evropy, Polsko, politické dějiny, politici, and světové dějiny 1648-1789
- Language:
- Polish
- Rights:
- unknown
11. Akta unji Polski z Litwą 1385-1791 /
- Type:
- text, prameny, and edice
- Subject:
- Dějiny Evropy, dějiny států, unie polsko-litevská, dějiny polské, Polsko, přehledná zpracování světových dějin (chronologicky), politické dějiny, politici, and Litva
- Language:
- Polish
- Rights:
- unknown
12. Album zasłużonych lekarzy polskich
- Publisher:
- [s.n.]
- Type:
- model:monograph and TEXT
- Subject:
- udc:61 ; 61(091) ; 929 ;
- Language:
- Polish
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
13. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0)
- Creator:
- Savary, Agata, Ramisch, Carlos, Cordeiro, Silvio Ricardo, Sangati, Federico, Vincze, Veronika, QasemiZadeh, Behrang, Candito, Marie, Cap, Fabienne, Giouli, Voula, Stoyanova, Ivelina, Doucet, Antoine, Adalı, Kübra, Barbu Mititelu, Verginica, Bejček, Eduard, El Maarouf, Ismail, Eryiğit, Gülşen, Galea, Luke, Ha-Cohen Kerner, Yaakov, Liebeskind, Chaya, Monti, Johanna, Parra Escartín, Carla, Kovalevskaitė, Jolanta, Krek, Simon, van der Plas, Lonneke, Aceta, Cristina, Aduriz, Itziar, Antoine, Jean-Yves, Attard, Greta, Azzopardi, Kirsty, Boizou, Loic, Bonnici, Janice, Boz, Mert, Bumbulienė, Ieva, Busuttil, Jael, Caruso, Valeria, Cherchi, Manuela, Constant, Matthieu, Czerepowicka, Monika, De Santis, Anna, Dimitrova, Tsvetana, Dinç, Tutkum, Elyovich, Hevi, Fabri, Ray, Farrugia, Alison, Findlay, Jamie, Fotopoulou, Aggeliki, Foufi, Vassiliki, Galea, Sara Anne, Gantar, Polona, Gatt, Albert, Gatt, Anabelle, Herrero, Carlos, Iñurrieta, Uxoa, Jagfeld, Glorianna, Hnátková, Milena, Ionescu, Mihaela, Klyueva, Natalia, Koeva, Svetla, Kovács, Viktória, Kuzman, Taja, Leseva, Svetlozara, Louisou, Sevi, Lynn, Teresa, Malka, Ruth, Martínez Alonso, Héctor, McCrae, John, de Medeiros Caseli, Helena, Miral, Ayşenur, Muscat, Amanda, Nivre, Joakim, Oakes, Michael, Onofrei, Mihaela, Parmentier, Yannick, Pasquer, Caroline, Pia di Buono, Maria, Priego Sanchez, Belem, Raffone, Annalisa, Ramisch, Renata, Rimkutė, Erika, Rizea, Monica-Mihaela, Simkó, Katalin, Spagnol, Michael, Stefanova, Valentina, Stymne, Sara, Sulubacak, Umut, Tabone, Nicole, Tanti, Marc, Todorova, Maria, Urešová, Zdenka, Villavicencio, Aline, and Zilio, Leonardo
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- Multiword expressions, verbal multiword expressions, idioms, light-verb constructions, verb-particle constructions, and inherently reflexive verbs
- Language:
- Bulgarian, Czech, German, Modern Greek (1453-), Spanish, Persian, French, Hebrew, Hungarian, Italian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovenian, Swedish, and Turkish
- Description:
- The PARSEME shared task aims at identifying verbal MWEs in running texts. Verbal MWEs include idioms (let the cat out of the bag), light verb constructions (make a decision), verb-particle constructions (give up), and inherently reflexive verbs (se suicider 'to suicide' in French). VMWEs were annotated according to the universal guidelines in 18 languages. The corpora are provided in the parsemetsv format, inspired by the CONLL-U format. For most languages, paired files in the CONLL-U format - not necessarily using UD tagsets - containing parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training and test data, tools and the universal guidelines file.
- Rights:
- PARSEME Shared Task Data (v. 1.0) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.0, and PUB
14. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)
- Creator:
- Ramisch, Carlos, Cordeiro, Silvio Ricardo, Savary, Agata, Vincze, Veronika, Barbu Mititelu, Verginica, Bhatia, Archna, Buljan, Maja, Candito, Marie, Gantar, Polona, Giouli, Voula, Güngör, Tunga, Hawwari, Abdelati, Iñurrieta, Uxoa, Kovalevskaitė, Jolanta, Krek, Simon, Lichte, Timm, Liebeskind, Chaya, Monti, Johanna, Parra Escartín, Carla, QasemiZadeh, Behrang, Ramisch, Renata, Schneider, Nathan, Stoyanova, Ivelina, Vaidya, Ashwini, Walsh, Abigail, Aceta, Cristina, Aduriz, Itziar, Antoine, Jean-Yves, Arhar Holdt, Špela, Berk, Gözde, Bielinskienė, Agnė, Blagus, Goranka, Boizou, Loic, Bonial, Claire, Caruso, Valeria, Čibej, Jaka, Constant, Matthieu, Cook, Paul, Diab, Mona, Dimitrova, Tsvetana, Ehren, Rafael, Elbadrashiny, Mohamed, Elyovich, Hevi, Erden, Berna, Estarrona, Ainara, Fotopoulou, Aggeliki, Foufi, Vassiliki, Geeraert, Kristina, van Gompel, Maarten, Gonzalez, Itziar, Gurrutxaga, Antton, Ha-Cohen Kerner, Yaakov, Ibrahim, Rehab, Ionescu, Mihaela, Jain, Kanishka, Jazbec, Ivo-Pavao, Kavčič, Teja, Klyueva, Natalia, Kocijan, Kristina, Kovács, Viktória, Kuzman, Taja, Leseva, Svetlozara, Ljubešić, Nikola, Malka, Ruth, Markantonatou, Stella, Martínez Alonso, Héctor, Matas, Ivana, McCrae, John, de Medeiros Caseli, Helena, Onofrei, Mihaela, Palka-Binkiewicz, Emilia, Papadelli, Stella, Parmentier, Yannick, Pascucci, Antonio, Pasquer, Caroline, Pia di Buono, Maria, Puri, Vandana, Raffone, Annalisa, Ratori, Shraddha, Riccio, Anna, Sangati, Federico, Shukla, Vishakha, Simkó, Katalin, Šnajder, Jan, Somers, Clarissa, Srivastava, Shubham, Stefanova, Valentina, Taslimipoor, Shiva, Theoxari, Natasa, Todorova, Maria, Urizar, Ruben, Villavicencio, Aline, and Zilio, Leonardo
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- Multiword expressions, verbal multiword expressions, light-verb constructions, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
- Language:
- Bulgarian, German, Modern Greek (1453-), Spanish, Persian, French, Hebrew, Hungarian, Italian, Lithuanian, Polish, Portuguese, Romanian, Slovenian, Turkish, Hindi, Basque, English, and Croatian
- Description:
- This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). VMWEs were annotated according to the universal guidelines in 19 languages. The corpora are provided in the cupt format, inspired by the CONLL-U format. The corpora were used in the 1.1 edition of the PARSEME Shared Task (2018). For most languages, morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.1 (2018). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1
- Rights:
- PARSEME Shared Task Data (v. 1.1) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.1, and PUB
15. Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
- Creator:
- Ramisch, Carlos, Guillaume, Bruno, Savary, Agata, Waszczuk, Jakub, Candito, Marie, Vaidya, Ashwini, Barbu Mititelu, Verginica, Bhatia, Archna, Iñurrieta, Uxoa, Giouli, Voula, Güngör, Tunga, Jiang, Menghan, Lichte, Timm, Liebeskind, Chaya, Monti, Johanna, Ramisch, Renata, Stymme, Sara, Walsh, Abigail, Xu, Hongzhi, Palka-Binkiewicz, Emilia, Ehren, Rafael, Stymne, Sara, Constant, Matthieu, Pasquer, Caroline, Parmentier, Yannick, Antoine, Jean-Yves, Carlino, Carola, Caruso, Valeria, Di Buono, Maria Pia, Pascucci, Antonio, Raffone, Annalisa, Riccio, Anna, Sangati, Federico, Speranza, Giulia, Cordeiro, Silvio Ricardo, de Medeiros Caseli, Helena, Miranda, Isaac, Rademaker, Alexandre, Vale, Oto, Villavicencio, Aline, Wick Pedro, Gabriela, Wilkens, Rodrigo, Zilio, Leonardo, Rizea, Monica-Mihaela, Ionescu, Mihaela, Onofrei, Mihaela, Chen, Jia, Ge, Xiaomin, Hu, Fangyuan, Hu, Sha, Li, Minli, Liu, Siyuan, Qin, Zhenzhen, Sun, Ruilong, Wang, Chenweng, Xiao, Huangyang, Yan, Peiyi, Yih, Tsy, Yu, Ke, Yu, Songping, Zeng, Si, Zhang, Yongchen, Zhao, Yun, Foufi, Vassiliki, Fotopoulou, Aggeliki, Markantonatou, Stella, Papadelli, Stella, Louizou, Sevasti, Aduriz, Itziar, Estarrona, Ainara, Gonzalez, Itziar, Gurrutxaga, Antton, Uria, Larraitz, Urizar, Ruben, Foster, Jennifer, Lynn, Teresa, Elyovitch, Hevi, Ha-Cohen Kerner, Yaakov, Malka, Ruth, Jain, Kanishka, Puri, Vandana, Ratori, Shraddha, Shukla, Vishakha, Srivastava, Shubham, Berk, Gozde, Erden, Berna, and Yirmibeşoğlu, Zeynep
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- multiword expressions, verbal multiword expressions, light verb construction, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
- Language:
- German, Modern Greek (1453-), Basque, French, Irish, Hebrew, Hindi, Italian, Polish, Portuguese, Romanian, Swedish, Turkish, and Chinese
- Description:
- This multilingual resource contains corpora in which verbal MWEs have been manually annotated, gathered at the occasion of the 1.2 edition of the PARSEME Shared Task on semi-supervised Identification of Verbal MWEs (2020). VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). For the 1.2 shared task edition, the data covers 14 languages, for which VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CONLL-U format. Morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.2 (2020). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.2
- Rights:
- PARSEME Shared Task Data (v. 1.2) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.2, and PUB
16. Antologia współczesnych poetów polskich: z podobiznami niektórych autorow
- Creator:
- Króliński, Kazimierz
- Publisher:
- Maniszewski
- Format:
- print and 636 s.
- Type:
- model:monograph and TEXT
- Subject:
- Polská literatura (o ní), polská poezie, 11, and 821.162.1.09
- Language:
- Polish
- Description:
- ułožył Kazimierz Królinski.
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
17. Antyhitlerowskie niemieckie ośrodki propagandy na terenie Czechosłowacji w latach 1933-1938 :
- Creator:
- Fiedor, Karol,
- Type:
- text and studie
- Subject:
- Dějiny Česka a Slovenska, antifašisté, propaganda politická, Československo 1918-1938, politické dějiny, politici, Německo, světové dějiny 1918-1945, and zahraniční politika, mezinárodní vztahy
- Language:
- Polish
- Rights:
- unknown
18. Archeologia u progu trzeciego tysiąclecia /
- Creator:
- Tabaczyński, Stanisław,
- Type:
- text and studie
- Subject:
- Archeologie and archeologie
- Language:
- Polish
- Description:
- Archaeollogy at the beginning of the third millennium.
- Rights:
- unknown
19. Archeologické rozhledy
- Type:
- model:periodicalitem and TEXT
- Language:
- Czech, German, English, and Polish
- Description:
- 4
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
20. Archiwum Jana Zamoyskiego, kanclerza i hetmana wielkiego koronnego.
- Creator:
- Zamoyski, Maurycy,
- Type:
- text and prameny
- Subject:
- Dějiny zemí střední Evropy, Zamojski, Jan,, politici polští, dějiny polské, Polsko, světové dějiny 1492-1648, and politické dějiny, politici
- Language:
- Polish
- Rights:
- unknown
21. Archiwum Jana Zamoyskiego, kanclerza i hetmana wielkiego koronnego.
- Creator:
- Zamoyski, Maurycy,
- Type:
- text and prameny
- Subject:
- Dějiny zemí střední Evropy, Zamojski, Jan,, dějiny polské, Polsko, světové dějiny 1492-1648, and přehledná zpracování (tematicky)
- Language:
- Polish
- Rights:
- unknown
22. Artykuły luksusowe na stole królewskim w późnośredniowiecznej Polsce /
- Creator:
- Januszek-Sieradzka, Agnieszka,
- Type:
- text and studie
- Subject:
- Dějiny zemí střední Evropy, středověk pozdní, dvory panovnické, panovníci polští, potraviny, luxus, Polsko, světové dějiny středověku (do r. 1492), panovníci, panovnické rody, dvory, and oděv, strava
- Language:
- Polish
- Description:
- Luxury Products on the Royal Table in Late Medieval Poland.
- Rights:
- unknown
23. Astryk-Anastazy opat Trzemeszyński (1001) /
- Creator:
- Wojciechowski, Tadeusz,
- Type:
- text and studie
- Subject:
- Křesťanství. Křesťanská církev všeobecně. Eklesiologie, Anastazy-Astryk,, opati, duchovenstvo, české země 895/906-1197, and jednotlivci (církevní dějiny)
- Language:
- Polish
- Rights:
- unknown
24. Ateny Wołyńskie :
- Creator:
- Rolle, Michał,
- Type:
- text and monografie
- Subject:
- Czacki, Tadeusz,, města ukrajinská, biografie, školy vysoké, historici polští, Polsko, města, obce, and světové dějiny 1789-1918
- Language:
- Polish
- Rights:
- unknown
25. Atlas do dziejów Polski zawierający trzynaście mapek kolorowanych /
- Creator:
- Niewiadomski, Eligiusz,
- Type:
- text and atlasy
- Subject:
- Historická geografie, dějiny polské, Polsko, přehledná zpracování světových dějin (chronologicky), přehledná zpracování (tematicky), and historická kartografie, atlasy, staré mapy
- Language:
- Polish
- Rights:
- unknown
26. Atlas nazw geograficznych słowiańszczyzny zachodniej =
- Creator:
- Kozierowski, Stanisław Dołęga,
- Type:
- text and atlasy
- Subject:
- Obecná geografie. Systematická geografie, toponomastika, jména místní, zahraniční historická onomastika a toponomastika, and Polsko
- Language:
- Polish
- Rights:
- unknown
27. Atlas nazw geograficznych słowiańszczyzny zachodniej = :
- Creator:
- Kozierowski, Stanisław Dołęga,
- Type:
- text and atlasy
- Subject:
- Dějiny Evropy, toponomastika, Slované západní, and zahraniční historická onomastika a toponomastika
- Language:
- Polish
- Rights:
- unknown
28. Badając przeszłość, poznawałem też współczesne Czechy :
- Creator:
- Gmiterek, Henryk,
- Type:
- text and studie
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, historici polští, historiografie polská, vztahy česko-polské, novověk raný, vzpomínky, historici, historici (jubilea, nekrology apod.), Polsko, světové dějiny od r. 1945 do současnosti, and dějepisectví, historické vědy, historici
- Language:
- Polish
- Rights:
- unknown
29. Badania nad fauną pluskwiaków drzew i krzewów w Polsce =: Untersuchungen der Wanzenfauna der Bäume und Sträucher Polens
- Creator:
- Strawiński, Konstanty
- Publisher:
- nákl. vl.
- Format:
- print, text, regular print, and 216 s.
- Type:
- model:monograph and TEXT
- Subject:
- Lesnictví, hmyz, ploštice, výskyt, stromy, keře, Polsko, 630, 24, and UK01
- Language:
- Polish and German
- Description:
- Konstanty Strawiński, Obsahuje bibliografii, and Německé resumé
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
30. Badania nazw topograficznych na obszarze dawnej wschodniej Wielkopolski.
- Creator:
- Kozierowski, Stanisław Dołęga,
- Type:
- text and monografie
- Subject:
- Dějiny Evropy, toponomastika, filologie, topografie historická, Polsko, jazyk, písmo, přehledná zpracování světových dějin (chronologicky), zahraniční historická onomastika a toponomastika, and zahraniční historická geografie a kartografie
- Language:
- Polish
- Rights:
- unknown
31. Badania nazw topograficznych na obszarze dawnej zachodniej i środkowej Wielkopolski.
- Creator:
- Kozierowski, Stanisław Dołęga,
- Type:
- text and monografie
- Subject:
- Dějiny zemí střední Evropy, toponomastika, dějiny polské, and zahraniční historická onomastika a toponomastika
- Language:
- Polish
- Rights:
- unknown
32. Bardejov
- Publisher:
- Vojenský zeměpisný ústav
- Format:
- map and 1 mapa : barevná ; 39 x 50 cm na listu 47 x 63 cm
- Type:
- model:map, cartographic, and IMAGE
- Subject:
- udc:913(4), Konspekt:7, udc:912, udc:913(437.6), udc:912.43, udc:(084.3), Konspekt:Geografie Evropy, reálie, cestování, Konspekt:Mapy. Atlasy. Glóby, and czenas:Bardejov (Slovensko : oblast)
- Language:
- Czech, Slovak, and Polish
- Description:
- 4266, Edice dle kladu listů, and (Language) Místní názvy slovensky a polsky
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
33. Bartosza Paprockiego Dwie broszury polityczne z lat 1587 i 1588 /
- Creator:
- Paprocký z Hlohol a Paprocké Vůle, Bartoloměj,
- Type:
- text, prameny, and edice
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, Paprocký z Hlohol a Paprocké Vůle, Bartoloměj,, <<z >>Rožmberka, Vilém,, publicistika, vztahy česko-polské, české země 1526-1620, novinářství, tisk, Polsko, and světové dějiny 1492-1648
- Language:
- Polish
- Description:
- Pamięć nierządu w Polscze (pol.); Upominek albo przestroga zacnemu narodowi polskiemu (pol.)
- Rights:
- unknown
34. Bernardyni polscy.
- Creator:
- Kantak, Stefan Kamil Juliusz,
- Type:
- text and monografie
- Subject:
- Dějiny zemí střední Evropy, řád, bernardini, Polsko, církevní řády a kongregace, náboženská bratrstva, kláštery, and světové dějiny novověku (1492-1918)
- Language:
- Polish
- Rights:
- unknown
35. Bibliografia historyi polskiej.
- Creator:
- Finkel, Ludwik,
- Type:
- text
- Subject:
- Bibliografie. Katalogy, bibliografie oborové, bibliografie historická, dějiny polské, Polsko, přehledná zpracování (tematicky), přehledná zpracování světových dějin (chronologicky), and bibliografie oborové a tematické, rejstříky časopisů
- Language:
- Polish
- Rights:
- unknown
36. Bibliografia kopernikowska.
- Creator:
- Baranowski, Henryk,
- Type:
- text and bibliografie
- Subject:
- Bibliografie. Katalogy, Koperník, Mikuláš,, vědy přírodní, astronomové, Polsko, vědy o neživé přírodě, přírodní prostředí, astronomie, světové dějiny 1492-1648, and personální bibliografie
- Language:
- Polish
- Rights:
- unknown
37. Bibliografia słowianoznawstwa polskiego /
- Type:
- text and bibliografie
- Subject:
- Bibliografie. Katalogy, slovanství, slavistika, and personální bibliografie
- Language:
- Polish
- Rights:
- unknown
38. Bibliografický přehled českých národních písní: seznam studií, starších sbírek rukopisných, sbírek tištěných, překladů s vybranými ukázkami a podrobný abecední ukazatel písní, v knize uvedených i vůbec písní tiskem uveřejněných
- Creator:
- Čeněk Zíbrt and Česká akademie císaře Františka Josefa pro vědy, slovesnost a umění
- Publisher:
- Nákladem České akademie císaře Františka Josefa pro vědy, slovesnost a umění
- Format:
- print, svazek, and 326 stran.
- Type:
- model:monograph and TEXT
- Subject:
- Vokální hudba, Bibliografie. Katalogy, české lidové písně, historické prameny, Česko, 784.4(=162.3), (016), (437.3), 9, 12, 784, and 01
- Language:
- Czech, English, French, German, Italian, Latin, Polish, and Russian
- Description:
- sestavil Čeněk Zíbrt., Obsahuje rejstříky., Částečně souběžný anglický, francouzský, německý, italský, latinský, polský a ruský text, and Vydává III. třída České akademie císaře Františka Josefa pro vědy, slovesnost a umění v Praze
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
39. Biskup krakowski Zbigniew Oleśnicki (1423-1455) wobec husytyzmu i polityki polsko-czeskiej /
- Creator:
- Graff, Tomasz,
- Type:
- text and studie
- Subject:
- Dějiny křesťanské církve, Oleśnicki, Zbigniew,, biskupové krakovští, husitství, vztahy česko-polské, Polsko, světové dějiny středověku (do r. 1492), české země 1419-1471, jednotlivci (církevní dějiny), and zahraniční politika, mezinárodní vztahy
- Language:
- Polish
- Description:
- Bishop of Cracow, Zbigniew Oleśnicki (1423-1455) against the Hussite movement and Polish-Czech policy.
- Rights:
- unknown
40. Bitwa grochowska /
- Creator:
- Szpotański, Stanisław,
- Type:
- text and studie
- Subject:
- Dějiny zemí střední Evropy, Chrzanowski, Wojciech,, bitvy, povstání polská, Polsko, Rusko, vojenské operace, války, bitvy, and světové dějiny 1789-1918
- Language:
- Polish
- Rights:
- unknown
41. Bogurodzica :
- Creator:
- Brückner, Aleksander,
- Type:
- text and studie
- Subject:
- Náboženství, Vojtěch,, písně, vztahy česko-polské, biskupové pražští, Polsko, hudba, tanec, hudební nástroje, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
42. Bojkot towarów niemieckich w Polsce w latach 1933-1935 /
- Creator:
- Tomaszewski, Jerzy,
- Type:
- text and studie
- Subject:
- Ekonomie, bojkot, hospodářství, antisemitismus, Židé, vztahy polsko-německé, Polsko, světové dějiny 1918-1945, hospodářské dějiny, and antisemitismus, perzekuce, pogromy
- Language:
- Polish
- Description:
- Boycott of German Goods in Poland, 1933-1935.
- Rights:
- unknown
43. Boleslaw Chrobry Wielki /
- Creator:
- Zakrzewski, Stanisław,
- Type:
- text and biografie
- Subject:
- Dějiny Evropy, Boleslav, panovníci polští, dějiny politické, Polsko, panovníci, panovnické rody, dvory, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
44. Bracia czescy w dawnej Polsce :
- Creator:
- Rott, Dariusz,
- Type:
- text and monografie
- Subject:
- Křesťanství. Křesťanská církev všeobecně. Eklesiologie, Jednota bratrská, vztahy česko-polské, světové dějiny novověku (1492-1918), Polsko, církve, sekty, and české země 1526-1792
- Language:
- Polish
- Rights:
- unknown
45. Bronzy Małopolski Środkowej :
- Creator:
- Sulimirski, Tadeusz,
- Type:
- text and monografie
- Subject:
- Archeologie, doba bronzová, archeologie, nálezy, světové dějiny - doba bronzová, and Polsko
- Language:
- Polish
- Rights:
- unknown
46. C4Corpus (CC BY-NC part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
47. C4Corpus (CC BY-NC-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB
48. C4Corpus (CC BY-NC-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
49. C4Corpus (CC BY-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
50. C4Corpus (CC BY-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
51. C4Corpus (CC-BY part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
52. C4Corpus (publicdomain part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Dutch, Norwegian, Polish, Portuguese, Russian, Slovenian, Somali, Spanish, Swahili (macrolanguage), Swedish, Tagalog, Thai, Turkish, Ukrainian, Undetermined, and Vietnamese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Public Domain Mark (PD), http://creativecommons.org/publicdomain/mark/1.0/, and PUB
53. Casimiri IV regis tempora complectens (1447-1492) /
- Type:
- text, listiny, and regesty
- Subject:
- Dějiny zemí střední Evropy, Kazimír, vztahy česko-polské, Polsko, politické dějiny, politici, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
54. Ceny w Gdańsku w latach 1701-1815 = :
- Creator:
- Furtak, Tadeusz
- Type:
- text and monografie
- Subject:
- Dějiny zemí střední Evropy, ceny, města polská, dějiny hospodářské, Polsko, finančnictví, světové dějiny 1648-1789, and světové dějiny 1789-1918
- Language:
- Polish
- Rights:
- unknown
55. Ceny w Lublinie od XVI do końca XVIII wieku = :
- Creator:
- Adamczyk, Władysław
- Type:
- text and monografie
- Subject:
- Dějiny zemí střední Evropy, ceny, města, dějiny hospodářské, Polsko, světové dějiny novověku (1492-1918), and finančnictví
- Language:
- Polish
- Rights:
- unknown
56. Česká literatura v polských překladech (1989-2020) =
- Creator:
- Goszczyńska, Joanna,
- Type:
- text and monografie kolektivní
- Subject:
- Česká literatura (o ní), literatura česká, překlady literární, jazyk polský, bibliografie oborové, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech and Polish
- Rights:
- unknown
57. Církevní dějiny Slezska 18. až 20. století /
- Creator:
- Jirásek, Zdeněk,
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny křesťanské církve, dějiny církevní, přehledná zpracování dějin českých zemí (chronologicky), and církevní a náboženské dějiny
- Language:
- Czech and Polish
- Description:
- 200 výt., Církevní dějiny Slezska 18.-20. století, and Obálkový a hřbetní název: Církevní dějiny Slezska 18.-20. století
- Rights:
- unknown
58. Codex epistolaris saeculi decimi quinti.
- Type:
- text and prameny
- Subject:
- Dějiny zemí střední Evropy, dějiny polské, vztahy polsko-české, Polsko, světové dějiny 1492-1648, přehledná zpracování (tematicky), and diplomatika, edice
- Language:
- Polish
- Rights:
- unknown
59. CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data
- Creator:
- Zeman, Daniel and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- tokenization, word segmentation, morphology, tagging, syntax, parsing, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies This package contains the test data in the form in which they ware presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) Universal Dependencies 2.2 (CoNLL 2018) See the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ http://universaldependencies.org/conll18/ Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data conll18-ud-test-2018-05-06 ... CoNLL 2018 test data conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
60. CoNLL 2017 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Straka, Milan, Popel, Martin, Dozat, Timothy, Qi, Peng, Manning, Christopher, Shi, Tianze, Wu, Felix G., Chen, Xilun, Cheng, Yao, Björkelund, Anders, Falenska, Agnieszka, Yu, Xiang, Kuhn, Jonas, Che, Wanxiang, Guo, Jiang, Wang, Yuxuan, Zheng, Bo, Zhao, Huaipeng, Liu, Yang, Teng, Dechuan, Liu, Ting, Lim, Kyungtae, Poibeau, Thierry, Sato, Motoki, Manabe, Hitoshi, Noji, Hiroshi, Matsumoto, Yuji, Kırnap, Ömer, Önder, Berkay Furkan, Yuret, Deniz, Straková, Jana, Vania, Clara, Zhang, Xingxing, Lopez, Adam, Heinecke, Johannes, Asadullah, Munshi, Kanerva, Jenna, Luotolahti, Juhani, Ginter, Filip, Kuan, Yu, Sofroniev, Pavel, Schill, Erik, Hinrichs, Erhard, Nguyen, Dat Quoc, Dras, Mark, Johnson, Mark, Qian, Xian, Vilares, David, Gómez-Rodríguez, Carlos, Aufrant, Lauriane, Wisniewski, Guillaume, Yvon, François, Dumitrescu, Stefan Daniel, Boroş, Tiberiu, Tufiş, Dan, Das, Ayan, Zaffar, Affan, Sarkar, Sudeshna, Wang, Hao, Zhao, Hai, Zhang, Zhisong, Hornby, Ryan, Taylor, Clark, Park, Jungyeul, de Lhoneux, Miryam, Shao, Yan, Basirat, Ali, Kiperwasser, Eliyahu, Stymne, Sara, Goldberg, Yoav, Nivre, Joakim, Akkuş, Burak Kerim, Azizoglu, Heval, Cakici, Ruket, Moor, Christophe, Merlo, Paola, Henderson, James, Wang, Haozhou, Ji, Tao, Wu, Yuanbin, Lan, Man, de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, More, Amir, Tsarfaty, Reut, Kanayama, Hiroshi, Muraoka, Masayasu, Yoshikawa, Katsumasa, Garcia, Marcos, and Gamallo, Pablo
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency parser and parsebank
- Language:
- Arabic, Bulgarian, Russia Buriat, Czech, Catalan, Church Slavic, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, French, Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Swedish, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- This package contains the system outputs from the CoNLL 2017 Shared Task in Multilingual Parsing from Raw Text to Universal Dependencies.
- Rights:
- Licence Universal Dependencies v2.0, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.0, and PUB
61. CoNLL 2018 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Duthoo, Elie, Mesnard, Olivier, Rybak, Piotr, Wróblewska, Alina, Che, Wanxiang, Liu, Yijia, Wang, Yuxuan, Zheng, Bo, Liu, Ting, Li, Zuchao, He, Shexia, Zhang, Zhuosheng, Zhao, Hai, Wu, Yingting, Tong, Jia-Jun, Nguyen, Dat Quoc, Verspoor, Karin, Wan, Hui, Naseem, Tahira, Lee, Young-Suk, Castelli, Vittorio, Ballesteros, Miguel, Hershcovich, Daniel, Abend, Omri, Rappoport, Ari, Smith, Aaron, Bohnet, Bernd, de Lhoneux, Miryam, Nivre, Joakim, Shao, Yan, Stymne, Sara, Kırnap, Ömer, Dayanık, Erenay, Yuret, Deniz, Kanerva, Jenna, Ginter, Filip, Miekka, Niko, Leino, Akseli, Salakoski, Tapio, Lim, KyungTae, Park, Cheoneum, Lee, Changki, Poibeau, Thierry, Bhat, Riyaz Ahmad, Bhat, Irshad, Bangalore, Srinivas, Qi, Peng, Dozat, Timothy, Zhang, Yuhao, Manning, Christopher, Boroș, Tiberiu, Dumitrescu, Stefan Daniel, Burtica, Ruxandra, Arakelyan, Gor, Hambardzumyan, Karen, Khachatrian, Hrant, Rosa, Rudolf, Mareček, David, Straka, Milan, Seker, Amit, More, Amir, Tsarfaty, Reut, Önder, Berkay Furkan, Gümeli, Can, Jawahar, Ganesh, Muller, Benjamin, Fethi, Amal, Martin, Louis, Villemonte de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, Özateş, Şaziye Betül, Özgür, Arzucan, Gungor, Tunga, Öztürk, Balkız, Ji, Tao, Liu, Yufang, Wang, Yijun, Wu, Yuanbin, Lan, Man, Chen, Danlu, Lin, Mengxiao, Hu, Zhifeng, and Qiu, Xipeng
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- parsed data, conllu, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- Test data parsed by systems submitted to the CoNLL 2018 UD parsing shared task.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
62. Coreference in Universal Dependencies 0.1 (CorefUD 0.1)
- Creator:
- Nedoluzhko, Anna, Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, and Zeman, Daniel
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, Dutch, English, French, German, Hungarian, Lithuanian, Polish, Russian, and Spanish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 0.1 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. References to original resources whose harmonized versions are contained in the public edition of CorefUD 0.1: - Catalan-AnCora: Recasens, M. and Martí, M. A. (2010). AnCora-CO: Coreferentially Annotated Corpora for Spanish and Catalan. Language Resources and Evaluation, 44(4):315–345 - Czech-PCEDT: Nedoluzhko, A., Novák, M., Cinková, S., Mikulová, M., and Mírovský, J. (2016). Coreference in Prague Czech-English Dependency Treebank. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 169–176, Portorož, Slovenia. European Language Resources Association. - Czech-PDT: Hajič, J., Bejček, E., Hlaváčová, J., Mikulová, M., Straka, M., Štěpánek, J., and Štěpánková, B. (2020). Prague Dependency Treebank - Consolidated 1.0. In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pages 5208–5218, Marseille, France. European Language Resources Association. - English-GUM: Zeldes, A. (2017). The GUM Corpus: Creating Multilayer Resources in the Classroom. Language Resources and Evaluation, 51(3):581–612. - English-ParCorFull: Lapshinova-Koltunski, E., Hardmeier, C., and Krielke, P. (2018). ParCorFull: a Parallel Corpus Annotated with Full Coreference. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association. - French-Democrat: Landragin, F. (2016). Description, modélisation et détection automatique des chaı̂nes de référence (DEMOCRAT). Bulletin de l’Association Française pour l’Intelligence Artificielle, (92):11–15. - German-ParCorFull: Lapshinova-Koltunski, E., Hardmeier, C., and Krielke, P. (2018). ParCorFull: a Parallel Corpus Annotated with Full Coreference. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association - German-PotsdamCC: Bourgonje, P. and Stede, M. (2020). The Potsdam Commentary Corpus 2.2: Extending annotations for shallow discourse parsing. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 1061–1066, Marseille, France. European Language Resources Association. - Hungarian-SzegedKoref: Vincze, V., Hegedűs, K., Sliz-Nagy, A., and Farkas, R. (2018). SzegedKoref: A Hungarian Coreference Corpus. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association. - Lithuanian-LCC: Žitkus, V. and Butkienė, R. (2018). Coreference Annotation Scheme and Corpus for Lithuanian Language. In Fifth International Conference on Social Networks Analysis, Management and Security, SNAMS 2018, Valencia, Spain, October 15-18, 2018, pages 243–250. IEEE. - Polish-PCC: Ogrodniczuk, M., Glowińska, K., Kopeć, M., Savary, A., and Zawisławska, M. (2013). Polish coreference corpus. In Human Language Technology. Challenges for Computer Science and Linguistics - 6th Language and Technology Conference, LTC 2013, Poznań, Poland, December 7-9, 2013. Revised Selected Papers, volume 9561 of Lecture Notes in Computer Science, pages 215–226. Springer. - Russian-RuCor: Toldova, S., Roytberg, A., Ladygina, A. A., Vasilyeva, M. D., Azerkovich, I. L., Kurzukov,M., Sim, G., Gorshkov, D. V., Ivanova, A., Nedoluzhko, A., and Grishina, Y. (2014). Evaluating Anaphora and Coreference Resolution for Russian. In Komp’juternaja lingvistika i intellektual’nye tehnologii. Po materialam ezhegodnoj Mezhdunarodnoj konferencii Dialog, pages 681–695. - Spanish-AnCora: Recasens, M. and Martí, M. A. (2010). AnCora-CO: Coreferentially Annotated Corpora for Spanish and Catalan. Language Resources and Evaluation, 44(4):315–345 References to original resources whose harmonized versions are contained in the ÚFAL-internal edition of CorefUD 0.1: - Dutch-COREA: Hendrickx, I., Bouma, G., Coppens, F., Daelemans, W., Hoste, V., Kloosterman, G., Mineur, A.-M., Van Der Vloet, J., and Verschelde, J.-L. (2008). A coreference corpus and resolution system for Dutch. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08), Marrakech, Morocco. European Language Resources Association. - English-ARRAU: Uryupina, O., Artstein, R., Bristot, A., Cavicchio, F., Delogu, F., Rodriguez, K. J., and Poesio, M. (2020). Annotating a broad range of anaphoric phenomena, in a variety of genres: the ARRAU Corpus. Natural Language Engineering, 26(1):95–128. - English-OntoNotes: Weischedel, R., Hovy, E., Marcus, M., Palmer, M., Belvin, R., Pradhan, S., Ramshaw, L., and Xue, N. (2011). Ontonotes: A large training corpus for enhanced processing. In Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, pages 54–63, New York. Springer-Verlag. - English-PCEDT: Nedoluzhko, A., Novák, M., Cinková, S., Mikulová, M., and Mírovský, J. (2016). Coreference in Prague Czech-English Dependency Treebank. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), pages 169–176, Portorož, Slovenia. European Language Resources Association.
- Rights:
- Licence CorefUD v0.1, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-0.1, and PUB
63. Coreference in Universal Dependencies 0.2 (CorefUD 0.2)
- Creator:
- Nedoluzhko, Anna, Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, and Zeman, Daniel
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, Dutch, English, French, German, Hungarian, Lithuanian, Polish, Russian, and Spanish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 0.2 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Version 0.2 consists of exactly the same datasets as the version 0.1. All automatically parsed datasets were re-parsed for v0.2 using UDPipe 2 with models trained on UD 2.6. Catalan-AnCora, Spanish-AnCora and English-GUM have been updated to match the their UD 2.9 versions.
- Rights:
- Licence CorefUD v0.2, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-0.2, and PUB
64. Coreference in Universal Dependencies 1.0 (CorefUD 1.0)
- Creator:
- Nedoluzhko, Anna, Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, Zeldes, Amir, Zeman, Daniel, Bourgonje, Peter, Cinková, Silvie, Hajič, Jan, Hardmeier, Christian, Krielke, Pauline, Landragin, Frédéric, Lapshinova-Koltunski, Ekaterina, Martí, M. Antònia, Mikulová, Marie, Ogrodniczuk, Maciej, Recasens, Marta, Stede, Manfred, Straka, Milan, Toldova, Svetlana, Vincze, Veronika, and Žitkus, Voldemaras
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, Dutch, English, French, German, Hungarian, Lithuanian, Polish, Russian, and Spanish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.0 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Version 1.0 consists of the same corpora and languages as the previous version 0.2; however, the English GUM dataset has been updated to a newer and larger version, and in the Czech/English PCEDT dataset, the train-dev-test split has been changed to be compatible with OntoNotes. Nevertheless, the main change is in the file format (the MISC attributes have new form and interpretation).
- Rights:
- Licence CorefUD v0.2, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-0.2, and PUB
65. Coreference in Universal Dependencies 1.1 (CorefUD 1.1)
- Creator:
- Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, Zeman, Daniel, Nedoluzhko, Anna, Acar, Kutay, Bourgonje, Peter, Cinková, Silvie, Cebiroğlu Eryiğit, Gülşen, Hajič, Jan, Hardmeier, Christian, Haug, Dag, Jørgensen, Tollef, Kåsen, Andre, Krielke, Pauline, Landragin, Frédéric, Lapshinova-Koltunski, Ekaterina, Mæhlum, Petter, Martí, M. Antònia, Mikulová, Marie, Nøklestad, Anders, Ogrodniczuk, Maciej, Øvrelid, Lilja, Pamay Arslan, Tuğba, Recasens, Marta, Solberg, Per Erik, Stede, Manfred, Straka, Milan, Toldova, Svetlana, Vadász, Noémi, Velldal, Erik, Vincze, Veronika, Zeldes, Amir, and Žitkus, Voldemaras
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, English, French, German, Hungarian, Lithuanian, Norwegian, Polish, Russian, Spanish, and Turkish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.1 consists of 21 datasets for 13 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 17 datasets for 12 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 2 for Hungarian, 1 for Lithuanian, 2 for Norwegian, 1 for Polish, 1 for Russian, 1 for Spanish, and 1 for Turkish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Compared to the previous version 1.0, the version 1.1 comprises new languages and corpora, namely Hungarian-KorKor, Norwegian-BokmaalNARC, Norwegian-NynorskNARC, and Turkish-ITCC. In addition, the English GUM dataset has been updated to a newer and larger version, and the conversion pipelines for most datasets have been refined (a list of all changes in each dataset can be found in the corresponding README file).
- Rights:
- Licence CorefUD v1.1, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-1.1, and PUB
66. Coreference in Universal Dependencies 1.2 (CorefUD 1.2)
- Creator:
- Popel, Martin, Novák, Michal, Žabokrtský, Zdeněk, Zeman, Daniel, Nedoluzhko, Anna, Acar, Kutay, Bamman, David, Bourgonje, Peter, Cinková, Silvie, Eckhoff, Hanne, Cebiroğlu Eryiğit, Gülşen, Hajič, Jan, Hardmeier, Christian, Haug, Dag, Jørgensen, Tollef, Kåsen, Andre, Krielke, Pauline, Landragin, Frédéric, Lapshinova-Koltunski, Ekaterina, Mæhlum, Petter, Martí, M. Antònia, Mikulová, Marie, Nøklestad, Anders, Ogrodniczuk, Maciej, Øvrelid, Lilja, Pamay Arslan, Tuğba, Recasens, Marta, Solberg, Per Erik, Stede, Manfred, Straka, Milan, Swanson, Daniel, Toldova, Svetlana, Vadász, Noémi, Velldal, Erik, Vincze, Veronika, Zeldes, Amir, and Žitkus, Voldemaras
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- coreference, bridging relations, harmonized annotation, dependency, and treebank
- Language:
- Ancient Greek (to 1453), Ancient Hebrew, Catalan, Czech, English, French, German, Hungarian, Lithuanian, Norwegian, Church Slavic, Polish, Russian, Spanish, and Turkish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.2 consists of 25 datasets for 16 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 21 datasets for 15 languages (1 dataset for Ancient Greek, 1 for Ancient Hebrew, 1 for Catalan, 2 for Czech, 3 for English, 1 for French, 2 for German, 2 for Hungarian, 1 for Lithuanian, 2 for Norwegian, 1 for Old Church Slavonic, 1 for Polish, 1 for Russian, 1 for Spanish, and 1 for Turkish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource, too. Compared to the previous version 1.1, the version 1.2 comprises new languages and corpora, namely Ancient_Greek-PROIEL, Ancient_Hebrew-PTNK, English-LitBank, and Old_Church_Slavonic-PROIEL. In addition, English-GUM and Turkish-ITCC have been updated to newer versions, conversion of zeros in Polish-PCC has been improved, and the conversion pipelines for multiple other datasets have been refined (a list of all changes in each dataset can be found in the corresponding README file).
- Rights:
- Licence CorefUD v1.2, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-1.2, and PUB
67. CorPipe 23 multilingual CorefUD 1.1 model (corpipe23-corefud1.1-231206)
- Creator:
- Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- tool and toolService
- Subject:
- coreference resolution, CorPipe, and CorefUD
- Language:
- Catalan, Czech, German, English, Spanish, French, Hungarian, Lithuanian, Norwegian Bokmål, Norwegian Nynorsk, Polish, Russian, and Turkish
- Description:
- The `corpipe23-corefud1.1-231206` is a `mT5-large`-based multilingual model for coreference resolution usable in CorPipe 23 (https://github.com/ufal/crac2023-corpipe). It is released under the CC BY-NC-SA 4.0 license. The model is language agnostic (no _corpus id_ on input), so it can be used to predict coreference in any `mT5` language (for zero-shot evaluation, see the paper). However, note that the empty nodes must be present already on input, they are not predicted (the same settings as in the CRAC23 shared task).
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
68. Corpus for training and evaluating diacritics restoration systems
- Creator:
- Náplava, Jakub, Straka, Milan, Hajič, Jan, and Straňák, Pavel
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- diacritical marks generation and natural language correction
- Language:
- Czech, Vietnamese, Romanian, Polish, Slovak, Spanish, Croatian, Irish, Latvian, Hungarian, French, and Turkish
- Description:
- Corpus of texts in 12 languages. For each language, we provide one training, one development and one testing set acquired from Wikipedia articles. Moreover, each language dataset contains (substantially larger) training set collected from (general) Web texts. All sets, except for Wikipedia and Web training sets that can contain similar sentences, are disjoint. Data are segmented into sentences which are further word tokenized. All data in the corpus contain diacritics. To strip diacritics from them, use Python script diacritization_stripping.py contained within attached stripping_diacritics.zip. This script has two modes. We generally recommend using method called uninames, which for some languages behaves better. The code for training recurrent neural-network based model for diacritics restoration is located at https://github.com/arahusky/diacritics_restoration.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
69. Črty uhlem
- Creator:
- Sienkiewicz, Henryk and Moudrý, Cyril S.
- Publisher:
- J.F. Kubeš
- Format:
- print, Text, regular print, and 104 s. ; 19 cm
- Type:
- model:monograph and TEXT
- Subject:
- 821.162.1-32 and (0:82-32)
- Language:
- Czech and Polish
- Description:
- Converted from MARCXML to MODS version 3.5 using MARC21slim2MODS3-5.xsl (Revision 1.106 2014/12/19)(EE patch 2015/05/15), obrázky z venkovského života od Henryka Sienkiewicze ; z polštiny přeložil Cyril S. Moudrý, Rok vyd. a název originálu z bibliografického katalogu 19. stol., Přívazek k: Historické arabesky / Z. Winter, and Converted from MODS 3.5 to DC version 1.8 (EE patch 2015/06/25)
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
70. CUBBITT Translation Models (en-pl) (v1.0)
- Creator:
- Popel, Martin, Tomková, Markéta, Tomek, Jakub, Kaiser, Łukasz, Uszkoreit, Jakob, Bojar, Ondřej, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- tool and toolService
- Subject:
- machine translation, neural machine translation, transformer, and cubbitt
- Language:
- English and Polish
- Description:
- CUBBITT En-Pl translation models, exported via TensorFlow Serving, available in the Lindat translation service (https://lindat.mff.cuni.cz/services/translation/). Models are compatible with Tensor2tensor version 1.6.6. For details about the model training (data, model hyper-parameters), please contact the archive maintainer. Evaluation on newstest2020 (BLEU): en->pl: 12.3 pl->en: 20.0 (Evaluated using multeval: https://github.com/jhclark/multeval)
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
71. Cuius ius? O istocie władzy - dyskusja między Luksemburgami i śląskimi Piastami /
- Creator:
- Wiszewski, Przemysław,
- Type:
- text and studie
- Subject:
- Mezinárodní vztahy, světová politika, Lucemburkové (rod), Piastovci (rod), rody panovnické, právo dědické, panovníci, vláda panovnická, české země 1306-1419, Polsko, světové dějiny středověku (do r. 1492), panovníci, panovnické rody, dvory, and politické dějiny, politici
- Language:
- Polish
- Description:
- Cuius ius? On the essence of government - the debate between the Luxembourgs and the Piast dynasty of Silesia.
- Rights:
- unknown
72. Cyrillo-Methodiana /
- Creator:
- Brückner, Aleksander,
- Type:
- text and studie
- Subject:
- Dějiny křesťanské církve, Metoděj,, Konstantin,, světci čeští, Velká Morava, sv. Konstantin a Metoděj, and jednotlivci (církevní dějiny)
- Language:
- Polish
- Rights:
- unknown
73. Cywilizacja i język :
- Creator:
- Brückner, Aleksander,
- Type:
- text and monografie
- Subject:
- Dějiny Evropy, dějiny kultury, dějiny jazyka, jazyk polský, slova cizí, slova přejatá, Polsko, jazyk, písmo, and přehledná zpracování světových dějin (chronologicky)
- Language:
- Polish
- Rights:
- unknown
74. Czasy Karola IV we wrocławskim rękopisie "Starych Latopisów Czeskich" /
- Creator:
- Heck, Roman,
- Type:
- text and studie
- Subject:
- Dějiny Česka a Slovenska, Karel, rukopisy, historiografie středověká, české země 1306-1526, and dějepisectví, historické vědy, historici
- Language:
- Polish
- Rights:
- unknown
75. Czechosłowacja jako państwo pohabsburskie :
- Creator:
- Ther, Philipp,
- Type:
- text and studie
- Subject:
- Dějiny Česka a Slovenska, vznik Československa (1918), stát, kontinuita, historiografie, Československo 1918-1938, vznik Československa 1918, and historiografie, vědecké projekty
- Language:
- Polish
- Rights:
- unknown
76. Czeska Rada Narodowa w Żytawie /
- Creator:
- Pałys, Piotr,
- Type:
- text and studie
- Subject:
- Vnitropolitický vývoj, politický život, Češi němečtí, vztahy československo-německé, hranice státní, výbory národní, činnost osvětová, činnost kulturní, repatriace, vědecké instituce, společnosti, spolky, vysoké školy, Československo 1938-1945, dějiny správy, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Polish
- Description:
- Der Tschechische Nationalausschluss in Zittau. and Český národní výbor v Žitavě.
- Rights:
- unknown
77. Czeskie ślady w działalności wrocławskiej oficyny Georga Baumanna z lat 1590-1607 /
- Creator:
- Karlak, Weronika
- Type:
- text and studie
- Subject:
- Rukopisy, prvotisky, staré tisky. Vzácná a pozoruhodná díla, Baumann, Georg,, tiskaři polští, tisky staré, vztahy polsko-české, světové dějiny 1492-1648, Polsko, staré tisky, české země 1526-1620, dějiny literatury, jazyka a knihy, and dějiny knihy, knihtisk, nakladatelství
- Language:
- Polish
- Description:
- Tschechische Spuren der Tätigkeit der Breslauer Offizin Georg Baumanns des Älteren (1590-1607).
- Rights:
- unknown
78. Czołowi przedstawiciele Stańczyków - Józef Szujski a Stanisław Tarnowski - wobec Juliusza Słowackiego /
- Creator:
- Pelikán, Jarmil,
- Type:
- text and studie
- Subject:
- Polská literatura (o ní), Szujski, Józef,, Tarnowski, Stanisław,, Słowacki, Juliusz,, literatura polská, recepce literatury, Polsko, světové dějiny 1789-1918, and literatura, spisovatelé
- Language:
- Polish
- Description:
- Die Hauptvertreter des Stanczycy - Szujski und Tarnowski - über Julius Slowacki
- Rights:
- unknown
79. Czy doświadczennia bankowości Polski międzyvojennej mogą być przydatne dla polskiej bankowość współczesnej? /
- Creator:
- Landau, Zbigniew,
- Type:
- text and studie
- Subject:
- Finance, bankovnictví, kapitál zahraniční, banky, Polsko, světové dějiny 1918-1945, and finančnictví
- Language:
- Polish
- Description:
- Jsou zkušenosti meziválečného polského bankovnictví užitečné pro současné polské bankovnictví?
- Rights:
- unknown
80. Czy imię Roslan/Rusłan na słowiański rodowód? /
- Creator:
- Abramowicz, Zofia
- Type:
- text and studie
- Subject:
- Slovanské jazyky, onomastika, and historická onomastika
- Language:
- Polish
- Rights:
- unknown
81. DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking
- Creator:
- Kubeša, David and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- entity linking, NEL, NER, dataset, and knowledge base
- Language:
- Afrikaans, Arabic, Armenian, Basque, Belarusian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Maltese, Marathi, Modern Greek (1453-), Northern Sami, Norwegian Nynorsk, Persian, Polish, Portuguese, Romanian, Russian, Scottish Gaelic, Serbian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, Uighur, Ukrainian, Urdu, Vietnamese, and Wolof
- Description:
- We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains language-agnostic information about entities, including their claims from Wikidata and named entity types (PER, ORG, LOC, EVENT, BRAND, WORK_OF_ART, MANUFACTURED); and Wikipedia texts with entity mentions linked to the knowledge base, along with language-specific text from Wikidata such as labels, aliases, and descriptions, stored separately for each language. The Wikidata QID is used as a persistent, language-agnostic identifier, enabling the combination of the knowledge base with language-specific texts and information for each entity. Wikipedia documents deliberately annotate only a single mention for every entity present; we further automatically detect all mentions of named entities linked from each document. The dataset contains 27.9M named entities in the knowledge base and 12.3G tokens from Wikipedia texts. The dataset is published under the CC BY-SA licence.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
82. Das Neumarkter Rechtsbuch und andere Neumarkter Rechtsquellen /
- Creator:
- Meinardus, Otto,
- Type:
- text, monografie, and prameny
- Subject:
- Dějiny zemí střední Evropy, Právo, právo městské, Polsko, ústavní a právní dějiny, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
83. Dawne polskie prawo karne :
- Creator:
- Rafacz, Józef,
- Type:
- text and monografie
- Subject:
- Právo, právo trestní, dějiny práva, dějiny polské, Polsko, ústavní a právní dějiny, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
84. Dawne polskie prawo sądowe w zarysie /
- Creator:
- Kutrzeba, Stanisław,
- Type:
- text and monografie
- Subject:
- Právo, Dějiny Evropy, dějiny práva, právo trestní, Polsko, přehledná zpracování světových dějin (chronologicky), and ústavní a právní dějiny
- Language:
- Polish
- Rights:
- unknown
85. Dawne warownie krakowskie /
- Creator:
- Muczkowski, Józef,
- Type:
- text and monografie
- Subject:
- Dějiny Evropy, opevnění městská, vztahy česko-polské, Polsko, zbraně, vojenská technika a zařízení, opevnění, světové dějiny středověku (do r. 1492), dějiny vojenství, and české země 1197-1306
- Language:
- Polish
- Rights:
- unknown
86. Dawny proces polski /
- Creator:
- Rafacz, Józef,
- Type:
- text and monografie
- Subject:
- Právo, Dějiny zemí střední Evropy, dějiny práva, procesy soudní, dějiny soudnictví, dějiny polské, Polsko, ústavní a právní dějiny, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
87. Deep Universal Dependencies 2.4
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, and Galician
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-2988). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.4, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.4, and PUB
88. Deep Universal Dependencies 2.5
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, and Skolt Sami
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3105). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.5, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.5, and PUB
89. Deep Universal Dependencies 2.6
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, and Persian
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3226). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.6, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.6, and PUB
90. Deep Universal Dependencies 2.7
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, and Tupinambá
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3424). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.7, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.7, and PUB
91. Deep Universal Dependencies 2.8
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, Tupinambá, Beja, Western Frisian, Urubú-Kaapor, Kangri, K'iche', Low German, Makuráp, Western Armenian, and Central Siberian Yupik
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3687). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.8, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.8, and PUB
92. Deltacorpus
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
93. Deltacorpus 1.1
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia). Changes in version 1.1: 1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset. 2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0. 3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
94. Demonologia ludowa - propozycje do systematyki :
- Creator:
- Kłodnicki, Zygmunt,
- Type:
- text and studie
- Subject:
- Kulturní antropologie. Etnologie. Etnografie, démonologie, etnologie, metodologie, Polsko, přehledná zpracování světových dějin (chronologicky), církevní a náboženské dějiny, and zahraniční národopis
- Language:
- Polish
- Description:
- Vorschläge zur Systematik der Volksdämonologie des Polnischen ethnographischen Atlas in Cieszyn.
- Rights:
- unknown
95. Denar świętego Piotra obrońcą jedności politycznej i kościelnej w Polsce /
- Creator:
- Ptaśnik, Jan,
- Type:
- text and monografie
- Subject:
- Dějiny zemí střední Evropy, dějiny polské, haléř svatopetrský, papežství, důchody papežské, Polsko, papežství, církevní politika, and světové dějiny středověku (do r. 1492)
- Language:
- Polish
- Rights:
- unknown
96. Der Widerstand Breslaus gegen Georg von Podiebrad /
- Creator:
- Koebner, Richard,
- Type:
- text and monografie
- Subject:
- Dějiny zemí střední Evropy, Jiří z Poděbrad,, vztahy česko-polské, města polská, Polsko, města, obce, české země 1419-1471, světové dějiny středověku (do r. 1492), and dějiny osídlení, regionální dějiny
- Language:
- Polish
- Rights:
- unknown
97. Determinizm nauk przyrodniczych
- Creator:
- Metallmann, Joachim
- Publisher:
- Polska Akademja Umiejętności
- Format:
- print and xiv, 424 s.
- Type:
- text, volume, pojednání, model:monograph, and TEXT
- Subject:
- Speciální metafyzika, determinismus, přírodní vědy, 123.2, 5, (049), and 122/129
- Language:
- Polish
- Description:
- Joachim Metallmann. and KČSN
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
98. Devadesát let Pavla Spunara (* 21. května 1928, Praha) /
- Creator:
- Hlaváček, Ivan,
- Type:
- text and články jubilejní
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, Spunar, Pavel,, historici, kodikologové, paleografie, kultura středověká, jubilea životní, and historici (jubilea, nekrology apod.)
- Language:
- Polish
- Rights:
- unknown
99. Dictionarius seu nomenclatura quatuor linguarum Latine Italice Polonice & Theutonice, aprime cuiuis vtilissimus cum peregrinantibus tum domiresidentibus Adiecto vocabulorum indice: Vokabularź nowy czterzech iezikow: Laczinskigo:Wloskiego:Polskiego:Niemieczkiego wssem w tey slawney koronie y innym narodom barzo vźytecžny
- Publisher:
- [S.n.]
- Type:
- Text, model:monograph, and TEXT
- Subject:
- 054
- Language:
- Latin and Polish
- Description:
- Chybí listy, 3 poslední kapitoly 2. knihy. Neúplné. Čísl. vrstvami. Sign. Písmo gotické. Rubriky. Sazba ve vokabuláři 4-sloupcová. Linky. Viněta na fol. 4a propriová. Vazba původní, renesanční, žlutá kůže s intarsiemi, poškozená. Hřbet natřen barvou světle hnědou. Pův. maj. býv. kl. frant. v Uh. Hradišti. and Pův. maj. býv. kl. frant. v Uh. Hradišti
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
100. Die Anfänge des Bistums Posen und die Reihe seiner Bischöfe von 968-1489 /
- Creator:
- Sappok, Gerhard
- Type:
- text and monografie
- Subject:
- Dějiny Evropy, biskupství poznaňské, biskupové poznaňští, Polsko, světové dějiny středověku (do r. 1492), and církevní správa a hospodářství
- Language:
- Polish
- Rights:
- unknown