« Previous |
1 - 100 of 316
|
Next »
Number of results to display per page
Search Results
2. 350 años de la Paz de Westfalia :
- Type:
- text and sborníky konferenční
- Subject:
- Dějiny Evropy, mír vestfálský (1648), válka třicetiletá (1618-1648), politika zahraniční, politické dějiny, politici, světové dějiny 1492-1648, and zahraniční periodika a sborníky
- Language:
- Spanish, English, German, French, Polish, and Modern Greek (1453-)
- Rights:
- unknown
3. 380 let knížectví Lichtenštejn v Moravském Krumlově :
- Creator:
- Vařeka, Marek,
- Publisher:
- Masarykovo muzeum,
- Type:
- publikace informační
- Subject:
- Genealogie. Heraldika. Šlechta. Vlajky, Lichtenštejnové (rod), rody šlechtické, držba majetková, české země 1526-1792, šlechta, buržoazie, měšťanstvo, podnikatelé, and české země 1792-1918
- Language:
- Czech, English, and Polish
- Rights:
- unknown
4. Abstrakce.PL :
- Creator:
- Czerni, Krystyna
- Type:
- text, statický obraz, and katalogy výstav
- Subject:
- Malířství, malířství, umění abstraktní, sbírky umělecké, Polsko, světové dějiny od r. 1945 do současnosti, malířství, malíři, and české a československé výstavy
- Language:
- Czech, English, and Polish
- Description:
- Vydáno ke stejnojmenné výstavě konané v Muzeu umění Olomouc ve dnech 20.4.-19.8.2018
- Rights:
- unknown
5. Acta historico-iuridica Pilsnensia 2009-2010 /
6. Acta onomastica
- Type:
- text and sborníky
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, Russian, English, Slovak, and Polish
- Rights:
- unknown
7. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, German, English, Slovak, and Polish
- Rights:
- unknown
8. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, German, English, Slovak, and Polish
- Rights:
- unknown
9. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, Russian, English, Slovak, and Polish
- Rights:
- unknown
10. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, German, English, Slovak, and Polish
- Rights:
- unknown
11. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, English, Polish, and Slovak
- Rights:
- unknown
12. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, English, Polish, and Slovak
- Rights:
- unknown
13. Aktuální otázky slovanské filologie a Šafaříkův vědecký odkaz /
- Type:
- text and sborníky
- Subject:
- Filologie, Šafařík, Pavel Josef,, slavistika, slavisté, filologie slovanská, české (československé) sborníky a kolektivní monografie, české země 1792-1918, and dějiny slavistiky
- Language:
- Czech, English, German, Italian, Polish, Russian, and Slovak
- Description:
- Zvl. otisk čas. Slavia 65 (1996), seš. 1, str. 1-162
- Rights:
- unknown
14. Alfred Neumann :
- Creator:
- Neumann, Alfred,
- Type:
- text, statický obraz, and katalogy výstav
- Subject:
- Architektura, Neumann, Alfred,, architekti, dějiny architektury, Československo 1918-1992, světové dějiny od r. 1918 do současnosti, and architektura, architekti
- Language:
- Czech, English, German, and Polish
- Description:
- Katalog vydán u příležitosti výstavy ... 2.4.-7.6.2015 Dům umění, GVUO a Kabinet architektury Ostrava, Česká republika, 18.6.-27.9.2015 Muzeum Architektury we Wrocławiu, Polsko, 14.10.-1.11.2015 Bauhaus-Universität Weimar, Německo, 18.2.-22.5.2016 Moravská galerie v Brně, Česká republika
- Rights:
- unknown
15. Almanach medievisty-editora /
- Creator:
- Krafl, Pavel,
- Publisher:
- Historický ústav,
- Type:
- almanachy
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, Bibliografie. Katalogy, prameny písemné, archiváři, editoři, práce ediční, bibliografie personální, české (československé) sborníky a kolektivní monografie, diplomatika a paleografie, and personální bibliografie
- Language:
- Czech, English, German, and Polish
- Rights:
- unknown
16. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)
- Creator:
- Ramisch, Carlos, Cordeiro, Silvio Ricardo, Savary, Agata, Vincze, Veronika, Barbu Mititelu, Verginica, Bhatia, Archna, Buljan, Maja, Candito, Marie, Gantar, Polona, Giouli, Voula, Güngör, Tunga, Hawwari, Abdelati, Iñurrieta, Uxoa, Kovalevskaitė, Jolanta, Krek, Simon, Lichte, Timm, Liebeskind, Chaya, Monti, Johanna, Parra Escartín, Carla, QasemiZadeh, Behrang, Ramisch, Renata, Schneider, Nathan, Stoyanova, Ivelina, Vaidya, Ashwini, Walsh, Abigail, Aceta, Cristina, Aduriz, Itziar, Antoine, Jean-Yves, Arhar Holdt, Špela, Berk, Gözde, Bielinskienė, Agnė, Blagus, Goranka, Boizou, Loic, Bonial, Claire, Caruso, Valeria, Čibej, Jaka, Constant, Matthieu, Cook, Paul, Diab, Mona, Dimitrova, Tsvetana, Ehren, Rafael, Elbadrashiny, Mohamed, Elyovich, Hevi, Erden, Berna, Estarrona, Ainara, Fotopoulou, Aggeliki, Foufi, Vassiliki, Geeraert, Kristina, van Gompel, Maarten, Gonzalez, Itziar, Gurrutxaga, Antton, Ha-Cohen Kerner, Yaakov, Ibrahim, Rehab, Ionescu, Mihaela, Jain, Kanishka, Jazbec, Ivo-Pavao, Kavčič, Teja, Klyueva, Natalia, Kocijan, Kristina, Kovács, Viktória, Kuzman, Taja, Leseva, Svetlozara, Ljubešić, Nikola, Malka, Ruth, Markantonatou, Stella, Martínez Alonso, Héctor, Matas, Ivana, McCrae, John, de Medeiros Caseli, Helena, Onofrei, Mihaela, Palka-Binkiewicz, Emilia, Papadelli, Stella, Parmentier, Yannick, Pascucci, Antonio, Pasquer, Caroline, Pia di Buono, Maria, Puri, Vandana, Raffone, Annalisa, Ratori, Shraddha, Riccio, Anna, Sangati, Federico, Shukla, Vishakha, Simkó, Katalin, Šnajder, Jan, Somers, Clarissa, Srivastava, Shubham, Stefanova, Valentina, Taslimipoor, Shiva, Theoxari, Natasa, Todorova, Maria, Urizar, Ruben, Villavicencio, Aline, and Zilio, Leonardo
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- Multiword expressions, verbal multiword expressions, light-verb constructions, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
- Language:
- Bulgarian, German, Modern Greek (1453-), Spanish, Persian, French, Hebrew, Hungarian, Italian, Lithuanian, Polish, Portuguese, Romanian, Slovenian, Turkish, Hindi, Basque, English, and Croatian
- Description:
- This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). VMWEs were annotated according to the universal guidelines in 19 languages. The corpora are provided in the cupt format, inspired by the CONLL-U format. The corpora were used in the 1.1 edition of the PARSEME Shared Task (2018). For most languages, morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.1 (2018). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1
- Rights:
- PARSEME Shared Task Data (v. 1.1) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.1, and PUB
17. Aquila fecit :
- Type:
- text and sborníky jubilejní
- Subject:
- Dějiny Česka a Slovenska, Vorel, Petr,, historici čeští, jubilea životní, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, and Polish
- Rights:
- unknown
18. Aquila fecit :
- Type:
- text and sborníky jubilejní
- Subject:
- Dějiny Česka a Slovenska, Vorel, Petr,, historici čeští, jubilea životní, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, and Polish
- Rights:
- unknown
19. Archaeologia et historia urbana :
- Type:
- text and sborníky jubilejní
- Subject:
- Dějiny Evropy, Archeologie, Nawrolski, Tadeusz,, dějiny měst, archeologie středověku, zahraniční periodika a sborníky, and archeologie
- Language:
- Polish, English, and German
- Rights:
- unknown
20. Archaeologia et historia urbana :
- Type:
- text and sborníky jubilejní
- Subject:
- Dějiny Evropy, Archeologie, Nawrolski, Tadeusz,, dějiny měst, archeologie středověku, zahraniční periodika a sborníky, and archeologie
- Language:
- Polish, English, and German
- Rights:
- unknown
21. Archeologické rozhledy
- Type:
- model:periodicalitem and TEXT
- Language:
- Czech, German, English, and Polish
- Description:
- 4
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
22. Archeologický sborník :
- Publisher:
- Slezská univerzita v Opavě, Ústav archeologie,
- Type:
- sborníky jubilejní
- Subject:
- Archeologie, Janák, Vratislav,, sborníky jubilejní, archeologové, archeologie, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Polish, and Slovak
- Description:
- Obsahuje bibliografie and 100 výt.
- Rights:
- unknown
23. Archeologie barbarů 2006 :
- Type:
- text and sborníky konferenční
- Subject:
- Archeologie, archeologie, kultura laténská, doba římská, Germáni, české (československé) sborníky a kolektivní monografie, and české země v době římské, stěhování národů
- Language:
- English, Czech, German, Polish, and Slovak
- Description:
- Částečně anglický, německý, polský a slovenský text, německá resumé and Monografické č. seriálu: Archeologické výzkumy v jižních Čechách. Supplement ; 3 (2007)
- Rights:
- unknown
24. Architektura pojezuickiego kościoła św. św. Piotra i Pawła w Twardocicach /
- Creator:
- Pieczka, Michał
- Subject:
- řád, jezuité, architektura sakrální, kostely, sv. Petr a Pavel (patrocinium), světové dějiny 1648-1789, Polsko, and církevní architektura, hmotné památky, hřbitovy a poutní místa
- Language:
- Polish and English
- Description:
- Architecture of Post-Jesuit Church of Sts. Peter and Paul in Twardocice.
- Rights:
- unknown
25. Architektura świątyń jezuitskich na Śląsku. W kręgu biskupa Franciszka Ludwika von Neuburg /
- Creator:
- Baranowski, Andrzej Józef
- Subject:
- Pfalz-Neuburg, Franz Ludwig von,, řád, jezuité, kostely, architektura sakrální, biskupové, světové dějiny 1648-1789, Polsko, církevní architektura, hmotné památky, hřbitovy a poutní místa, and české země 1620-1740
- Language:
- Polish and English
- Description:
- The Architecture of Jesuit Temples in Silesia. Around Bishop Franz Ludwig von Pfalz-Neuburg.
- Rights:
- unknown
26. Arcidiecézní muzeum na Olomouckém hradě :
- Type:
- text and sborníky konferenční
- Subject:
- Výtvarné umění, muzea církevní, umění výtvarné, hrady, památky církevní, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, German, and Polish
- Rights:
- unknown
27. Arcidiecézní muzeum na Olomouckém hradě :
- Type:
- text and sborníky konferenční
- Subject:
- Výtvarné umění, muzea církevní, umění výtvarné, hrady, památky církevní, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, German, and Polish
- Rights:
- unknown
28. Arnošt z Pardubic (1297-1364) :
- Publisher:
- Univerzita Karlova ;, Univerzita Pardubice ;, and Uniwersytet Wrocławski,
- Type:
- sborníky konferenční
- Subject:
- Dějiny Česka a Slovenska, Arnošt,, arcibiskupové pražští, české (československé) sborníky a kolektivní monografie, české země 1306-1419, and jednotlivci (církevní dějiny)
- Language:
- Czech, English, and Polish
- Description:
- sborník z konference, Pardubice, Kladsko 22. - 24. 9. 2004
- Rights:
- unknown
29. Arnošt z Pardubic (1297-1364) :
- Type:
- text and sborníky konferenční
- Subject:
- Dějiny Česka a Slovenska, Arnošt,, arcibiskupové pražští, výročí, české (československé) sborníky a kolektivní monografie, české země 1306-1419, and jednotlivci (církevní dějiny)
- Language:
- Czech, English, and Polish
- Description:
- sborník z konference, Pardubice, Kladsko 22. - 24. 9. 2004
- Rights:
- unknown
30. Arnošt z Pardubic v dějinách střední Evropy =
- Creator:
- Bobková, Lenka,
- Type:
- text and úvodníky
- Subject:
- Dějiny Česka a Slovenska, Arnošt,, Karel, arcibiskupové pražští, konference mezinárodní, české a československé konference, kongresy, české země 1306-1419, and jednotlivci (církevní dějiny)
- Language:
- Czech, Polish, and English
- Description:
- Konference, Pardubice, Kladsko 22. - 24. 9. 2004
- Rights:
- unknown
31. Artem ad vitam :
- Type:
- text and sborníky jubilejní
- Subject:
- Výtvarné umění, Hlobil, Ivo,, historici umění, dějiny umění, české (československé) sborníky a kolektivní monografie, dějiny umění, mecenát, and přehledná zpracování dějin českých zemí (chronologicky)
- Language:
- Czech, English, German, and Polish
- Rights:
- unknown
32. Artem ad vitam :
- Type:
- text and sborníky jubilejní
- Subject:
- Výtvarné umění, Hlobil, Ivo,, historici umění, dějiny umění, české (československé) sborníky a kolektivní monografie, dějiny umění, mecenát, and přehledná zpracování dějin českých zemí (chronologicky)
- Language:
- Czech, English, German, and Polish
- Rights:
- unknown
33. Artystyczne przejawy działalności bractw jezuickich na Śląsku w czasach baroku /
- Creator:
- Mikołajek, Zuzanna
- Subject:
- řád, jezuité, dějiny umění, umění barokní, architektura barokní, grafika barokní, světové dějiny 1648-1789, Polsko, církevní řády a kongregace, náboženská bratrstva, kláštery, and dějiny umění, mecenát
- Language:
- Polish and English
- Description:
- Artistic Indications of Jesuits Confraternities' Activities in Silesia in Baroque Times.
- Rights:
- unknown
34. Baroko na Těšínsku. Ze sbírek Muzea Těšínska v České Těšíně /
- Creator:
- Pavlíková, Jiřina,
- Publisher:
- Muzeum Těšínska,
- Subject:
- umění barokní, sbírky muzejní, česká a československá muzea, galerie, expozice, české země 1526-1792, and dějiny umění, mecenát
- Language:
- Czech, English, German, and Polish
- Description:
- [souběžný český, polský, anglický a německý text]
- Rights:
- unknown
35. Bene scripsisti... :
- Type:
- text and sborníky jubilejní
- Subject:
- Filozofické systémy a hlediska, Sousedík, Stanislav,, sborníky jubilejní, filozofové čeští, jubilea životní, dějiny filozofie, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, German, and Polish
- Description:
- Část. anglický, německý a polský text
- Rights:
- unknown
36. Beskydy a Pobeskydí 1895-1939 /
- Publisher:
- Wart,
- Subject:
- dějiny regionální, fotografie dokumentární, pohlednice historické, dějiny osídlení, regionální dějiny, regionální a vlastivědná práce, and české země 1848-1918
- Language:
- Czech, English, German, and Polish
- Description:
- [Souběžný čes., něm., angl. a pol. text.]
- Rights:
- unknown
37. Bezčasí :
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny Česka a Slovenska, život každodenní, společnost česká, české (československé) sborníky a kolektivní monografie, Československo 1969-1989, and dějiny společnosti
- Language:
- Czech, English, Polish, and Slovak
- Rights:
- unknown
38. Bibliografický přehled českých národních písní: seznam studií, starších sbírek rukopisných, sbírek tištěných, překladů s vybranými ukázkami a podrobný abecední ukazatel písní, v knize uvedených i vůbec písní tiskem uveřejněných
- Creator:
- Čeněk Zíbrt and Česká akademie císaře Františka Josefa pro vědy, slovesnost a umění
- Publisher:
- Nákladem České akademie císaře Františka Josefa pro vědy, slovesnost a umění
- Format:
- print, svazek, and 326 stran.
- Type:
- model:monograph and TEXT
- Subject:
- Vokální hudba, Bibliografie. Katalogy, české lidové písně, historické prameny, Česko, 784.4(=162.3), (016), (437.3), 9, 12, 784, and 01
- Language:
- Czech, English, French, German, Italian, Latin, Polish, and Russian
- Description:
- sestavil Čeněk Zíbrt., Obsahuje rejstříky., Částečně souběžný anglický, francouzský, německý, italský, latinský, polský a ruský text, and Vydává III. třída České akademie císaře Františka Josefa pro vědy, slovesnost a umění v Praze
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
39. Broumovsko
- Creator:
- Záliš, Jan,
- Type:
- text and publikace fotografické
- Subject:
- Architektura, kostely, architektura sakrální, přehledná zpracování dějin českých zemí (chronologicky), and církevní architektura, hmotné památky, hřbitovy a poutní místa
- Language:
- Czech, English, German, and Polish
- Description:
- Texty k historii kostelů souběžně v českém, polském, anglickém a německém jazyce
- Rights:
- unknown
40. Bruntál :
- Publisher:
- Město Bruntál,
- Type:
- monografie kolektivní
- Subject:
- Dějiny Česka a Slovenska, dějiny měst, města, přehledná zpracování dějin českých zemí (chronologicky), and města, obce
- Language:
- Czech, English, German, and Polish
- Description:
- V tiráži jako autoři textu uvedeni: Josef Cepek ... et al.
- Rights:
- unknown
41. C4Corpus (CC BY-NC part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
42. C4Corpus (CC BY-NC-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB
43. C4Corpus (CC BY-NC-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
44. C4Corpus (CC BY-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
45. C4Corpus (CC BY-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
46. C4Corpus (CC-BY part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
47. C4Corpus (publicdomain part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Dutch, Norwegian, Polish, Portuguese, Russian, Slovenian, Somali, Spanish, Swahili (macrolanguage), Swedish, Tagalog, Thai, Turkish, Ukrainian, Undetermined, and Vietnamese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Public Domain Mark (PD), http://creativecommons.org/publicdomain/mark/1.0/, and PUB
48. Čechy jsou plné kostelů =
- Type:
- text and monografie kolektivní
- Subject:
- Výtvarné umění, Merhautová, Anežka,, sborníky, historici umění, jubilea životní, dějiny umění, architektura středověká, světové dějiny středověku (do r. 1492), české (československé) sborníky a kolektivní monografie, české země od příchodu Slovanů do roku 1306, české země 1306-1526, and dějiny umění, mecenát
- Language:
- Czech, English, French, German, and Polish
- Rights:
- unknown
49. Cele i zadania polonistyki uniwersyteckiej w czesko-polskim regionie przygranicznym /
- Creator:
- Muryc, Jiří,
- Subject:
- polonistika, univerzity, české a československé vědecké instituce a společnosti, vysoké školy, české země od r. 1993 do současnosti, and dějiny slavistiky
- Language:
- Polish and English
- Rights:
- unknown
50. Česká a slovenská slavistická komparatistika a wollmanovská tradice /
- Type:
- text and monografie kolektivní
- Subject:
- Filologie, Wollman, Frank,, slavisté, slavistika, lingvistika komparativní, komparatistika literární, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Polish, Russian, Slovak, and Ukrainian
- Description:
- Vychází ve spolupráci se Středoevropským centrem slovanských studií, Slavistickou společností Franka Wollmana a Ústavem slavistiky FF MU and Vydal Jan Sojnek - Galium
- Rights:
- unknown
51. Česko-polské kazatelské vztahy ve středověku /
- Type:
- text and sborníky
- Subject:
- Pastorální teologie, kazatelé, kázání, homiletika, vztahy česko-polské, středověk, české (československé) sborníky a kolektivní monografie, české země 1306-1419, české země 1419-1471, teologie, ikonografie, zbožnost, hagiografie, Polsko, and světové dějiny středověku (do r. 1492)
- Language:
- Czech, Latin, English, German, and Polish
- Description:
- "Editoři Krzysztof Bracha, Martin Nodl"--Obálka, Na obálce nad názvem: Centre for Medieval Studies - CMS, and Bohemian-Polish Preaching Relations in the Middle Ages: Introductory Reflection.
- Rights:
- unknown
52. Československá zahraniční politika po osvobození 1945 :
- Type:
- text, dokumenty, and edice
- Subject:
- Mezinárodní vztahy, světová politika, dějiny československé, vztahy mezinárodní, politika zahraniční, diplomacie, Československo 1945-1948, and zahraniční politika, mezinárodní vztahy
- Language:
- Czech, English, Croatian, Polish, and Slovak
- Rights:
- unknown
53. Československá zahraniční politika v roce 1943.
- Type:
- text, dokumenty, and edice
- Subject:
- Mezinárodní vztahy, světová politika, politika zahraniční, vztahy mezinárodní, vláda exilová, válka druhá světová (1939-1945), odboj druhý (protifašistický), Československo 1938-1945, and zahraniční politika, mezinárodní vztahy
- Language:
- Czech, English, French, Polish, Russian, and Slovak
- Description:
- Autentické dokumenty odhalující politické a diplomatické vztahy československé politické reprezentace k velmocím i dalším státům od počátku srpna do konce prosince roku 1943.
- Rights:
- unknown
54. Československo a krize demokracie ve střední Evropě ve 30. a 40. letech XX. století :
- Creator:
- Šedivý, Ivan,
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny Česka a Slovenska, vztahy mezinárodní, politika zahraniční, válka druhá světová (1939-1945), české (československé) sborníky a kolektivní monografie, Československo 1918-1992, and zahraniční politika, mezinárodní vztahy
- Language:
- Czech, English, Polish, and Slovak
- Description:
- V prelimináriích: editoři Ivan Šedivý ... et al., Masarykův ústav a archiv AV ČR, Historický ústav AV ČR, Ústav pro soudobé dějiny AV ČR, České křižovatky evropských dějin, České křižovatky v evropských dějinách 1918-1938-1948-1968, Obálkový název:České křižovatky evropských dějin - 1938, Obálkový název:1938: Československo a krize demokracie ve střední Evropě ve 30. a 40. letech XX. století - hledání východisek, 2. část cyklu: České křižovatky evropských dějin, Další název v tiráži: České křižovatky v evropských dějinách 1918-1938-1948-1968, Obálkový název: České křižovatky evropských dějin - 1938, and Název na doplňkové titulní stránce: 1938: Československo a krize demokracie ve střední Evropě ve 30. a 40. letech XX. století - hledání východisek
- Rights:
- unknown
55. Cesta k rozmanitosti, aneb, Kavárenský povaleč digitálním historikem středověku :
- Type:
- text, monografie kolektivní, and sborníky jubilejní
- Subject:
- Dějiny civilizace. Kulturní dějiny, Uhlíř, Zdeněk,, historici, dědictví kulturní, knihovníci, české (československé) sborníky a kolektivní monografie, and knihovnictví
- Language:
- Czech, English, German, Latin, and Polish
- Description:
- Obálkový název: Cesta k rozmanitosti
- Rights:
- unknown
56. Cesta k rozmanitosti, aneb, Kavárenský povaleč digitálním historikem středověku :
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny civilizace. Kulturní dějiny, Uhlíř, Zdeněk,, sborníky jubilejní, historici, dědictví kulturní, knihovníci, české (československé) sborníky a kolektivní monografie, teoretické a metodologické základy historie, and knihovnictví
- Language:
- Czech, English, German, Latin, and Polish
- Description:
- Obálkový název: Cesta k rozmanitosti
- Rights:
- unknown
57. Conclusions = :
- Creator:
- Mayer, Françoise,
- Type:
- text and studie
- Subject:
- Dějiny Evropy, povstání, antifašismus, paměť kolektivní, paměť historická, ideologie komunistická, strany politické komunistické, interpretace dějin, Československo 1945-1948, Československo 1948-1969, odboj, odpor, antifašismus, antikomunismus, Polsko, and světové dějiny od r. 1945 do současnosti
- Language:
- French, Slovak, Polish, and English
- Rights:
- unknown
58. CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data
- Creator:
- Zeman, Daniel and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- tokenization, word segmentation, morphology, tagging, syntax, parsing, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies This package contains the test data in the form in which they ware presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) Universal Dependencies 2.2 (CoNLL 2018) See the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ http://universaldependencies.org/conll18/ Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data conll18-ud-test-2018-05-06 ... CoNLL 2018 test data conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
59. CoNLL 2017 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Straka, Milan, Popel, Martin, Dozat, Timothy, Qi, Peng, Manning, Christopher, Shi, Tianze, Wu, Felix G., Chen, Xilun, Cheng, Yao, Björkelund, Anders, Falenska, Agnieszka, Yu, Xiang, Kuhn, Jonas, Che, Wanxiang, Guo, Jiang, Wang, Yuxuan, Zheng, Bo, Zhao, Huaipeng, Liu, Yang, Teng, Dechuan, Liu, Ting, Lim, Kyungtae, Poibeau, Thierry, Sato, Motoki, Manabe, Hitoshi, Noji, Hiroshi, Matsumoto, Yuji, Kırnap, Ömer, Önder, Berkay Furkan, Yuret, Deniz, Straková, Jana, Vania, Clara, Zhang, Xingxing, Lopez, Adam, Heinecke, Johannes, Asadullah, Munshi, Kanerva, Jenna, Luotolahti, Juhani, Ginter, Filip, Kuan, Yu, Sofroniev, Pavel, Schill, Erik, Hinrichs, Erhard, Nguyen, Dat Quoc, Dras, Mark, Johnson, Mark, Qian, Xian, Vilares, David, Gómez-Rodríguez, Carlos, Aufrant, Lauriane, Wisniewski, Guillaume, Yvon, François, Dumitrescu, Stefan Daniel, Boroş, Tiberiu, Tufiş, Dan, Das, Ayan, Zaffar, Affan, Sarkar, Sudeshna, Wang, Hao, Zhao, Hai, Zhang, Zhisong, Hornby, Ryan, Taylor, Clark, Park, Jungyeul, de Lhoneux, Miryam, Shao, Yan, Basirat, Ali, Kiperwasser, Eliyahu, Stymne, Sara, Goldberg, Yoav, Nivre, Joakim, Akkuş, Burak Kerim, Azizoglu, Heval, Cakici, Ruket, Moor, Christophe, Merlo, Paola, Henderson, James, Wang, Haozhou, Ji, Tao, Wu, Yuanbin, Lan, Man, de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, More, Amir, Tsarfaty, Reut, Kanayama, Hiroshi, Muraoka, Masayasu, Yoshikawa, Katsumasa, Garcia, Marcos, and Gamallo, Pablo
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency parser and parsebank
- Language:
- Arabic, Bulgarian, Russia Buriat, Czech, Catalan, Church Slavic, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, French, Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Swedish, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- This package contains the system outputs from the CoNLL 2017 Shared Task in Multilingual Parsing from Raw Text to Universal Dependencies.
- Rights:
- Licence Universal Dependencies v2.0, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.0, and PUB
60. CoNLL 2018 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Duthoo, Elie, Mesnard, Olivier, Rybak, Piotr, Wróblewska, Alina, Che, Wanxiang, Liu, Yijia, Wang, Yuxuan, Zheng, Bo, Liu, Ting, Li, Zuchao, He, Shexia, Zhang, Zhuosheng, Zhao, Hai, Wu, Yingting, Tong, Jia-Jun, Nguyen, Dat Quoc, Verspoor, Karin, Wan, Hui, Naseem, Tahira, Lee, Young-Suk, Castelli, Vittorio, Ballesteros, Miguel, Hershcovich, Daniel, Abend, Omri, Rappoport, Ari, Smith, Aaron, Bohnet, Bernd, de Lhoneux, Miryam, Nivre, Joakim, Shao, Yan, Stymne, Sara, Kırnap, Ömer, Dayanık, Erenay, Yuret, Deniz, Kanerva, Jenna, Ginter, Filip, Miekka, Niko, Leino, Akseli, Salakoski, Tapio, Lim, KyungTae, Park, Cheoneum, Lee, Changki, Poibeau, Thierry, Bhat, Riyaz Ahmad, Bhat, Irshad, Bangalore, Srinivas, Qi, Peng, Dozat, Timothy, Zhang, Yuhao, Manning, Christopher, Boroș, Tiberiu, Dumitrescu, Stefan Daniel, Burtica, Ruxandra, Arakelyan, Gor, Hambardzumyan, Karen, Khachatrian, Hrant, Rosa, Rudolf, Mareček, David, Straka, Milan, Seker, Amit, More, Amir, Tsarfaty, Reut, Önder, Berkay Furkan, Gümeli, Can, Jawahar, Ganesh, Muller, Benjamin, Fethi, Amal, Martin, Louis, Villemonte de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, Özateş, Şaziye Betül, Özgür, Arzucan, Gungor, Tunga, Öztürk, Balkız, Ji, Tao, Liu, Yufang, Wang, Yijun, Wu, Yuanbin, Lan, Man, Chen, Danlu, Lin, Mengxiao, Hu, Zhifeng, and Qiu, Xipeng
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- parsed data, conllu, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- Test data parsed by systems submitted to the CoNLL 2018 UD parsing shared task.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
61. Constans et perpetua voluntas :
- Publisher:
- Trnavská univerzita,
- Type:
- sborníky jubilejní
- Subject:
- Právo, Blaho, Peter,, právo, dějiny práva, and zahraniční periodika a sborníky
- Language:
- Slovak, Czech, German, English, Latin, and Polish
- Rights:
- unknown
62. Coreference in Universal Dependencies 0.1 (CorefUD 0.1)
- Creator:
- Nedoluzhko, Anna, Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, and Zeman, Daniel
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, Dutch, English, French, German, Hungarian, Lithuanian, Polish, Russian, and Spanish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 0.1 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. References to original resources whose harmonized versions are contained in the public edition of CorefUD 0.1: - Catalan-AnCora: Recasens, M. and Martí, M. A. (2010). AnCora-CO: Coreferentially Annotated Corpora for Spanish and Catalan. Language Resources and Evaluation, 44(4):315–345 - Czech-PCEDT: Nedoluzhko, A., Novák, M., Cinková, S., Mikulová, M., and Mírovský, J. (2016). Coreference in Prague Czech-English Dependency Treebank. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 169–176, Portorož, Slovenia. European Language Resources Association. - Czech-PDT: Hajič, J., Bejček, E., Hlaváčová, J., Mikulová, M., Straka, M., Štěpánek, J., and Štěpánková, B. (2020). Prague Dependency Treebank - Consolidated 1.0. In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pages 5208–5218, Marseille, France. European Language Resources Association. - English-GUM: Zeldes, A. (2017). The GUM Corpus: Creating Multilayer Resources in the Classroom. Language Resources and Evaluation, 51(3):581–612. - English-ParCorFull: Lapshinova-Koltunski, E., Hardmeier, C., and Krielke, P. (2018). ParCorFull: a Parallel Corpus Annotated with Full Coreference. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association. - French-Democrat: Landragin, F. (2016). Description, modélisation et détection automatique des chaı̂nes de référence (DEMOCRAT). Bulletin de l’Association Française pour l’Intelligence Artificielle, (92):11–15. - German-ParCorFull: Lapshinova-Koltunski, E., Hardmeier, C., and Krielke, P. (2018). ParCorFull: a Parallel Corpus Annotated with Full Coreference. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association - German-PotsdamCC: Bourgonje, P. and Stede, M. (2020). The Potsdam Commentary Corpus 2.2: Extending annotations for shallow discourse parsing. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 1061–1066, Marseille, France. European Language Resources Association. - Hungarian-SzegedKoref: Vincze, V., Hegedűs, K., Sliz-Nagy, A., and Farkas, R. (2018). SzegedKoref: A Hungarian Coreference Corpus. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association. - Lithuanian-LCC: Žitkus, V. and Butkienė, R. (2018). Coreference Annotation Scheme and Corpus for Lithuanian Language. In Fifth International Conference on Social Networks Analysis, Management and Security, SNAMS 2018, Valencia, Spain, October 15-18, 2018, pages 243–250. IEEE. - Polish-PCC: Ogrodniczuk, M., Glowińska, K., Kopeć, M., Savary, A., and Zawisławska, M. (2013). Polish coreference corpus. In Human Language Technology. Challenges for Computer Science and Linguistics - 6th Language and Technology Conference, LTC 2013, Poznań, Poland, December 7-9, 2013. Revised Selected Papers, volume 9561 of Lecture Notes in Computer Science, pages 215–226. Springer. - Russian-RuCor: Toldova, S., Roytberg, A., Ladygina, A. A., Vasilyeva, M. D., Azerkovich, I. L., Kurzukov,M., Sim, G., Gorshkov, D. V., Ivanova, A., Nedoluzhko, A., and Grishina, Y. (2014). Evaluating Anaphora and Coreference Resolution for Russian. In Komp’juternaja lingvistika i intellektual’nye tehnologii. Po materialam ezhegodnoj Mezhdunarodnoj konferencii Dialog, pages 681–695. - Spanish-AnCora: Recasens, M. and Martí, M. A. (2010). AnCora-CO: Coreferentially Annotated Corpora for Spanish and Catalan. Language Resources and Evaluation, 44(4):315–345 References to original resources whose harmonized versions are contained in the ÚFAL-internal edition of CorefUD 0.1: - Dutch-COREA: Hendrickx, I., Bouma, G., Coppens, F., Daelemans, W., Hoste, V., Kloosterman, G., Mineur, A.-M., Van Der Vloet, J., and Verschelde, J.-L. (2008). A coreference corpus and resolution system for Dutch. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08), Marrakech, Morocco. European Language Resources Association. - English-ARRAU: Uryupina, O., Artstein, R., Bristot, A., Cavicchio, F., Delogu, F., Rodriguez, K. J., and Poesio, M. (2020). Annotating a broad range of anaphoric phenomena, in a variety of genres: the ARRAU Corpus. Natural Language Engineering, 26(1):95–128. - English-OntoNotes: Weischedel, R., Hovy, E., Marcus, M., Palmer, M., Belvin, R., Pradhan, S., Ramshaw, L., and Xue, N. (2011). Ontonotes: A large training corpus for enhanced processing. In Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, pages 54–63, New York. Springer-Verlag. - English-PCEDT: Nedoluzhko, A., Novák, M., Cinková, S., Mikulová, M., and Mírovský, J. (2016). Coreference in Prague Czech-English Dependency Treebank. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), pages 169–176, Portorož, Slovenia. European Language Resources Association.
- Rights:
- Licence CorefUD v0.1, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-0.1, and PUB
63. Coreference in Universal Dependencies 0.2 (CorefUD 0.2)
- Creator:
- Nedoluzhko, Anna, Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, and Zeman, Daniel
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, Dutch, English, French, German, Hungarian, Lithuanian, Polish, Russian, and Spanish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 0.2 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Version 0.2 consists of exactly the same datasets as the version 0.1. All automatically parsed datasets were re-parsed for v0.2 using UDPipe 2 with models trained on UD 2.6. Catalan-AnCora, Spanish-AnCora and English-GUM have been updated to match the their UD 2.9 versions.
- Rights:
- Licence CorefUD v0.2, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-0.2, and PUB
64. Coreference in Universal Dependencies 1.0 (CorefUD 1.0)
- Creator:
- Nedoluzhko, Anna, Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, Zeldes, Amir, Zeman, Daniel, Bourgonje, Peter, Cinková, Silvie, Hajič, Jan, Hardmeier, Christian, Krielke, Pauline, Landragin, Frédéric, Lapshinova-Koltunski, Ekaterina, Martí, M. Antònia, Mikulová, Marie, Ogrodniczuk, Maciej, Recasens, Marta, Stede, Manfred, Straka, Milan, Toldova, Svetlana, Vincze, Veronika, and Žitkus, Voldemaras
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, Dutch, English, French, German, Hungarian, Lithuanian, Polish, Russian, and Spanish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.0 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Version 1.0 consists of the same corpora and languages as the previous version 0.2; however, the English GUM dataset has been updated to a newer and larger version, and in the Czech/English PCEDT dataset, the train-dev-test split has been changed to be compatible with OntoNotes. Nevertheless, the main change is in the file format (the MISC attributes have new form and interpretation).
- Rights:
- Licence CorefUD v0.2, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-0.2, and PUB
65. Coreference in Universal Dependencies 1.1 (CorefUD 1.1)
- Creator:
- Novák, Michal, Popel, Martin, Žabokrtský, Zdeněk, Zeman, Daniel, Nedoluzhko, Anna, Acar, Kutay, Bourgonje, Peter, Cinková, Silvie, Cebiroğlu Eryiğit, Gülşen, Hajič, Jan, Hardmeier, Christian, Haug, Dag, Jørgensen, Tollef, Kåsen, Andre, Krielke, Pauline, Landragin, Frédéric, Lapshinova-Koltunski, Ekaterina, Mæhlum, Petter, Martí, M. Antònia, Mikulová, Marie, Nøklestad, Anders, Ogrodniczuk, Maciej, Øvrelid, Lilja, Pamay Arslan, Tuğba, Recasens, Marta, Solberg, Per Erik, Stede, Manfred, Straka, Milan, Toldova, Svetlana, Vadász, Noémi, Velldal, Erik, Vincze, Veronika, Zeldes, Amir, and Žitkus, Voldemaras
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency, treebank, coreference, bridging relations, and harmonized annotation
- Language:
- Catalan, Czech, English, French, German, Hungarian, Lithuanian, Norwegian, Polish, Russian, Spanish, and Turkish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.1 consists of 21 datasets for 13 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 17 datasets for 12 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 2 for Hungarian, 1 for Lithuanian, 2 for Norwegian, 1 for Polish, 1 for Russian, 1 for Spanish, and 1 for Turkish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Compared to the previous version 1.0, the version 1.1 comprises new languages and corpora, namely Hungarian-KorKor, Norwegian-BokmaalNARC, Norwegian-NynorskNARC, and Turkish-ITCC. In addition, the English GUM dataset has been updated to a newer and larger version, and the conversion pipelines for most datasets have been refined (a list of all changes in each dataset can be found in the corresponding README file).
- Rights:
- Licence CorefUD v1.1, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-1.1, and PUB
66. Coreference in Universal Dependencies 1.2 (CorefUD 1.2)
- Creator:
- Popel, Martin, Novák, Michal, Žabokrtský, Zdeněk, Zeman, Daniel, Nedoluzhko, Anna, Acar, Kutay, Bamman, David, Bourgonje, Peter, Cinková, Silvie, Eckhoff, Hanne, Cebiroğlu Eryiğit, Gülşen, Hajič, Jan, Hardmeier, Christian, Haug, Dag, Jørgensen, Tollef, Kåsen, Andre, Krielke, Pauline, Landragin, Frédéric, Lapshinova-Koltunski, Ekaterina, Mæhlum, Petter, Martí, M. Antònia, Mikulová, Marie, Nøklestad, Anders, Ogrodniczuk, Maciej, Øvrelid, Lilja, Pamay Arslan, Tuğba, Recasens, Marta, Solberg, Per Erik, Stede, Manfred, Straka, Milan, Swanson, Daniel, Toldova, Svetlana, Vadász, Noémi, Velldal, Erik, Vincze, Veronika, Zeldes, Amir, and Žitkus, Voldemaras
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- coreference, bridging relations, harmonized annotation, dependency, and treebank
- Language:
- Ancient Greek (to 1453), Ancient Hebrew, Catalan, Czech, English, French, German, Hungarian, Lithuanian, Norwegian, Church Slavic, Polish, Russian, Spanish, and Turkish
- Description:
- CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.2 consists of 25 datasets for 16 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 21 datasets for 15 languages (1 dataset for Ancient Greek, 1 for Ancient Hebrew, 1 for Catalan, 2 for Czech, 3 for English, 1 for French, 2 for German, 2 for Hungarian, 1 for Lithuanian, 2 for Norwegian, 1 for Old Church Slavonic, 1 for Polish, 1 for Russian, 1 for Spanish, and 1 for Turkish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource, too. Compared to the previous version 1.1, the version 1.2 comprises new languages and corpora, namely Ancient_Greek-PROIEL, Ancient_Hebrew-PTNK, English-LitBank, and Old_Church_Slavonic-PROIEL. In addition, English-GUM and Turkish-ITCC have been updated to newer versions, conversion of zeros in Polish-PCC has been improved, and the conversion pipelines for multiple other datasets have been refined (a list of all changes in each dataset can be found in the corresponding README file).
- Rights:
- Licence CorefUD v1.2, https://lindat.mff.cuni.cz/repository/xmlui/page/license-corefud-1.2, and PUB
67. CorPipe 23 multilingual CorefUD 1.1 model (corpipe23-corefud1.1-231206)
- Creator:
- Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- tool and toolService
- Subject:
- coreference resolution, CorPipe, and CorefUD
- Language:
- Catalan, Czech, German, English, Spanish, French, Hungarian, Lithuanian, Norwegian Bokmål, Norwegian Nynorsk, Polish, Russian, and Turkish
- Description:
- The `corpipe23-corefud1.1-231206` is a `mT5-large`-based multilingual model for coreference resolution usable in CorPipe 23 (https://github.com/ufal/crac2023-corpipe). It is released under the CC BY-NC-SA 4.0 license. The model is language agnostic (no _corpus id_ on input), so it can be used to predict coreference in any `mT5` language (for zero-shot evaluation, see the paper). However, note that the empty nodes must be present already on input, they are not predicted (the same settings as in the CRAC23 shared task).
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
68. CorpusExplorer
- Creator:
- Rüdiger, Jan Oliver
- Publisher:
- Jan Oliver Rüdiger
- Type:
- tool and toolService
- Subject:
- Corpus Linguisitics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
- Language:
- German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
- Description:
- Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code available at https://github.com/notesjor/corpusexplorer2.0
- Rights:
- Not specified
69. CUBBITT Translation Models (en-pl) (v1.0)
- Creator:
- Popel, Martin, Tomková, Markéta, Tomek, Jakub, Kaiser, Łukasz, Uszkoreit, Jakob, Bojar, Ondřej, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- tool and toolService
- Subject:
- machine translation, neural machine translation, transformer, and cubbitt
- Language:
- English and Polish
- Description:
- CUBBITT En-Pl translation models, exported via TensorFlow Serving, available in the Lindat translation service (https://lindat.mff.cuni.cz/services/translation/). Models are compatible with Tensor2tensor version 1.6.6. For details about the model training (data, model hyper-parameters), please contact the archive maintainer. Evaluation on newstest2020 (BLEU): en->pl: 12.3 pl->en: 20.0 (Evaluated using multeval: https://github.com/jhclark/multeval)
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
70. Cyrilometodějská teologická fakulta Univezity Palackého v Olomouci v letech 1990-2010. 20 let od jejího obnovení. /
- Type:
- text
- Subject:
- sborníky, univerzity moravské, fakulty teologické, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, Polish, German, and English
- Description:
- Gründung und Anfänge der Universität in Olomouc.
- Rights:
- unknown
71. Czeskie badania nad Polską w kontekście Europy Środkowej i Wschodniej /
- Creator:
- Baron, Roman,
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny zemí střední Evropy, polonistika, polonisté, instituce vědecké, české (československé) sborníky a kolektivní monografie, Československo 1918-1992, české země od r. 1993 do současnosti, and dějiny slavistiky
- Language:
- Polish and English
- Rights:
- unknown
72. Czeskie badania nad Polską w kontekście Europy Środkowej i Wschodniej /
- Creator:
- Baron, Roman,
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny zemí střední Evropy, polonistika, polonisté, instituce vědecké, české (československé) sborníky a kolektivní monografie, Československo 1918-1992, české země od r. 1993 do současnosti, and dějiny slavistiky
- Language:
- Polish and English
- Rights:
- unknown
73. Czesław Miłosz :
- Type:
- text and sborníky konferenční
- Subject:
- Polská literatura (o ní), Miłosz, Czesław,, české (československé) sborníky a kolektivní monografie, Polsko, světové dějiny od r. 1918 do současnosti, and literatura, spisovatelé
- Language:
- Polish, Czech, and English
- Description:
- Sborník ze stejnojmenné konference konané na Filozofické fakultě Ostravské univerzity, Polský, český a anglický a text, and 250 výt.
- Rights:
- unknown
74. DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking
- Creator:
- Kubeša, David and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- entity linking, NEL, NER, dataset, and knowledge base
- Language:
- Afrikaans, Arabic, Armenian, Basque, Belarusian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Maltese, Marathi, Modern Greek (1453-), Northern Sami, Norwegian Nynorsk, Persian, Polish, Portuguese, Romanian, Russian, Scottish Gaelic, Serbian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, Uighur, Ukrainian, Urdu, Vietnamese, and Wolof
- Description:
- We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains language-agnostic information about entities, including their claims from Wikidata and named entity types (PER, ORG, LOC, EVENT, BRAND, WORK_OF_ART, MANUFACTURED); and Wikipedia texts with entity mentions linked to the knowledge base, along with language-specific text from Wikidata such as labels, aliases, and descriptions, stored separately for each language. The Wikidata QID is used as a persistent, language-agnostic identifier, enabling the combination of the knowledge base with language-specific texts and information for each entity. Wikipedia documents deliberately annotate only a single mention for every entity present; we further automatically detect all mentions of named entities linked from each document. The dataset contains 27.9M named entities in the knowledge base and 12.3G tokens from Wikipedia texts. The dataset is published under the CC BY-SA licence.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
75. De rerum humanarum emendatione consultatio catholica a odkaz Jana Amosa Komenského pre tretie tisícročie
- Type:
- text and prameny
- Subject:
- Věda. Všeobecnosti. Základy vědy a kultury. Vědecká práce, Komenský, Jan Amos,, spisy, komeniana, zahraniční periodika a sborníky, české země 1620-1740, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Latin, Slovak, Czech, German, English, and Polish
- Description:
- "Zborník materiálov z medzinárodnej konferencie, konanej v Bratislave v dňoch 13. a 14. novembra 2000"--S. [1]
- Rights:
- unknown
76. De rerum humanarum emendatione consultatio catholica a odkaz Jana Amosa Komenského pre tretie tisícročie /
- Type:
- text and sborníky konferenční
- Subject:
- Přirozená teologie. Náboženská filozofie, Komenský, Jan Amos,, komeniana, dějiny vědy, umění, kultury a techniky, kulturní vztahy, české země 1620-1740, and zahraniční periodika a sborníky
- Language:
- Slovak, Czech, German, English, Spanish, and Polish
- Description:
- Na s. 1 pozn.: Zborník materiálov z medzinárodnej konferencie, konanej v Bratislave v dňoch 13. a 14. novembra 2000
- Rights:
- unknown
77. Deep Universal Dependencies 2.4
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, and Galician
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-2988). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.4, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.4, and PUB
78. Deep Universal Dependencies 2.5
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, and Skolt Sami
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3105). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.5, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.5, and PUB
79. Deep Universal Dependencies 2.6
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, and Persian
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3226). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.6, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.6, and PUB
80. Deep Universal Dependencies 2.7
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, and Tupinambá
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3424). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.7, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.7, and PUB
81. Deep Universal Dependencies 2.8
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, Tupinambá, Beja, Western Frisian, Urubú-Kaapor, Kangri, K'iche', Low German, Makuráp, Western Armenian, and Central Siberian Yupik
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3687). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.8, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.8, and PUB
82. Deltacorpus
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
83. Deltacorpus 1.1
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia). Changes in version 1.1: 1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset. 2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0. 3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
84. Der Holocaust in den mitteleuropäischen Literaturen und Kulturen seit 1989 =
- Type:
- text and sborníky konferenční
- Subject:
- Literatura (teorie), holocaust, náměty literární, zahraniční periodika a sborníky, Československo 1989-1992, české země od r. 1993 do současnosti, světové dějiny od r. 1945 do současnosti, literatura, spisovatelé, and dějiny umění, mecenát
- Language:
- German, Czech, English, and Polish
- Rights:
- unknown
85. Drobné památky na Trutnovsku :
- Creator:
- Fiedler, Günter,
- Type:
- text, statický obraz, and průvodce
- Subject:
- Architektura, památky drobné, památky sakrální, boží muka, kříže, sochy, and jednotlivé památky, památkové rezervace
- Language:
- Czech, English, German, and Polish
- Description:
- Název přílohy: Mapa polsko-czeskiego pogranicza : część zachodnia = Mapa polsko-českého příhraničí : západní část, Název v tiráži: I malé je krásné : Trutnovsko a oblast Lubawky = I male jest piękne : region Trutnowa i Lubawki = Auch klein ist schön : Region von Trutnov (Trautenau) und Lubawka (Liebau) = Even the small is nice : Region Trutnov and Lubawka, Souběžný obálkový název: Małe zabytki w Trutnovie i okolicach, and Souběžný obálkový název: Small monuments in the Trutnov Region
- Rights:
- unknown
86. Duchovní člověk a intelektuál
- Creator:
- Jan Patočka
- Publisher:
- Ed. I. Chvatík a J. Polívka. Str. 197–212. [Přepis mgf. záznamu soukromé přednášky z 11. 4. 1975.] — 2. otisk in: Souvislosti 1 (1990), č. 1, str. 9–17. — 3. otisk in: Péče o duši III (SS-3/PD-III), Praha 2002, str. 355–371 (v. 2002/1).
- Type:
- Text
- Subject:
- 1977/5, 1977/7, 1988, 1990/6, 1998/3, 1999/1, 2002/1, 2007/18, 2007/7, cs, de, en, es, fr, fulltext, it, pl, Přepis mgf. záznamu, and SS-3/PD-III
- Language:
- English, French, Italian, German, Polish, Spanish, and Czech
- Rights:
- open access and Rights holder: Archiv Jana Patočky, z.s.
87. Dům, palác a zámek v hmotné kultuře Slezska =
- Creator:
- Jež, Radim,
- Type:
- text and monografie kolektivní
- Subject:
- Architektura, zámky, paláce, domy měšťanské, kultura hmotná, kultura bydlení, přehledná zpracování dějin českých zemí (chronologicky), hrady, hradiště, zámky, tvrze, dvory, architektura, architekti, and hmotná kultura, umělecká řemesla
- Language:
- Czech, English, and Polish
- Rights:
- unknown
88. Dvory a rezidence ve středověku.
- Type:
- text and sborníky
- Subject:
- Genealogie. Heraldika. Šlechta. Vlajky, dvory, rezidence, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, French, German, Latin, and Polish
- Description:
- Příspěvky z 2. kolokvia konaného 18.-19. října 2007, které uspořádal Historický ústav Akademie věd České republiky ve spolupráci s Archivem hlavního města Prahy a Ústavem českých dějin Filozofické fakulty Univerzity Karlovy
- Rights:
- unknown
89. Dvory a rezidence ve středověku.
- Type:
- text and sborníky
- Subject:
- Genealogie. Heraldika. Šlechta. Vlajky, dvory, rezidence, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, French, German, Latin, and Polish
- Description:
- Příspěvky z 2. kolokvia konaného 18.-19. října 2007, které uspořádal Historický ústav Akademie věd České republiky ve spolupráci s Archivem hlavního města Prahy a Ústavem českých dějin Filozofické fakulty Univerzity Karlovy
- Rights:
- unknown
90. Dzieje biblioteki jezuickiej w Nysie. Jej wystrój i wyposażenie /
- Creator:
- Werszler, Rafał
- Subject:
- řád, jezuité, knihovny polské, dějiny knihoven, koleje jezuitské, výzdoba umělecká, and zahraniční knihovnictví
- Language:
- Polish and English
- Description:
- History of Jesuit Library in Nysa. Its Decor and Furnishings.
- Rights:
- unknown
91. Dzieła warsztatu Johanna Albrechta Siegwitza wykonane dla jezuitów na Śląsku, w hrabstwie kłodzkim i Rzeczypospolitej /
- Creator:
- Kolbiarz, Artur
- Subject:
- Siegwitz, Johann Albrecht,, řád, jezuité, sochařství barokní, sochaři barokní, kostely, světové dějiny 1648-1789, Polsko, and sochařství, sochaři, řezbářství
- Language:
- Polish and English
- Description:
- Artistic Works of Johann Albrecht Siegwitz's Workshop Made for Jesuits in Silesia, Kłodzko County and in Poland.
- Rights:
- unknown
92. Epigraphica & Sepulcralia.
- Type:
- text and sborníky
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, epigrafika, památky sepulkrální, dějiny umění, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Slovak, and Polish
- Rights:
- unknown
93. Epigraphica & Sepulcralia.
- Type:
- text and sborníky
- Subject:
- Historická věda. Pomocné vědy historické. Archivnictví, epigrafika, památky sepulkrální, dějiny umění, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Slovak, and Polish
- Description:
- Referáty z 12. zasedání k problematice sepulkrálních památek "Justorum autem animae in manu dei sunt" v Praze 31. 10. - 1. 11. 2013 a ze 13. zasedání "O mors, quam amara est memoria tua" v Praze 30. - 31. 10. 2014
- Rights:
- unknown
94. Europeica - Slavica - Baltica :
- Type:
- text and sborníky jubilejní
- Subject:
- Filologie, Marvan, Jiří,, slavisté, baltisté, slavistika, baltistika, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Lithuanian, Polish, and Slovak
- Description:
- "... obsahově vychází ze sympozia Balto-slavica na prahu století, které ... uspořádala pražská Slovanská knihovna a Ústav slavistických a východoevropských studií Filozofické fakulty Univerzity Karlovy dne 19. dubna 2006"--Úvod
- Rights:
- unknown
95. Evropa a evropské dědictví do konce 19. století
- Creator:
- Jan Patočka
- Publisher:
- Str. 132–159. Stať. [Věnován o F. Fajfrovi k 80. narozeninám 1972 a B. Komárkové k 70. narozeninám 1973.]
- Type:
- Text
- Subject:
- 1975, 1979/25, 1981/6, 1981/7, 1988/28, 1988/31, 1988/32, 1988/33, 1988/34, 1994/7, 1996/4, 1996/7, 1998/3, 1999/8, 2001/9, 2002/21, 2006/1, 2007/1, 2008/3, be, bg, cs, de, en, es, fr, fulltext, hu, I/1979, it, lt, no, pl, ru, SS-3/PD-III, sv, and uk
- Language:
- Czech, English, Bulgarian, French, Italian, Lithuanian, Hungarian, German, Norwegian, Polish, Russian, Belarusian, Spanish, Swedish, and Ukrainian
- Rights:
- open access and Rights holder: Archiv Jana Patočky, z.s.
96. Extended CLEF eHealth 2013-2015 IR Test Collection
- Creator:
- Pecina, Pavel and Saleh, Shadi
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- cross-lingual information retrieval and machine translation
- Language:
- English, Czech, French, German, Hungarian, Polish, Spanish, and Swedish
- Description:
- This package contains an extended version of the test collection used in the CLEF eHealth Information Retrieval tasks in 2013--2015. Compared to the original version, it provides complete query translations into Czech, French, German, Hungarian, Polish, Spanish and Swedish and additional relevance assessment.
- Rights:
- Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
97. Filiální kostel sv. Marka v Karviné-Fryštátě. = Succursal Church of St. Mark in Karviná-Fryštát. = Filiale St. Markus Kirche in Karviná-Fryštát. = Kościół filialny św. Marka w Karwinie-Frysztacie /
- Creator:
- Rebrová, Alexandra,
- Publisher:
- Městský úřad v Karviné,
- Subject:
- kostely, sv. Marek (patrocinium), architektura sakrální, památky stavební, církevní architektura, hmotné památky, hřbitovy a poutní místa, jednotlivé památky, památkové rezervace, přehledná zpracování dějin českých zemí (chronologicky), and architektura, architekti
- Language:
- Czech, English, German, and Polish
- Description:
- [Souběžný text v češtině, angl., něm. a pol.]
- Rights:
- unknown
98. Filologia polska pomiędzy - polonistyka w Ołomuńcu /
- Creator:
- Hanczakowski, Michał
- Subject:
- polonistika, univerzity, české a československé vědecké instituce a společnosti, vysoké školy, české země od r. 1993 do současnosti, and dějiny slavistiky
- Language:
- Polish and English
- Rights:
- unknown
99. Folia numismatica :
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, numizmatika, medaile, and česká periodika
- Language:
- Czech, English, German, and Polish
- Rights:
- unknown
100. Folklor: podrecznik dla zajmujacych sie ludoznawstwem
- Creator:
- Gomme, George Laurence, Szukiewicz, Wojciech, Eljasz-Radzikowski, Stanisław, and Towarzystwo Ludoznawcze we Lwowie
- Publisher:
- Skład Głowny
- Format:
- print and x, 183 s.
- Type:
- model:monograph and TEXT
- Subject:
- Kulturní antropologie. Etnologie. Etnografie, etnografie, folklor, folkloristika, Polsko, 39, 398, (438), (048.8), and 1
- Language:
- Polish and English
- Description:
- ułożył George Laurence Gomme ; przetłómaczyl z angielskiego Wojciech Szukiewicz ; opatrzył przedmową i wydał St. Eljasz-Radzikowski. and KČSN
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
- « Previous
- Next »
- 1
- 2
- 3
- 4