Search Results
2. 120 let meteorologických měření a pozorování na Lysé hoře :
- Type:
- text and sborníky konferenční
- Subject:
- Geologie. Meteorologie. Klimatologie, meteorologie, pozorování meteorologická, Československo 1918-1992, české země od r. 1993 do současnosti, české země 1848-1918, and vědy o neživé přírodě, přírodní prostředí, astronomie
- Language:
- Czech, Polish, and Slovak
- Rights:
- unknown
3. 1939 :
- Type:
- text and monografie kolektivní
- Subject:
- Mezinárodní vztahy, světová politika, válka druhá světová (1939-1945), dějiny československé, české (československé) sborníky a kolektivní monografie, Československo 1938-1945, Slovensko 1939-1945, and politické dějiny, politici
- Language:
- Slovak, Czech, and Polish
- Rights:
- unknown
4. 1939 :
- Type:
- text and monografie kolektivní
- Subject:
- Mezinárodní vztahy, světová politika, válka druhá světová (1939-1945), dějiny československé, české (československé) sborníky a kolektivní monografie, Československo 1938-1945, Slovensko 1939-1945, and politické dějiny, politici
- Language:
- Slovak, Czech, and Polish
- Rights:
- unknown
5. Acta historico-iuridica Pilsnensia 2009-2010 /
6. Acta onomastica
- Type:
- text and sborníky
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, Russian, English, Slovak, and Polish
- Rights:
- unknown
7. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, German, English, Slovak, and Polish
- Rights:
- unknown
8. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, German, English, Slovak, and Polish
- Rights:
- unknown
9. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, Russian, English, Slovak, and Polish
- Rights:
- unknown
10. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, German, English, Slovak, and Polish
- Rights:
- unknown
11. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, English, Polish, and Slovak
- Rights:
- unknown
12. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, English, Polish, and Slovak
- Rights:
- unknown
13. Adam Mickiewicz: Texty a kontexty :
- Type:
- text and sborníky
- Subject:
- Polská literatura (o ní), Mickiewicz, Adam,, konference mezinárodní, spisovatelé polští, české (československé) sborníky a kolektivní monografie, Polsko, světové dějiny 1789-1918, and literatura, spisovatelé
- Language:
- Czech, Polish, and Slovak
- Rights:
- unknown
14. Aktuální otázky slovanské filologie a Šafaříkův vědecký odkaz /
- Type:
- text and sborníky
- Subject:
- Filologie, Šafařík, Pavel Josef,, slavistika, slavisté, filologie slovanská, české (československé) sborníky a kolektivní monografie, české země 1792-1918, and dějiny slavistiky
- Language:
- Czech, English, German, Italian, Polish, Russian, and Slovak
- Description:
- Offprint from the journal Slavia 65 (1996), issue 1, pp. 1-162
- Rights:
- unknown
15. Archeologický sborník :
- Publisher:
- Slezská univerzita v Opavě, Ústav archeologie,
- Type:
- sborníky jubilejní
- Subject:
- Archeologie, Janák, Vratislav,, sborníky jubilejní, archeologové, archeologie, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Polish, and Slovak
- Description:
- Includes bibliographies and 100 copies printed
- Rights:
- unknown
16. Archeologický výzkum krajiny a aplikace ICT :
- Type:
- text and sborníky konferenční
- Subject:
- Archeologie, archeologie nedestruktivní, krajina historická, and archeologie
- Language:
- Czech, Polish, and Slovak
- Rights:
- unknown
17. Archeologický výzkum krajiny a aplikace ICT :
- Type:
- text and sborníky konferenční
- Subject:
- Archeologie, archeologie nedestruktivní, krajina historická, and archeologie
- Language:
- Czech, Polish, and Slovak
- Rights:
- unknown
18. Archeologie barbarů 2006 :
- Type:
- text and sborníky konferenční
- Subject:
- Archeologie, archeologie, kultura laténská, doba římská, Germáni, české (československé) sborníky a kolektivní monografie, and české země v době římské, stěhování národů
- Language:
- English, Czech, German, Polish, and Slovak
- Description:
- Text partly in English, German, Polish, and Slovak, German summaries, and Monographic issue of the serial: Archeologické výzkumy v jižních Čechách. Supplement ; 3 (2007)
- Rights:
- unknown
19. Bardejov
- Publisher:
- Vojenský zeměpisný ústav
- Format:
- map and 1 map : color ; 39 x 50 cm on sheet 47 x 63 cm
- Type:
- model:map, cartographic, and IMAGE
- Subject:
- udc:913(4), Konspekt:7, udc:912, udc:913(437.6), udc:912.43, udc:(084.3), Konspekt:Geografie Evropy, reálie, cestování, Konspekt:Mapy. Atlasy. Glóby, and czenas:Bardejov (Slovensko : oblast)
- Language:
- Czech, Slovak, and Polish
- Description:
- 4266, Edition by sheet layout, and (Language) Place names in Slovak and Polish
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
20. Bezčasí :
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny Česka a Slovenska, život každodenní, společnost česká, české (československé) sborníky a kolektivní monografie, Československo 1969-1989, and dějiny společnosti
- Language:
- Czech, English, Polish, and Slovak
- Rights:
- unknown
21. Bibliotheca Antiqua 2022 :
- Type:
- text, statický obraz, and sborníky konferenční
- Subject:
- Rukopisy, prvotisky, staré tisky. Vzácná a pozoruhodná díla, dějiny knihoven, dějiny knihy, knihovnictví, knihtisk, kultura knižní, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, Slovak, and Polish
- Rights:
- unknown
22. Bitka pri Moháči - historický medzník v dejinách strednej Európy (490. výročie) :
- Type:
- text and sborníky konferenční
- Subject:
- Vojenství. Obrana země. Ozbrojené síly, bitva u Moháče (1526), války turecké, and zahraniční periodika a sborníky
- Language:
- Slovak, Czech, German, Hungarian, Latin, and Polish
- Rights:
- unknown
23. Bitka pri Moháči - historický medzník v dejinách strednej Európy (490. výročie) :
- Type:
- text and sborníky konferenční
- Subject:
- Vojenství. Obrana země. Ozbrojené síly, bitva u Moháče (1526), války turecké, and zahraniční periodika a sborníky
- Language:
- Slovak, Czech, German, Hungarian, Latin, and Polish
- Rights:
- unknown
24. C4Corpus (CC BY-NC part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
25. C4Corpus (CC BY-NC-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB
26. C4Corpus (CC BY-NC-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
27. C4Corpus (CC BY-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0), http://creativecommons.org/licenses/by-nd/4.0/, and PUB
28. C4Corpus (CC BY-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
29. C4Corpus (CC-BY part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
30. Česká a slovenská slavistická komparatistika a wollmanovská tradice /
- Type:
- text and monografie kolektivní
- Subject:
- Filologie, Wollman, Frank,, slavisté, slavistika, lingvistika komparativní, komparatistika literární, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, English, Polish, Russian, Slovak, and Ukrainian
- Description:
- Published in cooperation with Středoevropské centrum slovanských studií, Slavistická společnost Franka Wollmana, and Ústav slavistiky FF MU and Published by Jan Sojnek - Galium
- Rights:
- unknown
31. Československá zahraniční politika po osvobození 1945 :
- Type:
- text, dokumenty, and edice
- Subject:
- Mezinárodní vztahy, světová politika, dějiny československé, vztahy mezinárodní, politika zahraniční, diplomacie, Československo 1945-1948, and zahraniční politika, mezinárodní vztahy
- Language:
- Czech, English, Croatian, Polish, and Slovak
- Rights:
- unknown
32. Československá zahraniční politika v roce 1943.
- Type:
- text, dokumenty, and edice
- Subject:
- Mezinárodní vztahy, světová politika, politika zahraniční, vztahy mezinárodní, vláda exilová, válka druhá světová (1939-1945), odboj druhý (protifašistický), Československo 1938-1945, and zahraniční politika, mezinárodní vztahy
- Language:
- Czech, English, French, Polish, Russian, and Slovak
- Description:
- Authentic documents revealing the political and diplomatic relations of the Czechoslovak political representation with the great powers and other states from the beginning of August to the end of December 1943.
- Rights:
- unknown
33. Československo a krize demokracie ve střední Evropě ve 30. a 40. letech XX. století :
- Creator:
- Šedivý, Ivan,
- Type:
- text and monografie kolektivní
- Subject:
- Dějiny Česka a Slovenska, vztahy mezinárodní, politika zahraniční, válka druhá světová (1939-1945), české (československé) sborníky a kolektivní monografie, Československo 1918-1992, and zahraniční politika, mezinárodní vztahy
- Language:
- Czech, English, Polish, and Slovak
- Description:
- In the preliminaries: editors Ivan Šedivý ... et al., Masarykův ústav a archiv AV ČR, Historický ústav AV ČR, Ústav pro soudobé dějiny AV ČR, Part 2 of the cycle: České křižovatky evropských dějin, Other title in the imprint: České křižovatky v evropských dějinách 1918-1938-1948-1968, Cover title: České křižovatky evropských dějin - 1938, Cover title: 1938: Československo a krize demokracie ve střední Evropě ve 30. a 40. letech XX. století - hledání východisek, and Title on added title page: 1938: Československo a krize demokracie ve střední Evropě ve 30. a 40. letech XX. století - hledání východisek
- Rights:
- unknown
34. Cesta k realitě :
- Creator:
- Štěpán, Ludvík,
- Type:
- text and sborníky
- Subject:
- Slovanské literatury (o nich), literatura, dějiny kultury, literatura, spisovatelé, české země 1848-1918, and české (československé) sborníky a kolektivní monografie
- Language:
- Czech, Polish, and Slovak
- Description:
- Text partly in Polish and Slovak and Published in cooperation with Ústav slavistiky FF MU and Středoevropské centrum slovanských studií
- Rights:
- unknown
35. Čeština v pohledu synchronním a diachronním :
- Type:
- text and sborníky konferenční
- Subject:
- Lingvistika. Jazyky, instituce vědecké, Akademie věd ČR, Ústav pro jazyk český, české (československé) sborníky a kolektivní monografie, Československo 1918-1992, české země od r. 1993 do současnosti, jazyk, písmo, and dějiny vědeckých institucí, vysokých škol
- Language:
- Czech, Bulgarian, Polish, Slovak, and Slovenian
- Description:
- Texts of papers from the international conference of the same name, held in Prague on 1-3 June 2011 by Ústav pro jazyk český Akademie věd ČR
- Rights:
- unknown
36. Cirkvi a národy strednej Európy (1800-1950). = Die Kirchen und Völker Mitteleuropas (1800-1950) /
- Publisher:
- Universum,
- Subject:
- sborníky, církve, dějiny církevní, and zahraniční periodika a sborníky
- Language:
- Slovak, Czech, German, and Polish
- Rights:
- unknown
37. Conclusions = :
- Creator:
- Mayer, Françoise,
- Type:
- text and studie
- Subject:
- Dějiny Evropy, povstání, antifašismus, paměť kolektivní, paměť historická, ideologie komunistická, strany politické komunistické, interpretace dějin, Československo 1945-1948, Československo 1948-1969, odboj, odpor, antifašismus, antikomunismus, Polsko, and světové dějiny od r. 1945 do současnosti
- Language:
- French, Slovak, Polish, and English
- Rights:
- unknown
38. CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data
- Creator:
- Zeman, Daniel and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- tokenization, word segmentation, morphology, tagging, syntax, parsing, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies. This package contains the test data in the form in which they were presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) and Universal Dependencies 2.2 (CoNLL 2018); see the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ and http://universaldependencies.org/conll18/. Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data; conll18-ud-test-2018-05-06 ... CoNLL 2018 test data; conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
39. CoNLL 2017 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Straka, Milan, Popel, Martin, Dozat, Timothy, Qi, Peng, Manning, Christopher, Shi, Tianze, Wu, Felix G., Chen, Xilun, Cheng, Yao, Björkelund, Anders, Falenska, Agnieszka, Yu, Xiang, Kuhn, Jonas, Che, Wanxiang, Guo, Jiang, Wang, Yuxuan, Zheng, Bo, Zhao, Huaipeng, Liu, Yang, Teng, Dechuan, Liu, Ting, Lim, Kyungtae, Poibeau, Thierry, Sato, Motoki, Manabe, Hitoshi, Noji, Hiroshi, Matsumoto, Yuji, Kırnap, Ömer, Önder, Berkay Furkan, Yuret, Deniz, Straková, Jana, Vania, Clara, Zhang, Xingxing, Lopez, Adam, Heinecke, Johannes, Asadullah, Munshi, Kanerva, Jenna, Luotolahti, Juhani, Ginter, Filip, Kuan, Yu, Sofroniev, Pavel, Schill, Erik, Hinrichs, Erhard, Nguyen, Dat Quoc, Dras, Mark, Johnson, Mark, Qian, Xian, Vilares, David, Gómez-Rodríguez, Carlos, Aufrant, Lauriane, Wisniewski, Guillaume, Yvon, François, Dumitrescu, Stefan Daniel, Boroş, Tiberiu, Tufiş, Dan, Das, Ayan, Zaffar, Affan, Sarkar, Sudeshna, Wang, Hao, Zhao, Hai, Zhang, Zhisong, Hornby, Ryan, Taylor, Clark, Park, Jungyeul, de Lhoneux, Miryam, Shao, Yan, Basirat, Ali, Kiperwasser, Eliyahu, Stymne, Sara, Goldberg, Yoav, Nivre, Joakim, Akkuş, Burak Kerim, Azizoglu, Heval, Cakici, Ruket, Moor, Christophe, Merlo, Paola, Henderson, James, Wang, Haozhou, Ji, Tao, Wu, Yuanbin, Lan, Man, de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, More, Amir, Tsarfaty, Reut, Kanayama, Hiroshi, Muraoka, Masayasu, Yoshikawa, Katsumasa, Garcia, Marcos, and Gamallo, Pablo
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency parser and parsebank
- Language:
- Arabic, Bulgarian, Russia Buriat, Czech, Catalan, Church Slavic, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, French, Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Swedish, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- This package contains the system outputs from the CoNLL 2017 Shared Task in Multilingual Parsing from Raw Text to Universal Dependencies.
- Rights:
- Licence Universal Dependencies v2.0, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.0, and PUB
40. CoNLL 2018 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Duthoo, Elie, Mesnard, Olivier, Rybak, Piotr, Wróblewska, Alina, Che, Wanxiang, Liu, Yijia, Wang, Yuxuan, Zheng, Bo, Liu, Ting, Li, Zuchao, He, Shexia, Zhang, Zhuosheng, Zhao, Hai, Wu, Yingting, Tong, Jia-Jun, Nguyen, Dat Quoc, Verspoor, Karin, Wan, Hui, Naseem, Tahira, Lee, Young-Suk, Castelli, Vittorio, Ballesteros, Miguel, Hershcovich, Daniel, Abend, Omri, Rappoport, Ari, Smith, Aaron, Bohnet, Bernd, de Lhoneux, Miryam, Nivre, Joakim, Shao, Yan, Stymne, Sara, Kırnap, Ömer, Dayanık, Erenay, Yuret, Deniz, Kanerva, Jenna, Ginter, Filip, Miekka, Niko, Leino, Akseli, Salakoski, Tapio, Lim, KyungTae, Park, Cheoneum, Lee, Changki, Poibeau, Thierry, Bhat, Riyaz Ahmad, Bhat, Irshad, Bangalore, Srinivas, Qi, Peng, Dozat, Timothy, Zhang, Yuhao, Manning, Christopher, Boroș, Tiberiu, Dumitrescu, Stefan Daniel, Burtica, Ruxandra, Arakelyan, Gor, Hambardzumyan, Karen, Khachatrian, Hrant, Rosa, Rudolf, Mareček, David, Straka, Milan, Seker, Amit, More, Amir, Tsarfaty, Reut, Önder, Berkay Furkan, Gümeli, Can, Jawahar, Ganesh, Muller, Benjamin, Fethi, Amal, Martin, Louis, Villemonte de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, Özateş, Şaziye Betül, Özgür, Arzucan, Gungor, Tunga, Öztürk, Balkız, Ji, Tao, Liu, Yufang, Wang, Yijun, Wu, Yuanbin, Lan, Man, Chen, Danlu, Lin, Mengxiao, Hu, Zhifeng, and Qiu, Xipeng
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- parsed data, conllu, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- Test data parsed by systems submitted to the CoNLL 2018 UD parsing shared task.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
41. Constans et perpetua voluntas :
- Publisher:
- Trnavská univerzita,
- Type:
- sborníky jubilejní
- Subject:
- Právo, Blaho, Peter,, právo, dějiny práva, and zahraniční periodika a sborníky
- Language:
- Slovak, Czech, German, English, Latin, and Polish
- Rights:
- unknown
42. Corpus for training and evaluating diacritics restoration systems
- Creator:
- Náplava, Jakub, Straka, Milan, Hajič, Jan, and Straňák, Pavel
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- diacritical marks generation and natural language correction
- Language:
- Czech, Vietnamese, Romanian, Polish, Slovak, Spanish, Croatian, Irish, Latvian, Hungarian, French, and Turkish
- Description:
- Corpus of texts in 12 languages. For each language, we provide one training, one development and one testing set acquired from Wikipedia articles. Moreover, each language dataset contains a (substantially larger) training set collected from (general) Web texts. All sets, except for the Wikipedia and Web training sets, which can contain similar sentences, are disjoint. Data are segmented into sentences, which are further word-tokenized. All data in the corpus contain diacritics. To strip diacritics from them, use the Python script diacritization_stripping.py contained within the attached stripping_diacritics.zip. This script has two modes. We generally recommend using the method called uninames, which for some languages behaves better. The code for training a recurrent neural-network-based model for diacritics restoration is located at https://github.com/arahusky/diacritics_restoration.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
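The description of record 42 mentions stripping diacritics with a method called uninames. The sketch below is not the diacritization_stripping.py script from that record; it is only one plausible reading of a name-based approach, using Python's standard unicodedata module. The function names and the rule of cutting the Unicode character name at " WITH " are assumptions made for illustration. A second function shows the common decomposition-based alternative for comparison; whether it corresponds to the script's other mode is also an assumption.

```python
import unicodedata


def strip_diacritics_uninames(text: str) -> str:
    """Hypothetical name-based stripping: map each character to the base
    letter named before " WITH " in its Unicode character name."""
    out = []
    for ch in text:
        name = unicodedata.name(ch, "")
        # e.g. "LATIN SMALL LETTER C WITH CARON" -> "LATIN SMALL LETTER C"
        if " WITH " in name:
            base_name = name.split(" WITH ", 1)[0]
            try:
                out.append(unicodedata.lookup(base_name))
                continue
            except KeyError:
                # No character carries the shortened name; keep the original.
                pass
        out.append(ch)
    return "".join(out)


def strip_diacritics_nfd(text: str) -> str:
    """Decomposition-based stripping: apply NFD, then drop combining marks."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


if __name__ == "__main__":
    print(strip_diacritics_uninames("Příliš žluťoučký kůň"))  # Prilis zlutoucky kun
    print(strip_diacritics_uninames("łódź"))                  # lodz
    print(strip_diacritics_nfd("łódź"))                       # łodz
```

The last two calls show where the two approaches differ: the Polish letter ł has no canonical decomposition, so the decomposition-based function leaves it unchanged, while the name-based function maps it to l.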
43. DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking
- Creator:
- Kubeša, David and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- entity linking, NEL, NER, dataset, and knowledge base
- Language:
- Afrikaans, Arabic, Armenian, Basque, Belarusian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Maltese, Marathi, Modern Greek (1453-), Northern Sami, Norwegian Nynorsk, Persian, Polish, Portuguese, Romanian, Russian, Scottish Gaelic, Serbian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, Uighur, Ukrainian, Urdu, Vietnamese, and Wolof
- Description:
- We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains language-agnostic information about entities, including their claims from Wikidata and named entity types (PER, ORG, LOC, EVENT, BRAND, WORK_OF_ART, MANUFACTURED); and Wikipedia texts with entity mentions linked to the knowledge base, along with language-specific text from Wikidata such as labels, aliases, and descriptions, stored separately for each language. The Wikidata QID is used as a persistent, language-agnostic identifier, enabling the combination of the knowledge base with language-specific texts and information for each entity. Wikipedia documents deliberately annotate only a single mention for every entity present; we further automatically detect all mentions of named entities linked from each document. The dataset contains 27.9M named entities in the knowledge base and 12.3G tokens from Wikipedia texts. The dataset is published under the CC BY-SA licence.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
44. De rerum humanarum emendatione consultatio catholica a odkaz Jana Amosa Komenského pre tretie tisícročie
- Type:
- text and prameny
- Subject:
- Věda. Všeobecnosti. Základy vědy a kultury. Vědecká práce, Komenský, Jan Amos,, spisy, komeniana, zahraniční periodika a sborníky, české země 1620-1740, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Latin, Slovak, Czech, German, English, and Polish
- Description:
- "Collection of materials from the international conference held in Bratislava on 13 and 14 November 2000"--P. [1]
- Rights:
- unknown
45. De rerum humanarum emendatione consultatio catholica a odkaz Jana Amosa Komenského pre tretie tisícročie /
- Type:
- text and sborníky konferenční
- Subject:
- Přirozená teologie. Náboženská filozofie, Komenský, Jan Amos,, komeniana, dějiny vědy, umění, kultury a techniky, kulturní vztahy, české země 1620-1740, and zahraniční periodika a sborníky
- Language:
- Slovak, Czech, German, English, Spanish, and Polish
- Description:
- Note on p. 1: Collection of materials from the international conference held in Bratislava on 13 and 14 November 2000
- Rights:
- unknown
46. Deep Universal Dependencies 2.4
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, and Galician
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-2988). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.4, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.4, and PUB
47. Deep Universal Dependencies 2.5
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, and Skolt Sami
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3105). It contains additional deep-syntactic and semantic annotations. The version of Deep UD corresponds to the version of UD it is based on. Note, however, that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.5, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.5, and PUB
48. Deep Universal Dependencies 2.6
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, and Persian
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3226). It contains additional deep-syntactic and semantic annotations. The version of Deep UD corresponds to the version of UD it is based on. Note, however, that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.6, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.6, and PUB
49. Deep Universal Dependencies 2.7
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, and Tupinambá
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3424). It contains additional deep-syntactic and semantic annotations. The version of Deep UD corresponds to the version of UD it is based on. Note, however, that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.7, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.7, and PUB
50. Deep Universal Dependencies 2.8
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, Tupinambá, Beja, Western Frisian, Urubú-Kaapor, Kangri, K'iche', Low German, Makuráp, Western Armenian, and Central Siberian Yupik
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3687). It contains additional deep-syntactic and semantic annotations. The version of Deep UD corresponds to the version of UD it is based on. Note, however, that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.8, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.8, and PUB