« Previous |
1 - 10 of 161
|
Next »
Number of results to display per page
Search Results
2. Acta onomastica
- Type:
- text and sborníky
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, Russian, English, Slovak, and Polish
- Rights:
- unknown
3. Acta onomastica
- Type:
- text and časopisy
- Subject:
- Seriálové publikace. Periodika, onomastika, toponomastika, and česká periodika
- Language:
- Czech, Russian, English, Slovak, and Polish
- Rights:
- unknown
4. Aktuální otázky slovanské filologie a Šafaříkův vědecký odkaz /
- Type:
- text and sborníky
- Subject:
- Filologie, Šafařík, Pavel Josef,, slavistika, slavisté, filologie slovanská, české (československé) sborníky a kolektivní monografie, české země 1792-1918, and dějiny slavistiky
- Language:
- Czech, English, German, Italian, Polish, Russian, and Slovak
- Description:
- Zvl. otisk čas. Slavia 65 (1996), seš. 1, str. 1-162
- Rights:
- unknown
5. Artificial Treebank with Ellipsis
- Creator:
- Droganova, Kira, Zeman, Daniel, Kanerva, Jenna, and Ginter, Filip
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- universal dependencies, ellipsis, and gapping
- Language:
- English, Czech, Finnish, Russian, and Slovak
- Description:
- Artificially created treebank of elliptical constructions (gapping), in the annotation style of Universal Dependencies. Data taken from UD 2.1 release, and from large web corpora parsed by two parsers. Input data are filtered, sentences are identified where gapping could be applied, then those sentences are transformed, one or more words are omitted, resulting in a sentence with gapping. Details in Droganova et al.: Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions, LREC 2018, Miyazaki, Japan.
- Rights:
- Licence Universal Dependencies v2.1, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.1, and PUB
6. Aurel Stodola 1859-1942 :
- Type:
- text and biografie
- Subject:
- Dějiny Česka a Slovenska, Stodola, Aurel,, vědy přírodní, Švýcarsko, školství, pedagogika, učitelé, péče o mládež, Slovensko 1848-1918, dějiny techniky, technici, Slovensko 1780-1918, světové dějiny 1789-1918, and zahraniční periodika a sborníky
- Language:
- Slovak, German, and Russian
- Rights:
- unknown
7. Bratislava, mapa z roku 1820
- Creator:
- Neyder, Johann Leopold,
- Type:
- text and mapy
- Subject:
- Geografie Česka a Slovenska, reálie, cestování, Mapy. Atlasy. Glóby, mapy staré, plány měst, and historická kartografie, atlasy, staré mapy
- Language:
- German, English, Russian, and Slovak
- Description:
- Ve společné papírové obálce and Doplňková mapka Pressburg im Jahre 1760 a kresba bratislavského hradu
- Rights:
- unknown
8. C4Corpus (CC BY-NC part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
9. C4Corpus (CC BY-NC-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB
10. C4Corpus (CC BY-NC-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB