Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bengali , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Gujarati , Hebrew , Hindi , Croatian , Hungarian , Indonesian , Italian , Japanese , Korean , Latvian , Lithuanian , Malayalam , Macedonian , Dutch , Norwegian , Polish , Portuguese , Romanian , Russian , Slovak , Slovenian , Somali , Spanish , Albanian , Swahili (macrolanguage) , Swedish , Tamil , Tagalog , Thai , Turkish , Ukrainian , Undetermined , Vietnamese , and Chinese
Description:
A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
Rights:
Creative Commons - Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0) , http://creativecommons.org/licenses/by-nc/4.0/ , and PUB
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bengali , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Gujarati , Hebrew , Hindi , Croatian , Hungarian , Indonesian , Italian , Japanese , Kannada , Korean , Latvian , Lithuanian , Malayalam , Marathi , Macedonian , Nepali (macrolanguage) , Dutch , Norwegian , Panjabi , Polish , Portuguese , Romanian , Russian , Slovak , Slovenian , Somali , Spanish , Albanian , Swahili (macrolanguage) , Swedish , Tamil , Telugu , Tagalog , Thai , Turkish , Ukrainian , Undetermined , Urdu , Vietnamese , and Chinese
Description:
A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
Rights:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) , http://creativecommons.org/licenses/by-sa/4.0/ , and PUB
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bengali , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Gujarati , Hebrew , Hindi , Croatian , Hungarian , Indonesian , Italian , Japanese , Kannada , Korean , Latvian , Lithuanian , Malayalam , Marathi , Macedonian , Nepali (macrolanguage) , Dutch , Norwegian , Panjabi , Polish , Portuguese , Romanian , Russian , Slovak , Slovenian , Somali , Spanish , Albanian , Swahili (macrolanguage) , Swedish , Tamil , Telugu , Tagalog , Thai , Turkish , Ukrainian , Undetermined , Urdu , Vietnamese , and Chinese
Description:
A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
Rights:
Creative Commons - Attribution 4.0 International (CC BY 4.0) , http://creativecommons.org/licenses/by/4.0/ , and PUB
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Croatian , Hungarian , Indonesian , Italian , Japanese , Korean , Latvian , Lithuanian , Dutch , Norwegian , Polish , Portuguese , Russian , Slovenian , Somali , Spanish , Swahili (macrolanguage) , Swedish , Tagalog , Thai , Turkish , Ukrainian , Undetermined , and Vietnamese
Description:
A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
Rights:
Public Domain Mark (PD) , http://creativecommons.org/publicdomain/mark/1.0/ , and PUB
Creator:
Duma, Paweł,
Type:
text and studie
Subject:
Sochařství, keramika, porcelán, umělecké zpracování kovů , archeologie, nálezy , žetony , numizmatika , and jednotlivé mince
Language:
Polish
Description:
Žetony z náměstí Nowy Targ ve Wrocławi nalezené během archeologických výzkumů v letech 2010-2011.
Rights:
unknown
Creator:
Bogucka, Maria,
Type:
text and studie
Subject:
Hospodářská a výrobní odvětví , dějiny hospodářské , manufaktury , výroba cihel , Polsko , řemesla, cechy, mlýny, lomy , and světové dějiny 1492-1648
Language:
Polish
Rights:
unknown
Creator:
Jagosz-Zarzycka, Zofia
Type:
text and studie
Subject:
Dějiny zemí střední Evropy , kultura púchovská , Keltové , osídlení pravěké , archeologie, lokality , archeologie, nálezy , and české země v době laténské
Language:
Polish
Description:
The Celts in Teschen. Finding-place of Púchov culture on the Castle Hill.
Rights:
unknown
Creator:
Novotný, Lubomír,
Type:
text and články
Subject:
Knihovny , knihovny vědecké , péče památková , fondy knihovní , digitalizace , české a československé knihovny, knižní fondy , and jednotlivé památky, památkové rezervace
Language:
Polish and Czech
Rights:
unknown
Creator:
Urbańczyk, Przemysław,
Type:
text and studie
Subject:
Architektura , hradiště , středověk raný , symbolika , světové dějiny středověku (do r. 1492) , hrady, hradiště, zámky, tvrze, dvory , and politické dějiny, politici
Language:
Polish
Description:
Central functions of strongholds in early mediaval societies.
Rights:
unknown
Creator:
Bogus-Spyra, Marzena,
Type:
text and studie
Subject:
Výchova a vzdělávání , učitelé , Češi slezští , spolky , hnutí národní , otázka jazyková , české země 1848-1918 , dějiny spolků , školství, pedagogika, učitelé, péče o mládež , and národnosti, vztahy mezi národnostmi a národní hnutí
Language:
Polish
Description:
Central Association of Czech Teachers in Silesia [Ústřední spolek českých učitelů ve Slezsku] in the Years 1894-1918.
Rights:
unknown