Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Gurevych, Iryna , Habernal, Ivan , and Zayed, Omnia
Publisher:
Technische Universität Darmstadt
Type:
text and corpus
Subject:
CommonCrawl , Creative Commons , Web corpus , and Amazon Web Services
Language:
Afrikaans , Arabic , Bulgarian , Czech , Danish , German , Modern Greek (1453-) , English , Estonian , Persian , Finnish , French , Croatian , Hungarian , Indonesian , Italian , Japanese , Korean , Latvian , Lithuanian , Dutch , Norwegian , Polish , Portuguese , Russian , Slovenian , Somali , Spanish , Swahili (macrolanguage) , Swedish , Tagalog , Thai , Turkish , Ukrainian , Undetermined , and Vietnamese
Description:
A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
Rights:
Public Domain Mark (PD) , http://creativecommons.org/publicdomain/mark/1.0/ , and PUB
Creator:
Miloš Weingart
Publisher:
Klub moderních filologů
Format:
print and xxxii, 378 s.
Type:
text , volume , sborníky , model:monograph , and TEXT
Subject:
Filologie , Slovanské jazyky , Pastrnek, František , 1853-1940 , filologie , slavistika , slovanské jazyky , slovanské literatury , 80 , 80(=16)+908(4) , 811.16 , 821.16 , (082) , and 11
Language:
Czech , Croatian , and Russian
Description:
red. Miloš Weingart. and KČSN
Rights:
http://creativecommons.org/publicdomain/mark/1.0/ and policy:public