Search Results
Type:
corpus
Language:
Arabic, Danish, Dutch, English, German, Modern Greek (1453-), Italian, Japanese, Korean, Portuguese, Russian, Spanish, and Turkish
Description:
Large set of subtitles available for download in multiple languages; can be used as a parallel corpus.
Rights:
Not specified
Publisher:
Center for Sprogteknologi, University of Copenhagen
Type:
toolService
Language:
Danish, Dutch, English, German, Modern Greek (1453-), Icelandic, Norwegian, Russian, Slovenian, and Swedish
Description:
1) Fully automatic rule-based lemmatization of inflected languages; 2) fully automatic training of lemmatization rules from a full form-lemma list
Rights:
Not specified
Creator:
Cikán, Ondřej
Type:
text and study
Subject:
Philology, Longus, Greek writers, novels, humor, grapevine, textual criticism, ancient Greece, Crete, and literature, writers
Language:
German and Modern Greek (1453-)
Description:
Buried wine, lucrative Syrinx: on Longus' humor and a translation problem in I, 19 and III, 29.
Rights:
unknown
Type:
corpus
Language:
Modern Greek (1453-)
Description:
70K words. Non-validated sentence segmentation and POS tagging; manual annotation of syntactic dependencies and dependency labels; manual annotation of semantic roles; manual annotation of events based on a shallow domain-specific ontology (only for a 31K-word subset of GDT).
Rights:
Not specified
Publisher:
Institute for Language and Speech Processing
Format:
application/octet-stream
Type:
corpus
Language:
Modern Greek (1453-)
Description:
General language corpus of standard Modern Greek; 47 MWs
Rights:
Not specified
Type:
lexicalConceptualResource
Language:
Bulgarian, English, Modern Greek (1453-), Serbian, and Slovenian
Description:
17357 terms, XML
Rights:
Not specified
Type:
corpus
Language:
English, French, and Modern Greek (1453-)
Description:
Multilingual (EN, EL, FR); multimodal (Video, Text); parallel (EN, EL, FR subtitles); comparable (transcripts, subtitles); 120 hours
Rights:
Not specified
Publisher:
Universität Bamberg, World Language Documentation Centre
Format:
application/octet-stream
Type:
lexicalConceptualResource
Language:
Afrikaans, Arabic, Basque, Bulgarian, Catalan, Chinese, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, Georgian, Modern Greek (1453-), Hebrew, Hungarian, Icelandic, Indonesian, Interlingua (International Auxiliary Language Association), Irish, Italian, Japanese, Khmer, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Spanish, Swedish, Turkish, Ukrainian, and Welsh
Rights:
GFDL or CC and http://www.omegawiki.org/Licensing
Publisher:
Center for Reading Research, Ghent University
Type:
lexicalConceptualResource
Language:
Chinese, Dutch, English, German, Modern Greek (1453-), and Spanish
Rights:
Not specified
Format:
text/html
Type:
corpus
Language:
Modern Greek (1453-)
Description:
ca. 700,000 tokens; linked with relational database; XML encoding in progress
Rights:
http://titus.uni-frankfurt.de/texte/texte2.htm#Estart