Search Results
Type:
corpus
Language:
Italian
Description:
Dialect (Tuscan); 380,000 entries; written; DBT tagset
Rights:
Not specified
Type:
corpus
Language:
Arabic, Danish, Dutch, English, German, Modern Greek (1453-), Italian, Japanese, Korean, Portuguese, Russian, Spanish, and Turkish
Description:
Large set of subtitles available for download in multiple languages. Can be used as a parallel corpus.
Rights:
Not specified
Publisher:
Università degli studi di Napoli Federico II
Type:
corpus
Language:
Italian
Description:
Audio files of about 100 hours of speech from 15 different cities in Italy. Some recordings are transcribed and can be read as PDF.
Rights:
Not specified
Publisher:
Copenhagen Business School
Format:
application/octet-stream
Type:
corpus
Subject:
parallel treebank, POS annotation, discourse annotation, morphological annotation, syntactic annotation, and semantic annotation
Language:
Danish, English, German, Italian, and Spanish
Description:
Parallel treebanks with annotation of syntax, discourse, coreference, morphology, and semantics. Version 3 also includes the Danish Dependency Treebank (version 1) and the Danish-English Parallel Dependency Treebank (version 2).
Rights:
GNU General Public License
Publisher:
University of Glasgow
Type:
corpus
Language:
Italian
Description:
Italian emblem books from the Stirling Maxwell Collection (University of Glasgow). Transcribed text and photographic reproductions. Searchable and browsable online.
Rights:
Not specified
Publisher:
Joint Research Centre of the EU
Type:
corpus
Language:
Bulgarian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Modern Greek (1453-), Hungarian, Italian, Latvian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, and Swedish
Description:
The largest parallel corpus; it contains EU law (the Acquis Communautaire) in 22 languages.
Rights:
Not specified
Publisher:
Max Planck Institute for Psycholinguistics
Type:
corpus
Language:
German, Italian, and Polish
Description:
Language Acquisition corpus
Rights:
Not specified
Publisher:
Max Planck Institute for Psycholinguistics
Type:
corpus
Language:
Italian
Description:
Language and Cognition corpus
Rights:
Not specified
Type:
corpus
Language:
Danish, Dutch, English, Finnish, French, German, Italian, Latin, Portuguese, Russian, Spanish, Swedish, and Telugu
Description:
Free electronic books available to download or browse online. German-language literature makes up only part of the available e-books.
Rights:
Not specified
Publisher:
Machine Learning and NLP group at Trento
Type:
corpus
Subject:
sentiment analysis
Language:
English and Italian
Description:
Sentiment analysis of YouTube videos with joint models of text and speech
Rights:
Not specified