Skip to search
Skip to main content
Skip to first result
Search
Search Results
Type:
corpus
Language:
Arabic , Danish , Dutch , English , German , Modern Greek (1453-) , Italian , Japanese , Korean , Portuguese , Russian , Spanish , and Turkish
Description:
Large set of subtitles available for download in multiple languages. Can be used as parallel corpus.
Rights:
Not specified
Publisher:
Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:
lexicalConceptualResource
Language:
Catalan , English , French , Galician , Italian , Portuguese , and Spanish
Description:
A vocabulary resulting from the cooperation of the groups of REALITER network that collects the basic terminology mostly used in texts about Genomics. It contains equivalents in English, Peninsular and Latinamerican Spanish, French, Italian, Galician, Portuguese and Catalan.
Rights:
Not specified
Publisher:
TALG Research Group (University of Vigo)
Type:
corpus
Language:
Basque , Catalan , English , French , Galician , German , Portuguese , and Spanish
Description:
Parallel corpus, 22 million words
Rights:
Not specified
Creator:
Rüdiger, Jan Oliver
Publisher:
Jan Oliver Rüdiger
Type:
tool and toolService
Subject:
Corpus Linguisitics , NLP , conll , tei , XML , nlp , Natural Language Processing , linguistics , Linguistics , Computational Linguistics , corpus processing , tagger , POS tagger , lemmatization , text cleaning , CommonCrawl , epub , JSON , Twitter , Pandoc , Wikipedia , digital data , DTA , DSpin , MySQL , ElasticSearch , TextGrid , text corpora , TigerXML , and WeblichtXML
Language:
German , English , French , Italian , Dutch , Spanish , Polish , Arabic , Chinese , and Portuguese
Description:
Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK).
Source code available at https://github.com/notesjor/corpusexplorer2.0
Rights:
Not specified
Type:
text and sborníky
Subject:
Iberorománské jazyky , Dějiny Jižní Ameriky. Latinská Amerika , hispanistika , iberoamerikanistika , and česká periodika
Language:
Spanish and Portuguese
Rights:
unknown
Publisher:
Karolinum,
Type:
sborníky
Subject:
Iberorománské jazyky , Dějiny Jižní Ameriky. Latinská Amerika , hispanistika , iberoamerikanistika , and česká periodika
Language:
Spanish and Portuguese
Rights:
unknown
Publisher:
Karolinum,
Type:
sborníky
Subject:
Iberorománské jazyky , Dějiny Jižní Ameriky. Latinská Amerika , hispanistika , iberoamerikanistika , and česká periodika
Language:
Spanish and Portuguese
Rights:
unknown
Type:
text and sborníky
Subject:
Iberorománské jazyky , Dějiny Jižní Ameriky. Latinská Amerika , hispanistika , iberoamerikanistika , and česká periodika
Language:
Spanish and Portuguese
Rights:
unknown
Publisher:
Karolinum,
Type:
sborníky
Subject:
Iberorománské jazyky , Dějiny Jižní Ameriky. Latinská Amerika , hispanistika , iberoamerikanistika , and česká periodika
Language:
Spanish and Portuguese
Rights:
unknown
Publisher:
Karolinum,
Type:
sborníky
Subject:
Iberorománské jazyky , Dějiny Jižní Ameriky. Latinská Amerika , hispanistika , iberoamerikanistika , and česká periodika
Language:
Spanish and Portuguese
Rights:
unknown