Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Rüdiger, Jan Oliver
Publisher:
Jan Oliver Rüdiger
Type:
tool and toolService
Subject:
Corpus Linguisitics , NLP , conll , tei , XML , nlp , Natural Language Processing , linguistics , Linguistics , Computational Linguistics , corpus processing , tagger , POS tagger , lemmatization , text cleaning , CommonCrawl , epub , JSON , Twitter , Pandoc , Wikipedia , digital data , DTA , DSpin , MySQL , ElasticSearch , TextGrid , text corpora , TigerXML , and WeblichtXML
Language:
German , English , French , Italian , Dutch , Spanish , Polish , Arabic , Chinese , and Portuguese
Description:
Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK).
Source code available at https://github.com/notesjor/corpusexplorer2.0
Rights:
Not specified
Publisher:
Joint Research Centre of the EU
Type:
corpus
Language:
Bulgarian , Czech , Danish , Dutch , English , Estonian , Finnish , French , German , Modern Greek (1453-) , Hungarian , Italian , Latvian , Maltese , Norwegian , Polish , Portuguese , Romanian , Slovak , Slovenian , Spanish , and Swedish
Description:
The largest parallel corpus, contains EU law, the Acquis Communautaire in 22 languages.
Rights:
Not specified
Publisher:
Universität Bamberg, World Language Documentation Centre
Format:
application/octet-stream
Type:
lexicalConceptualResource
Language:
Afrikaans , Arabic , Basque , Bulgarian , Catalan , Chinese , Czech , Danish , Dutch , English , Esperanto , Estonian , Finnish , French , Galician , Georgian , Modern Greek (1453-) , Hebrew , Hungarian , Icelandic , Indonesian , Interlingua (International Auxiliary Language Association) , Irish , Italian , Japanese , Khmer , Norwegian , Polish , Portuguese , Romanian , Russian , Serbian , Slovak , Spanish , Swedish , Turkish , Ukrainian , and Welsh
Rights:
GFDL or CC and http://www.omegawiki.org/Licensing
Type:
corpus
Language:
Czech , Danish , Dutch , English , Finnish , French , German , Hungarian , Italian , Polish , Portuguese , Russian , Spanish , Swedish , Turkish , Chinese , Hebrew , Japanese , Korean , and Thai
Description:
28 speech databases containing broadband recordings from 550 adults and 50 children per language. Contains interesting phonetically rich material. All orthographically transcribed. Speaker information included for gender, age, accent. Including pronunciation lexicon.
Rights:
Not specified
Type:
text and slovníky
Subject:
Historická věda. Pomocné vědy historické. Archivnictví , Lingvistika. Jazyky , sfragistika , and terminologie odborná
Language:
Slovak , Czech , Polish , Hungarian , Belarusian , German , Spanish , French , English , Italian , Lithuanian , Norwegian , Dutch , Portuguese , Romanian , Russian , Swedish , and Ukrainian
Description:
"Adaptovaný a ilustrovaný slovensko-česko-poľsko-maďarský preklad Medzinárodného sfragistického slovníka ... s pripojenými prekladmi názvov hesiel v bieloruštine, nemčine, španielčine, francúzštine, angličtine, taliančine, litovčine, nórčine, holandštine, portugalčine, rumunčine, ruštine, švédštine a ukrajinčine"--Strana 5, Přeloženo z francouzštiny?, and Obsahuje rejstříky
Rights:
unknown