Search Results
Publisher:
King's College London
Format:
application/tei+xml
Type:
corpus
Language:
English
Description:
Charters written in Anglo-Saxon England before A.D. 900, marked up in TEI XML. Browsable online.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Language:
German
Description:
Transcribed narrative interviews with people from East and West Berlin about the events of November 9. 282,000 tokens. TEI XML, lemma and POS. Normalized version also available.
Rights:
Not specified
Publisher:
Coventry University, University of Reading, University of Warwick
Format:
application/tei+xml
Type:
corpus
Language:
English
Description:
Transcribed recordings of 160 lectures and 39 seminars held in university departments. Four broad disciplinary groups, 1,644,942 tokens in total.
Rights:
Not specified
Publisher:
University College, Cork
Format:
application/tei+xml
Type:
corpus
Language:
English, Irish, and Latin
Description:
Searchable online corpus of multilingual texts of Irish literature and history.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Language:
German
Description:
9 million words in 1,150 texts from the GDR, written between 1949 and 1990. Part of the DWDS project.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
lexicalConceptualResource
Language:
German
Description:
Retro-digitized version of the first edition of the Deutsches Wörterbuch by Jacob and Wilhelm Grimm, originally published from 1854 to 1960.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Subject:
German studies
Language:
German
Description:
German reference corpus. Ca. 100 million words, 20th century. Searchable online. Part of the 'Digitales Wörterbuch der deutschen Sprache des 20. Jahrhunderts' project; BBAW corpus; basis of the DWDS.
Rights:
Not specified
Publisher:
Real Academia Española
Format:
application/tei+xml
Type:
corpus
Language:
Spanish
Description:
Written and spoken (10%) material from 1975-2004. About 160 million words.
Rights:
Not specified
Publisher:
Real Academia Española
Format:
application/tei+xml
Type:
corpus
Language:
Spanish
Description:
Written diachronic corpus with a variety of text types produced before 1975. About 250 million words.
Rights:
Not specified
Publisher:
University of Tartu
Format:
application/tei+xml
Type:
corpus
Language:
Estonian
Description:
Collection of Estonian texts (divided into subcorpora); ca. 175 million words; TEI.
Rights:
Not specified
Publisher:
University of Cambridge
Format:
application/tei+xml
Type:
corpus
Language:
Welsh
Description:
Welsh texts from the period 1500-1850. Overall the corpus contains around 420,000 words from 30 texts.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Language:
German
Description:
Written German from 1920-39. 500,000 tokens, 392 texts. POS and lemma, TEI XML. Part of Das digitale Wörterbuch der deutschen Sprache des 20. Jahrhunderts.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Language:
German
Description:
Articles from the 'Berliner Zeitung' online edition from 3.1.1994 to 31.12.2005. About 252 million tokens in 869,000 articles. Part of the DWDS project.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Language:
German
Description:
Transcribed speech from the 20th century, about 2.5 million words. 7 categories, 756 speakers. Part of the DWDS project.
Rights:
Not specified
Publisher:
Newcastle University
Format:
application/tei+xml
Type:
corpus
Language:
English
Description:
A corpus of dialect speech from Tyneside in North-East England. Includes digitized audio, standard orthographic transcription, phonetic transcription, and part-of-speech tagging.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Subject:
corpus
Language:
German
Description:
The C4 corpus is a joint effort of the project Digitales Wörterbuch der deutschen Sprache (DWDS), the Austrian Academy Corpus (AAC), the Korpus Südtirol and the Schweizer Textkorpus (CHTK). The corpus is composed of corpora from all four partner institutions.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
lexicalConceptualResource
Language:
German
Description:
Six-volume dictionary of Standard German; retro-digitization of the printed version, which appeared 196–-1977.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Language:
German
Description:
Corpus of the weekly Die Zeit from 1946 to the present day (complete runs from 1996). Over 100 million words in 200,000 articles. Updated daily. Part of the DWDS project.
Rights:
Not specified