Skip to search
Skip to main content
Skip to first result
Search
Search Results
Publisher:
King's College London
Format:
application/tei+xml
Type:
corpus
Language:
English
Description:
Charters written in Anglo-Saxon England before A.D. 900, marked-up in TEI XML. Browsable online.
Rights:
Not specified
Publisher:
Coventry University, University of Reading, University of Warwick
Format:
application/tei+xml
Type:
corpus
Language:
English
Description:
Transcribed recordings of 160 lectures and 39 seminars held in university departments. Four broad disciplinary groups, 1,644,942 tokens in total.
Rights:
Not specified
Publisher:
Research Group in Computational Linguistics, University of Wolverhampton
Type:
corpus
Language:
English
Description:
Sentences annotated for important units of text for summarisation. 145,473 words / 6584 sentences
Rights:
Not specified
Publisher:
Archives of Latvian Folklore, Institute of Literature, Folklore and Art, University of Latvia and Institute of Mathematics and Computer Science, University of Latvia
Type:
corpus
Language:
Latvian
Description:
Latvian proverbs collected by Archives of Latvian Folklore (~ 20 000 items)
Rights:
Not specified
Publisher:
Kompetenzzentrum für elektronische Erschließungs and Publikationsverfahren in den Geisteswissenschaften
Type:
lexicalConceptualResource
Language:
German
Description:
Online edition of the Grimm brothers' "Deutsche Wörterbuch" (1838). Each word shows the Grimms' etymological sources. Also available on CD-ROM
Rights:
Not specified
Publisher:
University of Dundee
Type:
lexicalConceptualResource
Language:
English
Description:
Historical dictionary of the Scottish language as written and spoken by lowland Scots in Scotland and Ulster from the 12th century onward. Over eighty thousand full-word entries.
Rights:
Not specified
Publisher:
Radboud University Nijmegen , Max Planck Institute for Psycholinguistics , University of Stockholm , and City University London
Format:
video/mpeg
Type:
corpus
Description:
This is a corpus of four European sign languages. It contains richly annotated video files of Sign Language of the Netherlands (Nederlandse Gebarentaal), British Sign Language, and Swedish Sign Language; data include narratives, dialogues, small lexicons, and poetry. In addition, parts of a corpus of German Sign Language (Deutsche Gebärdensprache) is included that was already published on paper before.
Rights:
Creative Commons BY-NC-SA 3.0 NL license and http://creativecommons.org/licenses/by-nc-sa/3.0/nl/
Publisher:
University of Cambridge
Format:
application/tei+xml
Type:
corpus
Language:
Welsh
Description:
Welsh texts from the period 1500-1850. Overall the corpus contains around 420,000 words from 30 texts.
Rights:
Not specified
Publisher:
Academy of Sciences
Format:
application/xml
Type:
corpus
Language:
Hungarian
Description:
Containing 27 million running words the Hungarian Historical Corpus provides a valuable basis for research on the history of words of Hungarian between the second half of the 18th century and 2000.
Rights:
Not specified
Publisher:
Academy of Sciences
Format:
application/xml
Type:
corpus
Subject:
synchronic corpus
Language:
Hungarian
Description:
Written general synchronic reference corpus; 190m tokens; POS annotated XML
Rights:
Not specified