Skip to search
Skip to main content
Skip to first result
Search
Search Results
Publisher:
King's College London
Format:
application/tei+xml
Type:
corpus
Language:
English
Description:
Charters written in Anglo-Saxon England before A.D. 900, marked-up in TEI XML. Browsable online.
Rights:
Not specified
Publisher:
The Research Institute for the Languages of Finland
Type:
toolService
Language:
Finnish
Description:
The digital atlas illustrates the distribution of 234 common Finnish place-name elements based on data in the Names Archive.
Rights:
Not specified
Publisher:
Berlin-Brandenburg Academy of Sciences and Humanities
Format:
application/tei+xml
Type:
corpus
Language:
German
Description:
Transcribed narrative interviews with people from East and West Berlin about the events of November 9. 282,000 tokens. TEI XML, lemma and POS. Normalized version also available.
Rights:
Not specified
Publisher:
Coventry University, University of Reading, University of Warwick
Format:
application/tei+xml
Type:
corpus
Language:
English
Description:
Transcribed recordings of 160 lectures and 39 seminars held in university departments. Four broad disciplinary groups, 1,644,942 tokens in total.
Rights:
Not specified
Publisher:
Research Group in Computational Linguistics, University of Wolverhampton
Type:
corpus
Language:
English
Description:
Sentences annotated for important units of text for summarisation. 145,473 words / 6584 sentences
Rights:
Not specified
Publisher:
Max Planck Institute for Psycholinguistics
Type:
corpus
Language:
Dutch
Description:
The code-switching corpus consists of 5x30-minute conversations between four speakers (i.e. a total of 20 speakers). The speakers are bilingual speakers of Papiamento (a creole langauge spoken in the Dutch Antilles) and Dutch. In the course of their free conversations, they engage in code-switching, that is, they use both languages within the same utterance in systematic ways. The corpus is fully transcribed and glossed, coded for language and word class, in ELAN.
Rights:
Not specified
Publisher:
Archives of Latvian Folklore, Institute of Literature, Folklore and Art, University of Latvia and Institute of Mathematics and Computer Science, University of Latvia
Type:
corpus
Language:
Latvian
Description:
Latvian proverbs collected by Archives of Latvian Folklore (~ 20 000 items)
Rights:
Not specified
Publisher:
University of Tampere
Format:
application/octet-stream
Type:
corpus
Language:
Finnish and Russian
Description:
Juridical texts in Russian and Finnish arranged as a comparable text corpus
Rights:
Not specified
Publisher:
Institute of Mathematics and Computer Science, University of Latvia
Format:
text/plain
Type:
corpus
Subject:
balanced corpus
Language:
Latvian
Description:
Balanced corpus of Modern Latvian (~ 1 million running words, currently in plain-text), publicly available via Bonito interface
Rights:
Not specified
Publisher:
Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:
corpus
Subject:
oral corpus
Language:
Catalan
Description:
Oral corpus containing 10 sociolinguistic interviews carried out in La Canonja (Tarragona).
Rights:
Not specified