Harvested from: LINDAT/CLARIAH-CZ repository

Date: 2008 to 2009

1. Comparable Russian-Finnish corpus of juridical texts

Publisher:: University of Tampere
Format:: application/octet-stream
Type:: corpus
Language:: Finnish and Russian
Description:: Juridical texts in Russian and Finnish arranged as a comparable text corpus
Rights:: Not specified

2. Corpus "Miljons"

Publisher:: Institute of Mathematics and Computer Science, University of Latvia
Format:: text/plain
Type:: corpus
Subject:: balanced corpus
Language:: Latvian
Description:: Balanced corpus of Modern Latvian (approximately 1 million running words, currently in plain text), publicly available via the Bonito interface
Rights:: Not specified

3. Corpus de parlants catalanoparlants de La Canonja en temps real (TR)

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: corpus
Subject:: oral corpus
Language:: Catalan
Description:: Oral corpus containing 10 sociolinguistic interviews carried out in La Canonja (Tarragona).
Rights:: Not specified

4. Corpus Nederlandse Gebarentaal (CNGT)

Publisher:: Radboud University Nijmegen
Type:: corpus
Subject:: Linguistics and language technology
Description:: The Corpus NGT is a collection of data from deaf signers using Sign Language of the Netherlands (NGT). The data consist of recordings with multiple synchronised video cameras, accompanied by gloss and translation annotations.
Rights:: Creative Commons BY-NC-SA 3.0 NL license (http://creativecommons.org/licenses/by-nc-sa/3.0/nl/)

5. Czech Academic Corpus (CAC) 2.0

Publisher:: Charles University
Type:: corpus
Language:: Czech
Description:: The Prague family of annotated corpora has a new member, the Czech Academic Corpus version 2.0 (CAC 2.0). CAC 2.0 consists of 650,000 words from various 1970s and 1980s newspapers, magazines and radio and television broadcast transcripts manually annotated for morphology and syntax.
Rights:: LDC Licence; LDC Catalog No.: LDC2008T22

6. Delftse Bijbel 1477

Publisher:: NBG/DBNL/INL; Nicoline van der Sijs
Type:: corpus
Language:: Dutch
Description:: Digitised version of the Delftse Bijbel 1477
Rights:: Not specified

7. DPC (Dutch Parallel Corpus)

Publisher:: Katholieke Universiteit Leuven Campus Kortrijk, Hogeschool Gent
Type:: corpus
Language:: Dutch, English, and French
Description:: Parallel corpus with Dutch as first language, 10 million words (under construction). DPC is a STEVIN project.
Rights:: Not specified

8. Early Irish Glossaries

Publisher:: Department of Anglo-Saxon, Norse, and Celtic at the University of Cambridge
Type:: lexicalConceptualResource
Language:: Irish
Description:: Database of three inter-related early Irish glossaries. The texts, compiled from the eighth century onward, comprise several thousand headwords followed by entries that range from single-word explanations to whole narratives running to several pages.
Rights:: Not specified

9. French learner language oral corpora

Publisher:: University of Southampton and Newcastle University
Type:: corpus
Language:: French
Description:: Seven French L2 corpora. Digital sound files and related transcripts formatted using CHILDES software. The database currently contains over 4000 files (sound files, transcripts and morphosyntactically tagged transcripts).
Rights:: Not specified

10. IFA dialog video corpus

Publisher:: IFA-groep, University of Amsterdam
Type:: corpus
Language:: Dutch
Description:: A video collection of spontaneous speech dialogues of 42 participants (14 male, 28 female)
Rights:: GNU GPL

