Original context has metadata only: true / Type: corpus - LINDAT/CLARIAH-CZ Catalog Search Results

121. French learner language oral corpora

Publisher:: University of Southampton and Newcastle University
Type:: corpus
Language:: French
Description:: Seven French L2 corpora. Digital sound files and related transcripts formatted using CHILDES software. The database currently contains over 4000 files (sound files, transcripts and morphosyntactically tagged transcripts).
Rights:: Not specified

122. French-Croatian Parallel Corpus

Type:: corpus
Language:: Croatian and French
Description:: written; domain-specific (fiction); diachronic (the French side); bilingual; parallel; ca 263,000 tokens (148 Kw French; 115 Kw Croatian); XML; S-alignment
Rights:: Not specified

123. GerManC : A representative historical corpus of German 1650-1800

Type:: corpus
Language:: German
Description:: The ultimate aim of the project is to compile a representative historical corpus of written German for the years 1650-1800. The complete GerManC corpus will contain 2000-word samples from nine genres.
Rights:: Not specified

124. Greek Dependency Treebank (GDT)

Type:: corpus
Language:: Modern Greek (1453-)
Description:: 70K words; non-validated sentence segmentation; non-validated POS tagging; manual annotation of syntactic dependencies and dependency labels; manual annotation of semantic roles; manual annotation of events based on a shallow domain-specific ontology (only for a 31K-word subset of GDT)
Rights:: Not specified

125. Helsinki annotated corpus of Russian language HANCO

Publisher:: The Department of Modern Languages, University of Helsinki and University of Helsinki
Format:: application/octet-stream
Type:: corpus
Subject:: Corpus linguistics
Language:: Russian
Description:: Morphologically and syntactically annotated corpus of the modern Russian language.
Rights:: Not specified

126. Helsinki Corpus of British English Dialects

Publisher:: University of Helsinki
Format:: text/plain
Type:: corpus
Language:: English
Description:: Collection of orthographically transcribed, audio-recorded speech, mainly from East Anglia and the South-West, with a minor collection from Lancashire. The recordings were made in the 1970s and the 1980s by Finnish postgraduates.
Rights:: Not specified

127. Helsinki Corpus of English Texts (HC)

Publisher:: University of Helsinki
Format:: text/plain
Type:: corpus
Language:: English
Description:: A balanced multi-genre corpus of English texts dating from c. 730 to 1710.
Rights:: Not specified

128. Helsinki Corpus of Older Scots (HCOS)

Publisher:: University of Helsinki
Format:: text/plain
Type:: corpus
Language:: English
Description:: A balanced multi-genre corpus modelled on the Helsinki Corpus, covering the years 1450-1700.
Rights:: Not specified

129. Historical Corpus of the Welsh Language 1500-1850

Publisher:: University of Cambridge
Format:: application/tei+xml
Type:: corpus
Language:: Welsh
Description:: Welsh texts from the period 1500-1850. Overall the corpus contains around 420,000 words from 30 texts.
Rights:: Not specified

130. HNC (Hellenic National Corpus)

Publisher:: Institute for Language and Speech Processing
Format:: application/octet-stream
Type:: corpus
Language:: Modern Greek (1453-)
Description:: General language corpus of standard Modern Greek; 47 million words
Rights:: Not specified
