Harvested from: LINDAT/CLARIAH-CZ repository - LINDAT/CLARIAH-CZ Catalog Search Results

Start Over Harvested from LINDAT/CLARIAH-CZ repository Date Unknown

171. Corpus of Old Literary Finnish

Publisher:: The Research Institute for the Languages of Finland
Type:: corpus
Language:: Finnish
Description:: This is a linguistically unannotated corpus of various historical texts written between 1543 and 1809. The corpus consists of 3,428,618 words and is available for online browsing.
Rights:: Not specified

172. Corpus of Old Written Estonian

Publisher:: University of Tartu
Type:: corpus
Language:: Estonian
Description:: Corpus of texts written fully or partly in Estonian, from 13.-19. century; 1,5 million words
Rights:: Not specified

173. Corpus of precisely articulated Czech speech

Creator:: Hanzlíček, Zdeněk, Kochová, Pavla, Tihelka, Daniel, Kövérová, Markéta, Matoušek, Jindřich, and Ševeček, Pavel
Publisher:: University of West Bohemia, Department of Cybernetics and Lingea, s.r.o.
Type:: audio and corpus
Subject:: speech corpus, text-to-speech (TTS), speech synthesis, and hyperarticulated speech
Language:: Czech
Description:: The corpus contains speech data of 2 Czech native speakers, male and female. The speech is very precisely articulated up to hyper-articulated, and the speech rate is low. The speech data with a highlighted articulation is suitable for teaching foreigners the Czech language, and it can also be used for people with hearing or speech impairment. The recorded sentences can be used either directly, e.g., as a part of educational material, or as source data for building complex educational systems incorporating speech synthesis technology. All recorded sentences were precisely orthographically annotated and phonetically segmented, i.e., split into phones, using modern neural network-based methods.
Rights:: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB

174. Corpus of Present-day Written Estonian

Type:: corpus
Language:: Estonian
Description:: written general; 95 mio words; TEI/SGML
Rights:: Not specified

175. Corpus of Proverbs and Other Colloquial Expressions

Publisher:: The Research Institute for the Languages of Finland
Type:: corpus
Language:: Finnish
Rights:: Not specified

176. Corpus of Spoken Estonian

Type:: corpus
Language:: Estonian
Description:: spoken general; 1 mio words; local tagset
Rights:: Not specified

177. Corpus of the Contemporary Lithuanian Language

Publisher:: Center of Computational Linguistics, Vytautas Magnus University
Format:: application/octet-stream
Type:: corpus
Language:: Lithuanian
Description:: 140 million words; Corpus of the Contemporary Lithuanian Language which comprises 160 million words is a collection of texts designed to represent current Lithuanian. The corpus is compiled from printed material during Lithuania's independence period (since 1990). The corpus is designed to represent as wide a range of contemporary written Lithuanian as possible. The largest part of the corpus is comprised of General Press (texts from regional and national newspapers), Popular Press, and Special Press (specialized newspapers and magazines). These texts have been intended for general readers, as well as specialists. The rest of the corpus consists of Fiction, Memoirs, other literature (scientific and popular), and various official texts. The larger part of the corpus is freely accessible for online search at http://donelaitis.vdu.lt.
Rights:: Not specified

178. Corpus of Written Estonian

Type:: corpus
Language:: Estonian
Description:: 4.4 mio words; TEI/SGML
Rights:: Not specified

179. Corpus PAAU 92

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: corpus
Language:: Spanish
Description:: The electronic version of the book “Corpus PAAU 1992: Descriptive Studies, Texts and Vocabulary” includes the texts that have been object of analysis in this project as well as the vocabulary lists that make up the Corpus 92.
Rights:: Not specified

180. Corpus query for Estonian corpora

Publisher:: University of Tartu
Type:: toolService
Language:: Estonian
Description:: Web application for querying the automatically morphologically disambiguated Mixed corpus of Estonian
Rights:: Not specified

« Previous
Next »
1
2
…
14
15
16
17
18
19
20
21
22
…
112
113

171. Corpus of Old Literary Finnish

172. Corpus of Old Written Estonian

173. Corpus of precisely articulated Czech speech

174. Corpus of Present-day Written Estonian

175. Corpus of Proverbs and Other Colloquial Expressions

176. Corpus of Spoken Estonian

177. Corpus of the Contemporary Lithuanian Language

178. Corpus of Written Estonian

179. Corpus PAAU 92

180. Corpus query for Estonian corpora

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Show values starting with

Creator

Show values starting with

Format

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Original context has metadata only

Harvested from