Harvested from: LINDAT/CLARIAH-CZ repository / Language: English - LINDAT/CLARIAH-CZ Catalog Search Results

111. English TTS speech corpus of air traffic (pilot) messages - German accent

Creator:: Matoušek, Jindřich and Tihelka, Daniel
Publisher:: University of West Bohemia, Department of Cybernetics
Type:: audio and corpus
Subject:: speech corpus, text-to-speech (TTS), and pitch-marks
Language:: English
Description:: The corpus contains recordings of male speaker, native in German, talking in English. The sentences that were read by the speaker originate in the domain of air traffic control (ATC), specifically the messages used by plane pilots during routine flight. The text in the corpus originates from the transcripts of the real recordings, part of which has been released in LINDAT/CLARIN (http://hdl.handle.net/11858/00-097C-0000-0001-CCA1-0), and individual phrases were selected by special algorithm described in Jůzová, M. and Tihelka, D.: Minimum Text Corpus Selection for Limited Domain Speech Synthesis (DOI 10.1007/978-3-319-10816-2_48). The corpus was used to create a limited domain speech synthesis system capable of simulating a pilot communication with an ATC officer.
Rights:: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB

112. English TTS speech corpus of air traffic (pilot) messages - Serbian accent

Creator:: Matoušek, Jindřich and Tihelka, Daniel
Publisher:: University of West Bohemia, Department of Cybernetics
Type:: audio and corpus
Subject:: speech corpus, text-to-speech (TTS), and pitch-marks
Language:: English
Description:: The corpus contains recordings of male speaker, native in Serbian, talking in English. The sentences that were read by the speaker originate in the domain of air traffic control (ATC), specifically the messages used by plane pilots during routine flight. The text in the corpus originates from the transcripts of the real recordings, part of which has been released in LINDAT/CLARIN (http://hdl.handle.net/11858/00-097C-0000-0001-CCA1-0), and individual phrases were selected by special algorithm described in Jůzová, M. and Tihelka, D.: Minimum Text Corpus Selection for Limited Domain Speech Synthesis (DOI 10.1007/978-3-319-10816-2_48). The corpus was used to create a limited domain speech synthesis system capable of simulating a pilot communication with an ATC officer.
Rights:: Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0), http://creativecommons.org/licenses/by-nc-sa/3.0/, and PUB

113. English TTS speech corpus of air traffic (pilot) messages - Taiwanese accent

Creator:: Matoušek, Jindřich and Tihelka, Daniel
Publisher:: University of West Bohemia, Department of Cybernetics
Type:: audio and corpus
Subject:: speech corpus, text-to-speech (TTS), and pitch-marks
Language:: English
Description:: The corpus contains recordings of male speaker, native in Taiwanese, talking in English. The sentences that were read by the speaker originate in the domain of air traffic control (ATC), specifically the messages used by plane pilots during routine flight. The text in the corpus originates from the transcripts of the real recordings, part of which has been released in LINDAT/CLARIN (http://hdl.handle.net/11858/00-097C-0000-0001-CCA1-0), and individual phrases were selected by special algorithm described in Jůzová, M. and Tihelka, D.: Minimum Text Corpus Selection for Limited Domain Speech Synthesis (DOI 10.1007/978-3-319-10816-2_48). The corpus was used to create a limited domain speech synthesis system capable of simulating a pilot communication with an ATC officer.
Rights:: Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0), http://creativecommons.org/licenses/by-nc-sa/3.0/, and PUB

114. English-Bulgarian INTERA

Type:: corpus
Language:: Bulgarian and English
Description:: Alignment – TMX, structural – XCES, morphosyntactic – XCES, MTE tagset
Rights:: Not specified

115. English-Czech Corpus from Wikipedia

Creator:: Štromajerová, Adéla, Baisa, Vít, and Blahuš, Marek
Publisher:: Masaryk University, NLP Centre
Type:: text and corpus
Subject:: Wikipedia
Language:: English and Czech
Description:: Sentence-parallel corpus made from English and Czech Wikipedias based on translated articles from English into Czech. The work done is described in the paper: ŠTROMAJEROVÁ, Adéla, Vít BAISA a Marek BLAHUŠ. Between Comparable and Parallel: English-Czech Corpus from Wikipedia. In RASLAN 2016 Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2016. s. 3-8, 6 s. ISBN 978-80-263-1095-2.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

116. English-Hindi Parallel Corpus

Creator:: Bojar, Ondřej, Straňák, Pavel, Zeman, Daniel, Jain, Gaurav, and Damani, Om Prakesh
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: English-Hindi parallel corpus and parallel corpus
Language:: Hindi and English
Description:: English-Hindi parallel corpus collected from several sources. Tokenized and sentence-aligned. A part of the data is our patch for the Emille parallel corpus. and FP7-ICT-2007-3-231720 (EuroMatrix Plus) 7E09003 (Czech part of EM+)
Rights:: Creative Commons - Attribution 3.0 Unported (CC BY 3.0), http://creativecommons.org/licenses/by/3.0/, and PUB

117. English-Latvian SMT system

Publisher:: Institute of Mathematics and Computer Science, University of Latvia
Type:: toolService
Language:: English
Description:: English-Latvian factored SMT system uses Moses decoder, trained on JRC-Acquis and some other parallel texts
Rights:: Not specified

118. English-Lithuanian Machine Translation Service

Publisher:: Center of Computational Linguistics, Vytautas Magnus University
Type:: toolService
Language:: English and Lithuanian
Description:: On-line freely accessible machine translation tool for translating English webpages or texts into Lithuanian.
Rights:: Not specified

119. English-Luganda Parallel Corpus

Publisher:: Center for Dutch Language and Speech, University of Antwerp
Type:: corpus
Language:: English
Description:: Bible. Word-alligned corpus
Rights:: Not specified

120. English-Slovak Parallel Corpus

Creator:: Galuščáková, Petra, Garabík, Radovan, and Bojar, Ondřej
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: parallel corpus and English-Slovak corpus
Language:: Slovak and English
Description:: English-Slovak parallel corpus consisting of several freely available corpora (Acquis [1], Europarl [2], Official Journal of the European Union [3] and part of OPUS corpus [4] – EMEA, EUConst, KDE4 and PHP) and downloaded website of European Commission [5]. Corpus is published in both in plaintext format and with an automatic morphological annotation. References: [1] http://langtech.jrc.it/JRC-Acquis.html/ [2] http://www.statmt.org/europarl/ [3] http://apertium.eu/data [4] http://opus.lingfil.uu.se/ [5] http://ec.europa.eu/ and This work has been supported by the grant Euro-MatrixPlus (FP7-ICT-2007-3-231720 of the EU and 7E09003 of the Czech Republic)
Rights:: Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0), http://creativecommons.org/licenses/by-nc-sa/3.0/, and PUB

111. English TTS speech corpus of air traffic (pilot) messages - German accent

112. English TTS speech corpus of air traffic (pilot) messages - Serbian accent

113. English TTS speech corpus of air traffic (pilot) messages - Taiwanese accent

114. English-Bulgarian INTERA

115. English-Czech Corpus from Wikipedia

116. English-Hindi Parallel Corpus

117. English-Latvian SMT system

118. English-Lithuanian Machine Translation Service

119. English-Luganda Parallel Corpus

120. English-Slovak Parallel Corpus

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Show values starting with

Creator

Show values starting with

Format

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Date

Original context has metadata only

Harvested from