Harvested from: LINDAT/CLARIAH-CZ repository - LINDAT/CLARIAH-CZ Catalog Search Results

1811. Söderwall/Söderwall supplement

Type:: lexicalConceptualResource
Description:: approx. 43,000 entries (approx. 25,000 distinct entries), various (XML version underway)
Rights:: Not specified

1812. SOLC

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: toolService
Language:: Catalan
Description:: An orthologic server for Catalan. A query system for the orthologic dictionary which allows making searches using dialectal and pragmatic variables.
Rights:: Not specified

1813. Somali Web Corpus

Creator:: Suchomel, Vít and Rychlý, Pavel
Publisher:: Masaryk University, NLP Centre
Type:: text and corpus
Subject:: text corpora, Ethiopian languages, web corpora, under-resourced languages, and Somali
Language:: Somali
Description:: Somali web corpus. Crawled by SpiderLing in January 2016. Encoded in UTF-8, cleaned, deduplicated.
Rights:: NLP Centre Web Corpus License, https://lindat.mff.cuni.cz/repository/xmlui/page/license-NLPC-WeC, and ACA

1814. Sophie Parallel Treebank

Type:: corpus
Language:: Estonian
Description:: 200 sentences, TIGER-XML
Rights:: Not specified

1815. Spanish Resource Grammar

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: languageDescription
Language:: Spanish
Description:: HPSG-like grammar for the analysis of Spanish, implemented in LKB
Rights:: Not specified

1816. Spanish WordNet 3.0

Publisher:: Universitat de Barcelona
Type:: lexicalConceptualResource
Language:: Spanish
Description:: 63,000 synsets, plain text
Rights:: Not specified

1817. Special Nouns Lexicon

Creator:: Namly, Driss
Publisher:: Ibtikarate
Type:: text, computationalLexicon, and lexicalConceptualResource
Subject:: particles
Language:: Arabic
Description:: An XML-based file containing Arabic stop words that follow noun syntax: particle nouns, signal nouns, separated pronouns, and connected nouns. Citation: Driss Namly, Yasser Regragui, Karim Bouzoubaa. "Interoperable Arabic language resources building and exploitation in SAFAR platform". 13th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA), November 29th to December 2nd, 2016.
Rights:: Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB

1818. Speech by Edvard Beneš on the 25th anniversary of Czechoslovakia

Creator:: (:unav) Unknown author
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: projev Beneš Edvard, výročí vznik ČSR 25., Vznik ČSR, Places::Velká Británie::Aston Abbots::vila Edvarda Beneše, and People::Beneš Edvard (1884-1948)
Language:: English
Description:: An audio speech by President Edvard Beneš to commemorate the 25th anniversary of Czechoslovakia. He speaks in English.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

1819. Speech Commands Dataset Enhanced for Direction-of-Arrival Estimation

Creator:: Beneš, David
Publisher:: University of West Bohemia, Department of Cybernetics
Type:: audio and corpus
Subject:: speech commands and keyword direction of arrival
Language:: English
Description:: This dataset can serve as a training and evaluation corpus for keyword detection with speaker direction estimation (keyword direction of arrival - KWDOA). It was created by processing the existing Speech Commands dataset [1] with the PyroomAcoustics library so that the resulting speech recordings simulate a circular array of 4 microphones with a distance of 57 mm between adjacent microphones. This design was chosen to match the existing physical microphone array from the Seeeduino series. [1] Warden, Pete. “Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition.” ArXiv.org, 2018, arxiv.org/abs/1804.03209
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

1820. Speech databases of typical children and children with SLI

Creator:: Tučková, Jana, Grill, Pavel, Vavřina, Josef, and Bártů, Marek
Publisher:: Department of Circuit Theory, Czech Technical University in Prague, Faculty of Electrical Engineering
Type:: audio and corpus
Subject:: Specific Language Impairments, Developmental Dysphasia, and Children Pathological Speech
Language:: Czech
Description:: Our Laboratory of Artificial Neural Network Applications (LANNA) at the Czech Technical University in Prague (head of the laboratory: Professor Jana Tučková) collaborates with the Department of Paediatric Neurology, 2nd Faculty of Medicine of Charles University in Prague, and with the Motol University Hospital (head of clinic: Professor Vladimír Komárek) on a project that focuses on the study of children with SLI. The speech database contains two subgroups of recordings of children's speech from different types of speakers. The first subgroup (healthy) consists of recordings of children without speech disorders; the second subgroup (patients) consists of recordings of children with SLI. The children with SLI are classified by degree of severity (1 – mild, 2 – moderate, and 3 – severe). Speech therapists and specialists from Motol Hospital decided upon this classification. The children’s speech was recorded in the period 2003–2013. The recordings were typically made in a schoolroom or a speech therapist’s consulting room, in the presence of surrounding background noise. This setting simulates the natural environment in which the children live and is important for capturing the normal behavior of children. The database of healthy children’s speech was created as a reference database for the computer processing of children’s speech. It was recorded on a SONY digital Dictaphone (sampling frequency fs = 16 kHz, 16-bit resolution, stereo mode, standard wav format) and on an MD SONY MZ-N710 (sampling frequency fs = 44.1 kHz, 16-bit resolution, stereo mode, standard wav format). The corpus was recorded in the natural environment of a schoolroom and in a clinic. This subgroup contains a total of 44 native Czech participants (15 boys, 29 girls) aged 4 to 12 years, and was recorded during the period 2003–2005. The database of children with SLI was recorded in a private speech therapist’s office.
The children’s speech was captured with a SHURE lapel microphone using a solution by the company AVID (MBox – USB AD/DA converter and ProTools LE software) on an Apple laptop (iBook G4). The sound recordings are saved in the standard wav format, with a sampling frequency of 44.1 kHz and 16-bit resolution in mono mode. This subgroup contains a total of 54 native Czech participants (35 boys, 19 girls) aged 6 to 12 years, and was recorded during the period 2009–2013. This package contains wav data sets for developing and testing methods for detecting children with SLI. Software pack: FORANA - original software for formant analysis, based on the MATLAB programming environment. Its development was driven mainly by the need to perform formant analysis correctly and to fully automate the extraction of formants from recorded speech signals. Development of this application is ongoing. The software was developed in LANNA at CTU FEE in Prague. LABELING - a program used for segmentation of the speech signal; it is part of the SOMLab program system. The software was developed in LANNA at CTU FEE in Prague. PRAAT - acoustic analysis software created by Paul Boersma and David Weenink of the Institute of Phonetic Sciences of the University of Amsterdam. Home page: http://www.praat.org or http://www.fon.hum.uva.nl/praat/.
Rights:: Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0), http://creativecommons.org/licenses/by-nc/3.0/, and PUB
