Rights: PUB - LINDAT/CLARIAH-CZ Catalog Search Results

481. Victor

Creator:: Marek, Michal
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: toolService
Subject:: html cleaning
Description:: Victor is a web page cleaning tool. It is aimed at removing menu, ads, footers, headers, etc. from HTML web pages, so that only main web page content remains. Victor is based on a conditional random fields algorithm.
Rights:: GNU General Public License, version 2, http://www.gnu.org/licenses/gpl-2.0.html, and PUB

482. Victoria

Creator:: Spousta, Miroslav
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: toolService
Subject:: web page processing
Description:: Victoria is an on-line HTML web page annotation tool suitable for selecting texts on the web pages. It can be used to mark important/interesting parts of web pages for further processing.
Rights:: GNU General Public License, version 2, http://www.gnu.org/licenses/gpl-2.0.html, and PUB

483. Visible Vowels

Creator:: van de Velde, Hans and Heeringa, Wilbert
Publisher:: Fryske Akademy
Type:: tool and toolService
Subject:: Phonetics, language variation, and dialect variation
Description:: This program enables the user to plot vowels in the F1/F2 space for multiple points in the vowel interval, e.g. at 20%, 50% and 80%.
Rights:: GNU General Public License 2 or later (GPL-2.0), http://opensource.org/licenses/GPL-2.0, and PUB

484. Visible Vowels (2017-05-29)

Creator:: Van de Velde, Hans and Heeringa, Wilbert
Publisher:: Fryske Akademy
Type:: tool and toolService
Subject:: language variation, dialect variation, f0, formants, duration, normalization, visualization, and phonetics
Description:: This program enables the user to visualize f0 contours, to plot vowels in the F1/F2 space for multiple points in the vowel interval, e.g. at 20%, 50% and 80%, and to visualize vowel durations. (The tool is implemented in R. We used the following packages: phonR, gplots, plotrix, lattice, readxl, WriteXLS, DT, psych and pracma. We thank the developers of these packages.)
Rights:: GNU General Public License 2 or later (GPL-2.0), http://opensource.org/licenses/GPL-2.0, and PUB

485. VPS-30-En

Creator:: Cinková, Silvie, Holub, Martin, Rambousek, Adam, and Smejkalová, Lenka
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text, lexicon, and lexicalConceptualResource
Subject:: corpus pattern analysis, clustering, lexical semantics, and verbs
Language:: English
Description:: VPS-30-En is a small lexical resource that contains the following 30 English verbs: access, ally, arrive, breathe, claim, cool, crush, cry, deny, enlarge, enlist, forge, furnish, hail, halt, part, plough, plug, pour, say, smash, smell, steer, submit, swell, tell, throw, trouble, wake and yield. We have created and have been using VPS-30-En to explore the interannotator agreement potential of the Corpus Pattern Analysis. VPS-30-En is a small snapshot of the Pattern Dictionary of English Verbs (Hanks and Pustejovsky, 2005), which we revised (both the entries and the annotated concordances) and enhanced with additional annotations. and This work has been partly supported by the Ministry of Education of CR within the LINDAT-Clarin project LM2010013, and by the Czech Science Foundation under the projects P103/12/G084, P406/2010/0875 and P401/10/0792.
Rights:: Creative Commons - Attribution 3.0 Unported (CC BY 3.0), http://creativecommons.org/licenses/by/3.0/, and PUB

486. VPS-GradeUp (2016-10-10)

Creator:: Baisa, Vít, Cinková, Silvie, Krejčová, Ema, and Vernerová, Anna
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text, other, and lexicalConceptualResource
Subject:: Pattern Dictionary of English Verbs, usage patterns, lexical semantics, dictionaries, clustering, Corpus Pattern Analysis, verbs, graded decisions, Likert scale, and Word Sense Disambiguation
Language:: English
Description:: VPS-GradeUp is a collection of triple manual annotations of 29 English verbs based on the Pattern Dictionary of English Verbs (PDEV) and comprising the following lemmas: abolish, act, adjust, advance, answer, approve, bid, cancel, conceive, cultivate, cure, distinguish, embrace, execute, hire, last, manage, murder, need, pack, plan, point, praise, prescribe, sail, seal, see, talk, urge . It contains results from two different tasks: 1. Graded decisions 2. Best-fit pattern (WSD) . In both tasks, the annotators were matching verb senses defined by the PDEV patterns with 50 actual uses of each verb (using concordances from the BNC [2]). The verbs were randomly selected from a list of completed PDEV lemmas with at least 3 patterns and at least 100 BNC concordances not previously annotated by PDEV’s own annotators. Also, the selection excluded verbs contained in VPS-30-En[3], a data set we developed earlier. This data set was built within the project Reviving Zellig S. Harris: more linguistic information for distributional lexical analysis of English and Czech and in connection with the SemEval-2015 CPA-related task.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

487. Vystadial 2013 – Czech data

Creator:: Korvas, Matěj, Plátek, Ondřej, Dušek, Ondřej, Žilka, Lukáš, and Jurčíček, Filip
Publisher:: Charles University, Faculty of Mathematics and Physics
Type:: audio and corpus
Subject:: acoustic data, speech corpus, spoken corpus, orthographic transcriptions, telephone speech, voip, and dialogue system
Language:: Czech
Description:: Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the Czech data part of the dataset. and This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
Rights:: Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB

488. Vystadial 2013 – English data

Creator:: Korvas, Matěj, Plátek, Ondřej, Dušek, Ondřej, Žilka, Lukáš, and Jurčíček, Filip
Publisher:: Charles University, Faculty of Mathematics and Physics
Type:: audio and corpus
Subject:: acoustic data, speech corpus, spoken corpus, orthographic transcriptions, telephone speech, voip, and dialogue system
Language:: English
Description:: Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the English data part of the dataset. and This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
Rights:: Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB

489. Vystadial 2013 – scripts

Creator:: Korvas, Matěj, Plátek, Ondřej, Dušek, Ondřej, Žilka, Lukáš, and Jurčíček, Filip
Publisher:: Charles University, Faculty of Mathematics and Physics
Type:: toolService and tool
Subject:: ASR, HTK, Kaldi, and acoustic model
Language:: English and Czech
Description:: Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the scripts part of the dataset. and This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
Rights:: Apache License 2.0, http://opensource.org/licenses/Apache-2.0, and PUB

490. Vystadial 2016 – Czech data

Creator:: Plátek, Ondřej, Dušek, Ondřej, and Jurčíček, Filip
Publisher:: Charles University, Faculty of Mathematics and Physics
Type:: audio and corpus
Subject:: acoustic data, speech corpus, spoken corpus, telephone speech, voip, and dialogue system
Language:: Czech
Description:: This is the Czech data collected during the `VYSTADIAL` project. It is an extension of the 'Vystadial 2013' Czech part data release. The dataset comprises of telephone conversations in Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

481. Victor

482. Victoria

483. Visible Vowels

484. Visible Vowels (2017-05-29)

485. VPS-30-En

486. VPS-GradeUp (2016-10-10)

487. Vystadial 2013 – Czech data

488. Vystadial 2013 – English data

489. Vystadial 2013 – scripts

490. Vystadial 2016 – Czech data

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Creator

Show values starting with

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Original context has metadata only

Harvested from