Number of results to display per page
Search Results
1542. Vystadial 2013 – Czech data
- Creator:
- Korvas, Matěj, Plátek, Ondřej, Dušek, Ondřej, Žilka, Lukáš, and Jurčíček, Filip
- Publisher:
- Charles University, Faculty of Mathematics and Physics
- Type:
- audio and corpus
- Subject:
- acoustic data, speech corpus, spoken corpus, orthographic transcriptions, telephone speech, voip, and dialogue system
- Language:
- Czech
- Description:
- Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the Czech data part of the dataset. and This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
- Rights:
- Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB
1543. Vystadial 2013 – English data
- Creator:
- Korvas, Matěj, Plátek, Ondřej, Dušek, Ondřej, Žilka, Lukáš, and Jurčíček, Filip
- Publisher:
- Charles University, Faculty of Mathematics and Physics
- Type:
- audio and corpus
- Subject:
- acoustic data, speech corpus, spoken corpus, orthographic transcriptions, telephone speech, voip, and dialogue system
- Language:
- English
- Description:
- Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the English data part of the dataset. and This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
- Rights:
- Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB
1544. Vystadial 2013 – scripts
- Creator:
- Korvas, Matěj, Plátek, Ondřej, Dušek, Ondřej, Žilka, Lukáš, and Jurčíček, Filip
- Publisher:
- Charles University, Faculty of Mathematics and Physics
- Type:
- toolService and tool
- Subject:
- ASR, HTK, Kaldi, and acoustic model
- Language:
- English and Czech
- Description:
- Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and building acoustic models using the HTK and Kaldi toolkits. This is the scripts part of the dataset. and This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
- Rights:
- Apache License 2.0, http://opensource.org/licenses/Apache-2.0, and PUB
1545. Vystadial 2016 – Czech data
- Creator:
- Plátek, Ondřej, Dušek, Ondřej, and Jurčíček, Filip
- Publisher:
- Charles University, Faculty of Mathematics and Physics
- Type:
- audio and corpus
- Subject:
- acoustic data, speech corpus, spoken corpus, telephone speech, voip, and dialogue system
- Language:
- Czech
- Description:
- This is the Czech data collected during the `VYSTADIAL` project. It is an extension of the 'Vystadial 2013' Czech part data release. The dataset comprises of telephone conversations in Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
1546. W2C – Web to Corpus – Corpora
- Creator:
- Majliš, Martin
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- multilingual corpora
- Language:
- Afrikaans, Tosk Albanian, Amharic, Arabic, Aragonese, Egyptian Arabic, Asturian, Azerbaijani, Belarusian, Bengali, Bosnian, Bishnupriya, Breton, Buginese, Bulgarian, Catalan, Cebuano, Czech, Chuvash, Corsican, Welsh, Danish, German, Dimli (individual language), Modern Greek (1453-), English, Esperanto, Estonian, Basque, Faroese, Persian, Finnish, French, Western Frisian, Gan Chinese, Scottish Gaelic, Irish, Galician, Gilaki, Gujarati, Haitian, Serbo-Croatian, Hebrew, Fiji Hindi, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Ido, Interlingua (International Auxiliary Language Association), Indonesian, Icelandic, Italian, Javanese, Japanese, Kannada, Georgian, Kazakh, Korean, Kurdish, Latin, Latvian, Limburgan, Lithuanian, Lombard, Luxembourgish, Malayalam, Marathi, Macedonian, Malagasy, Mongolian, Maori, Malay (macrolanguage), Burmese, Neapolitan, Low German, Nepali (macrolanguage), Newari, Dutch, Norwegian Nynorsk, Norwegian, Occitan (post 1500), Ossetian, Pampanga, Piemontese, Polish, Portuguese, Quechua, Romanian, Russian, Yakut, Sicilian, Scots, Slovak, Slovenian, Spanish, Albanian, Serbian, Sundanese, Swahili (macrolanguage), Swedish, Tamil, Tatar, Telugu, Tajik, Tagalog, Thai, Turkish, Ukrainian, Urdu, Uzbek, Venetian, Vietnamese, Volapük, Waray (Philippines), Walloon, Yiddish, Yoruba, and Chinese
- Description:
- A set of corpora for 120 languages automatically collected from wikipedia and the web. Collected using the W2C toolset: http://hdl.handle.net/11858/00-097C-0000-0022-60D6-1
- Rights:
- Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB
1547. W2C – Web to Corpus – tool
- Creator:
- Majliš, Martin
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- toolService and suiteOfTools
- Subject:
- web data, wikipedia, and corpus creation
- Description:
- A tool used to build multilingual corpora from wikipedia. Download the web pages, convert them to plain text, identify language, etc. A set of 120 corpora collected using this tool is available at https://ufal-point.mff.cuni.cz/xmlui/handle/11858/00-097C-0000-0022-6133-9
- Rights:
- Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB
1548. Water Polo Championship
- Creator:
- Aktualita
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- závody vodní pólo, pólo vodní, bazén plavecký, mužstvo Slavia, mužstvo ČPK Plzeň, diváci na vodním pólu, akce Kuratorium pro výchovu mládeže, Kuratorium pro výchovu mládeže akce, Kuratorium, Places::Praha::Barrandov::bazén, and Český zvukový týdeník Aktualita::1943/26A
- Language:
- Czech
- Description:
- Segment from Český zvukový týdeník Aktualita (Czech Aktualita Sound Newsreel) 1943 issue no. 26A from 1943 captures the Slavia vs. Pilsen water polo match that was a part of the Provincial Youth Swimming Championship organised by the Board of Trustees for the Education of Youth in cooperation with the Czech Amateur Swimming Union and held at the swimming pool in Prague-Barrandov on 3 and 4 July.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
1549. Week of Czech Youth
- Creator:
- (:unav) Unknown author
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- Týden mládeže akce, akce Týden mládeže 1944, kroje lidové, slavnost mládeže, kapela mládežnická na slavnosti, Kuratorium pro výchovu mládeže v Čechách a na Moravě a, akce Kuratorium pro výchovu mládeže, znak Kuratorium pro výchovu mládeže, dirigent kapely mládežnické, průvod mládeže, mládež v krojích, vlajky české protektorátní, vlajky s hákovým křížem, dívky pochodující, stejnokroje Kuratorium pro výchovu mládeže, stadion, cvičení hromadná, diváci na stadionu, důstojníci němečtí v publiku na stadionu, projev veřejný, cvičenci nastoupení na stadionu, cvičení prostná ženy, závody běžecké, běh štafetový, cvičení na nářadí, cvičení Pestrá louka, tance na stadionu, cvičenci hajlující, hajlování, Kuratorium, Places::Praha::Nové Město::Václavské náměstí, Places::Praha::Nové Město::Na Příkopě, Places::Praha::Strahov::stadion, Places::Praha::Staré Město::Staroměstské náměstí, Places::Praha::Staré Město::Staroměstská radnice, People::Moravec Emanuel (1893-1945), People::Pfitzner Josef (1901-1945), People::Krejčí Jaroslav (1892-1956), and People::Teuner František (1911-1978)
- Language:
- Czech
- Description:
- Segment from Český zvukový týdeník Aktualita (Czech Aktualita Sound Newsreel) issue no. 29A from 1944 was shot during the Week of Czech Youth event organised by the Board of Trustees for the Education of Youth and held from 1 to 9 July. The programme included a concert held on Old Town Square on 8 July. The orchestra and choir consisted of several hundred young musicians and singers. Minister of Education and People´s Enlightenment and Chairman of the Board Emanuel Moravec and Deputy Mayor Joseph Pfitzner watched the event from the balcony of the Old Town Hall. The Board of Trustees´ youth set out from the square in a parade through the streets of Prague. The following day, a sports afternoon took place at Strahov Stadium. Guests of honour included Prime Minister Jaroslav Krejčí and the General Secretary of the Board František Teuner. Emanuel Moravec spoke to the participants. The programme included women´s floor exercises, track and field races and women in stylised costumes dancing to folk songs. The event was concluded with the athletes and audience paying homage to Adolf Hitler.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
1550. Wenzl Jaksch's Speech
- Creator:
- Aktualita
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- projev Jaksch Wenzl, portrét Masaryk Tomáš Garrigue, Mnichovská dohoda, People::Jaksch Wenzel (1896-1966), and Československý zvukový týdeník Aktualita::1938/8A
- Language:
- Czech
- Description:
- The segment of Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel), 1938, issue no. 8A shows a speech delivered in Czech by Wenzel Jaksch, the MP for the German Social Democratic Workers' Party (DSAP), about the possible coexistence of, and understanding between, Czechs and Germans.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)