Skip to search
Skip to main content
Skip to first result
Search
Search Results
Creator:
Böhm, Stanislav , Hajič, Jan , Srdečný, Vojtěch , Toman, Josef , and Košarko, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
A VIADAT module; VIADAT-ANNOTATE is an interactive annotation environment.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Böhm, Stanislav , Hajič, Jan , Srdečný, Vojtěch , Toman, Josef , and Košarko, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
A VIADAT module; VIADAT-ANNOTATE is an interactive annotation environment.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Böhm, Stanislav , Hajič, Jan , Srdečný, Vojtěch , Toman, Josef , and Košarko, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
A VIADAT module; VIADAT-GIS connects the platform with maps.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Böhm, Stanislav , Hajič, Jan , Srdečný, Vojtěch , Toman, Josef , and Košarko, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
A VIADAT module; VIADAT-GIS connects the platform with maps.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Böhm, Stanislav and Hajič, Jan
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
VIADAT-SEARCH in connection with VIADAT-REPO enables searching transcripts of oral history recordings. Language analysis has been used to preprocess the recordings, which makes it possible to search the fulltext using multiple criteria, including names, different forms of the same word etc.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Böhm, Stanislav , Hajič, Jan , Srdečný, Vojtěch , Toman, Josef , and Košarko, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
A VIADAT module; the purpose of VIADAT-STAT is statistical analysis of recordings stored by the platform.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Böhm, Stanislav , Hajič, Jan , Srdečný, Vojtěch , Toman, Josef , and Košarko, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
A VIADAT module; the purpose of VIADAT-STAT is statistical analysis of recordings stored by the platform.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Böhm, Stanislav , Hajič, Jan , Srdečný, Vojtěch , Toman, Josef , and Košarko, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
infrastructure and toolService
Subject:
oral history , speech , and search
Language:
Czech
Description:
A VIADAT module; the purpose of VIADAT-TEXT is analysis of transcribed recordings.
Developed in cooperation with ÚSD AV ČR and NFA.
Rights:
BSD 3-Clause "New" or "Revised" license , http://opensource.org/licenses/BSD-3-Clause , and PUB
Creator:
Majliš, Martin
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
multilingual corpora
Language:
Afrikaans , Tosk Albanian , Amharic , Arabic , Aragonese , Egyptian Arabic , Asturian , Azerbaijani , Belarusian , Bengali , Bosnian , Bishnupriya , Breton , Buginese , Bulgarian , Catalan , Cebuano , Czech , Chuvash , Corsican , Welsh , Danish , German , Dimli (individual language) , Modern Greek (1453-) , English , Esperanto , Estonian , Basque , Faroese , Persian , Finnish , French , Western Frisian , Gan Chinese , Scottish Gaelic , Irish , Galician , Gilaki , Gujarati , Haitian , Serbo-Croatian , Hebrew , Fiji Hindi , Hindi , Croatian , Upper Sorbian , Hungarian , Armenian , Ido , Interlingua (International Auxiliary Language Association) , Indonesian , Icelandic , Italian , Javanese , Japanese , Kannada , Georgian , Kazakh , Korean , Kurdish , Latin , Latvian , Limburgan , Lithuanian , Lombard , Luxembourgish , Malayalam , Marathi , Macedonian , Malagasy , Mongolian , Maori , Malay (macrolanguage) , Burmese , Neapolitan , Low German , Nepali (macrolanguage) , Newari , Dutch , Norwegian Nynorsk , Norwegian , Occitan (post 1500) , Ossetian , Pampanga , Piemontese , Polish , Portuguese , Quechua , Romanian , Russian , Yakut , Sicilian , Scots , Slovak , Slovenian , Spanish , Albanian , Serbian , Sundanese , Swahili (macrolanguage) , Swedish , Tamil , Tatar , Telugu , Tajik , Tagalog , Thai , Turkish , Ukrainian , Urdu , Uzbek , Venetian , Vietnamese , Volapük , Waray (Philippines) , Walloon , Yiddish , Yoruba , and Chinese
Description:
A set of corpora for 120 languages automatically collected from wikipedia and the web.
Collected using the W2C toolset: http://hdl.handle.net/11858/00-097C-0000-0022-60D6-1
Rights:
Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0) , http://creativecommons.org/licenses/by-sa/3.0/ , and PUB
Creator:
Hoang, Duc Tam and Bojar, Ondřej
Publisher:
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:
text and corpus
Subject:
test data , parallel corpus , and Vietnamese
Language:
Vietnamese , Czech , English , German , French , Spanish , and Russian
Description:
We provide the Vietnamese version of the multi-lingual test set from WMT 2013 [1] competition. The Vietnamese version was manually translated from English. For completeness, this record contains the 3000 sentences in all the WMT 2013 original languages (Czech, English, French, German, Russian and Spanish), extended with our Vietnamese version. Test set is used in [2] to evaluate translation between Czech, English and Vietnamese.
References
1. http://www.statmt.org/wmt13/evaluation-task.html
2. Duc Tam Hoang and Ondřej Bojar, The Prague Bulletin of Mathematical Linguistics. Volume 104, Issue 1, Pages 75--86, ISSN 1804-0462. 9/2015
Rights:
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) , http://creativecommons.org/licenses/by-nc-sa/4.0/ , and PUB