Language: Czech / Rights: PUB - LINDAT/CLARIAH-CZ Catalog Search Results

31. Annotated Corpus of Czech Case Law for Segmentation Tasks

Creator:: Harašta, Jakub, Šavelka, Jaromír, Kasl, František, and Míšek, Jakub
Publisher:: Masaryk University, Brno
Type:: text and corpus
Subject:: document segmentation and legal texts
Language:: Czech
Description:: Annotated corpus of 350 decision of Czech top-tier courts (Supreme Court, Supreme Administrative Court, Constitutional Court). 280 decisions were annotated by one trained annotator and then manually adjudicated by one trained curator. 70 decisions were annotated by two trained annotators and then manually adjudicated by one trained curator. Adjudication was conducted destructively, therefore dataset contains only the correct annotations and does not contain all original annotations. Corpus was developed as training and testing material for text segmentation tasks. Dataset contains decision segmented into Header, Procedural History, Submission/Rejoinder, Court Argumentation, Footer, Footnotes, and Dissenting Opinion. Segmentation allows to treat different parts of text differently even if it contains similar linguistic or other features.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

32. Annotation of Dramatic Situations in Theater Play Scripts

Creator:: Mareček, David, Nováková, Marie, Vosecká, Klára, Doležal, Josef, and Rosa, Rudolf
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) and The Academy of Performing Arts in Prague, Theatre Faculty (DAMU)
Type:: text and corpus
Subject:: theatre, play script, and dramatic situation
Language:: Czech
Description:: We defined 58 dramatic situations and annotated them in 19 play scripts. Then we selected only 5 well-recognized dramatic situations and annotated further 33 play scripts. In this version of the data, we release only play scripts that can be freely distributed, which is 9 play scripts. One play is annotated independently by three annotators.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

33. Anti-tuberculosis Stations

Creator:: Aktualita
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: nemoc tuberkuloza, váha lékařská, pacient měření, lékař, rentgen, snímek rentgenový plíce, plíce snímek rentgenový, akce Týden národního zdraví, Týden národního zdraví akce, vůz rentgenový, žáci u rentgenového vozu, solarium, děti ozařované, sestry zdravotní, slunce horské, brýle ochranné, Liga proti tuberkulose akce, akce Liga proti tuberkulose, Protektorát zdravotnictví, Places::Moravská Ostrava viz Ostrava, Places::Ostrava::Lidový sociálně zdravotní ústav /ext.,int./, Český zvukový týdeník Aktualita::1942/18AB, Heydrichiáda, and Zdravotní a sociální péče
Language:: Czech
Description:: The segment from the 1942 Český zvukový týdeník Aktualita (Czech Aktualita Sound Newsreel) Issue No. 18 features the event Týden národního zdraví (A Week for National Health) organised by The Ministry of the Interior and The Health Institute of the Protectorate of Bohemia and Moravia from 3 to 10 May 1942. The official goal of the event was to advocate for the importance of healthcare. The report covers the establishment of anti-tuberculosis stations in a number of places around the Protectorate. Footage of the measuring of body height and weight of patients. A showcase of how an X-ray station in Moravská Ostrava operates. Footage of doctors working with X-ray machines. A close-up of an X-ray image of the lungs. The segment includes footage of mobile X-ray cars set up for the treatment of child patients. Footage from a solarium intended for irradiating children with sunlamps.
Rights:: Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB

34. Antonín Martin Brousil (vice-chancellor of Prague's Academy)

Creator:: Krátký film and Veselý, Bohumil
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: řetěz rektorský, cena MFF Karlovy Vary, festival filmový MFF karlovy Vary, Galerie osobností, People::Revueltas Rosaura (1910-1996), People::Brousil Antonín Martin (1907-1986), and People::Plicka Karel (1894-1986)
Language:: Czech
Description:: Antonín Martin Brousil, the vice-chancellor of Prague's Academy of Performing Arts, and Mexican actress Rosaura Revueltas at the 1954 Karlovy Vary International Film Festival in a fragmented segment from the weekly film newsreel.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

35. Antonín Pelc (painter)

Creator:: Krátký film and Veselý, Bohumil
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: ateliér malířský, narozeniny Pelc Antonín 60., Galerie osobností, People::Pelc Antonín (1895-1967), People::Záhořová Jarmila (1924-1958), and Československé filmové noviny 1952/43
Language:: Czech
Description:: Painter Antonín Pelc with his wife Jarmila Záhořová in the studio in a segment from Československé filmové noviny (Czechoslovak Film News) 1952, issue no. 43. The painter in his studio on the day of his 60th birthday in a segment from Československý filmový týdeník (Czechoslovak Film Weekly Newsreel) 1955, issue no. 4.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

36. Antonín Přecechtěl (otolaryngologist)

Creator:: Veselý, Bohumil
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: otorinolaryngologie, lékař při práci, Galerie osobností, Places::Praha::Klinika nemocí ušních::ústních a hrtanových, and People::Přecechtěl Antonín (1885-1971)
Language:: Czech
Description:: Professor and otolaryngologist Antonín Přecechtěl working at the clinic.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

37. Artificial Treebank with Ellipsis

Creator:: Droganova, Kira, Zeman, Daniel, Kanerva, Jenna, and Ginter, Filip
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: universal dependencies, ellipsis, and gapping
Language:: English, Czech, Finnish, Russian, and Slovak
Description:: Artificially created treebank of elliptical constructions (gapping), in the annotation style of Universal Dependencies. Data taken from UD 2.1 release, and from large web corpora parsed by two parsers. Input data are filtered, sentences are identified where gapping could be applied, then those sentences are transformed, one or more words are omitted, resulting in a sentence with gapping. Details in Droganova et al.: Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions, LREC 2018, Miyazaki, Japan.
Rights:: Licence Universal Dependencies v2.1, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.1, and PUB

38. Aspect-Term Annotated Customer Reviews in Czech

Creator:: Fiala, Ondřej
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: sentiment analysis, opinion target, and customer review
Language:: Czech
Description:: This dataset contains a number of user product reviews which are publicly available on the website of an established Czech online shop with electronic devices. Each review consists of negative and positive aspects of the product. This setting pushes the customer to rate important characteristics. We have selected 2000 positive and negative segments from these reviews and manually tagged their targets. Additionally, we selected 200 of the longest reviews and annotated them in the same way. The targets were either aspects of the evaluated product or some general attributes (e.g. price, ease of use).
Rights:: Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0), http://creativecommons.org/licenses/by-nc-sa/3.0/, and PUB

39. AudioPSP 24.01: Audio recordings of proceedings of the Chamber of Deputies of the Parliament of the Czech Republic

Creator:: Kopp, Matyáš
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: audio and corpus
Subject:: Parliament of the Czech Republic
Language:: Czech
Description:: This record contains audio recordings of proceedings of the Chamber of Deputies of the Parliament of the Czech Republic. The recordings have been provided by the official websites of the Chamber of Deputies, and the set contains them in their original format with no further processing. Recordings cover all available audio files from 2013-11-25 to 2023-07-26. Audio files are packed by year (2013-2023) and quarter (Q1-Q4) in tar archives audioPSP-YYYY-QN.tar. Furthermore, there are two TSV files: audioPSP-meta.quarterArchive.tsv contains metadata about archives, and audioPSP-meta.audioFile.tsv contains metadata about individual audio files.
Rights:: Public Domain Dedication (CC Zero), http://creativecommons.org/publicdomain/zero/1.0/, and PUB

40. Automatic Paraphrases of Czech Reference Sentences for WMT11, 13 and 14

Creator:: Barančíková, Petra and Tamchyna, Aleš
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: machine translation, automatic evaluation, and paraphrasing
Language:: Czech
Description:: This dataset contains automatic paraphrases of Czech official reference translations for the Workshop on Statistical Machine Translation shared task. The data covers the years 2011, 2013 and 2014. For each sentence, at most 10000 paraphrases were included (randomly selected from the full set). The goal of using this dataset is to improve automatic evaluation of machine translation outputs. If you use this work, please cite the following paper: Tamchyna Aleš, Barančíková Petra: Automatic and Manual Paraphrases for MT Evaluation. In proceedings of LREC, 2016.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

31. Annotated Corpus of Czech Case Law for Segmentation Tasks

32. Annotation of Dramatic Situations in Theater Play Scripts

33. Anti-tuberculosis Stations

34. Antonín Martin Brousil (vice-chancellor of Prague's Academy)

35. Antonín Pelc (painter)

36. Antonín Přecechtěl (otolaryngologist)

37. Artificial Treebank with Ellipsis

38. Aspect-Term Annotated Customer Reviews in Czech

39. AudioPSP 24.01: Audio recordings of proceedings of the Chamber of Deputies of the Parliament of the Czech Republic

40. Automatic Paraphrases of Czech Reference Sentences for WMT11, 13 and 14

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Creator

Show values starting with

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Date

Original context has metadata only

Harvested from