Original context has metadata only: false / Rights: PUB - LINDAT/CLARIAH-CZ Catalog Search Results

571. Gabriela Preissová (writer)

Creator:: Veselý, Bohumil
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: park městský, límec kožešinový, narozeniny Preissová Gabriela 70., Galerie osobností, People::Preissová Gabriela (1862-1946), and People::Preissová Adriena (1915-2009)
Language:: No linguistic content
Description:: A segment to mark the 70th birthday of writer Gabriela Preissová. Preissová having a walk with her granddaughter Adriana.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

572. GECCC Grammar Error Correction Corpus for Czech

Creator:: Náplava, Jakub, Straka, Milan, Straková, Jana, and Rosen, Alexandr
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: gec, grammatical error correction, and dataset
Language:: Czech
Description:: Grammar Error Correction Corpus for Czech (GECCC) consists of 83 058 sentences and covers four diverse domains, including essays written by native students, informal website texts, essays written by Romani ethnic minority children and teenagers and essays written by nonnative speakers. All domains are professionally annotated for GEC errors in a unified manner, and errors were automatically categorized with a Czech-specific version of ERRANT released at https://github.com/ufal/errant_czech The dataset was introduced in the paper Czech Grammar Error Correction with a Large and Diverse Corpus that was accepted to TACL. Until published in TACL, see the arXiv version: https://arxiv.org/pdf/2201.05590.pdf
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), PUB, and http://creativecommons.org/licenses/by-sa/4.0/

573. GECCC Grammar Error Correction Corpus for Czech (2022-09-28)

Creator:: Náplava, Jakub, Straka, Milan, Straková, Jana, and Rosen, Alexandr
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: gec, grammatical error correction, and dataset
Language:: Czech
Description:: Grammar Error Correction Corpus for Czech (GECCC) consists of 83 058 sentences and covers four diverse domains, including essays written by native students, informal website texts, essays written by Romani ethnic minority children and teenagers and essays written by nonnative speakers. All domains are professionally annotated for GEC errors in a unified manner, and errors were automatically categorized with a Czech-specific version of ERRANT released at https://github.com/ufal/errant_czech The dataset was introduced in the paper Czech Grammar Error Correction with a Large and Diverse Corpus that was accepted to TACL. Until published in TACL, see the arXiv version: https://arxiv.org/pdf/2201.05590.pdf This version fixes double annotation errors in train and dev M2 files, and also contains more metadata information.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), PUB, and http://creativecommons.org/licenses/by-sa/4.0/

574. Gender-fair language on the websites of German, Austrian, Swiss and South Tyrolean cities

Creator:: Müller-Spitzer, Carolin and Ochs, Samira
Publisher:: IDS Mannheim
Type:: text and corpus
Subject:: gender-fair language, websites, personal designations, gender-inclusive language, and gender linguistics
Language:: German
Description:: Annotated dataset consisting of personal designations found on websites of 42 German, Austrian, Swiss and South Tyrolean cities. Our goal is to re-evaluate the websites every year in order to see how the use of gender-fair language develops over time. The dataset contains coordinates for the creation of map material.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

575. General Syrový Addresses Citizens

Creator:: Aktualita
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: projev Syrový Jan, Mnichovská dohoda, and People::Syrový Jan (1888-1970)
Language:: Czech
Description:: The segment of Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) from late September 1938 captures the recording of a radio speech given by General Jan Syrový to accept his appointment to the office of Prime Minister on 22 September 1938, in which he responds to the national demonstration for the unity of Czechoslovakia held in front of the Parliament building in Prague. He urges the demonstrators, as well as all citizens, to remain calm and sensible and to return to work.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

576. Generator of Czech lyrics according to structure

Creator:: Štěpánková, Barbora
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: tool and toolService
Subject:: Song lyrics generation
Language:: Czech
Description:: Fine-tuned Czech TinyLlama model (https://huggingface.co/BUT-FIT/CSTinyLlama-1.2B) and Czech GPT2 small model (https://huggingface.co/lchaloupsky/czech-gpt2-oscar) to generate lyrics of song sections based on the provided syllable counts, keywords and rhyme scheme. The TinyLlama-based model yields better results, however, the GPT2-based model can run locally. Both models are discussed in a Bachelor Thesis: Generation of Czech Lyrics to Cover Songs.
Rights:: The MIT License (MIT), http://opensource.org/licenses/mit-license.php, and PUB

577. Géza Včelička, true name Antonín Eduard Včelička (writer)

Creator:: Veselý, Bohumil
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: Galerie osobností, Places::Praha::Nové Město::Školská::pavlač domu, People::Včelička Géza (1901-1966), and People::Včeličková-Kučerová Daniela (1946-)
Language:: No linguistic content
Description:: Writer Géza Včelička, first on his own and later with his wife and his daughter Daniela on Bohumil Veselý's balcony.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

578. Giuseppe Dalla Torre on Czechoslovakia

Creator:: Aktualita
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: projev Dalla Tórre Giuseppe, vztahy mezinárodní Itálie-ČSR, vztahy mezinárodní ČSR-Itálie, Mnichovská dohoda, People::Dalla Tórre di Sanguinetto Giuseppe (1885-1967), and Československý zvukový týdeník Aktualita::1938/28
Language:: Czech
Description:: The segment of Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel), 1938, issue no. 28 reports on the visit of Giuseppe Dalla Torre, the editor-in-chief of the Vatican City State´s daily newspaper of L´Osservatorio Romano, to Czechoslovakia.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

579. Gold Standard Reference Data for Multiword Expression Extraction: Czech Dependency Bigrams from the Prague Dependency Treebank

Creator:: Pecina, Pavel
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text, lexicalConceptualResource, and computationalLexicon
Subject:: multiword expressions
Language:: Czech
Description:: Annotated list of dependency bigrams occurring in the PDT more than five times and having part-of-speech patterns that can possibly form a collocation. Each bigram is assigned to one of the six MWE categories by three annotators.
Rights:: Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0), http://creativecommons.org/licenses/by-nc/3.0/, and PUB

580. GrandStaff-LMX: Linearized MusicXML Encoding of the GrandStaff Dataset

Creator:: Mayer, Jiří, Straka, Milan, Hajič jr., Jan, and Pecina, Pavel
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: image and corpus
Subject:: GrandStaff, pianoform scores, MusicXML, and Linearized MusicXML
Language:: No linguistic content
Description:: The GrandStaff-LMX dataset is based on the GrandStaff dataset described in the "End-to-end optical music recognition for pianoform sheet music" paper by Antonio Ríos-Vila et al., 2023, https://doi.org/10.1007/s10032-023-00432-z . The GrandStaff-LMX dataset contains MusicXML and Linearized MusicXML encodings of all systems from the original datase, suitable for evaluation with the TEDn metric. It also contains the GrandStaff official train/dev/split.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

571. Gabriela Preissová (writer)

572. GECCC Grammar Error Correction Corpus for Czech

573. GECCC Grammar Error Correction Corpus for Czech (2022-09-28)

574. Gender-fair language on the websites of German, Austrian, Swiss and South Tyrolean cities

575. General Syrový Addresses Citizens

576. Generator of Czech lyrics according to structure

577. Géza Včelička, true name Antonín Eduard Včelička (writer)

578. Giuseppe Dalla Torre on Czechoslovakia

579. Gold Standard Reference Data for Multiword Expression Extraction: Czech Dependency Bigrams from the Prague Dependency Treebank

580. GrandStaff-LMX: Linearized MusicXML Encoding of the GrandStaff Dataset

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Creator

Show values starting with

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Date

Original context has metadata only

Harvested from