Universal Segmentations (UniSegments) is a collection of lexical resources capturing morphological segmentations harmonised into a cross-linguistically consistent annotation scheme for many languages. The annotation scheme consists of simple tab-separated columns that stores a word and its morphological segmentations, including pieces of information about the word and the segmented units, e.g., part-of-speech categories, type of morphs/morphemes etc. The current public version of the collection contains 38 harmonised segmentation datasets covering 30 different languages.
The segment of Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel), 1938, issue no. 22 captures the festivities held in Líšná u Zbiroha on 15 May 1938 in honour of the old Opatrný and Mudra farming families. The report shows the unveiling of commemorative plaques for both families, and audio records of the speeches given by the eldest member of the Opatrný family, Mr. Mudra, and MP Rudolf Beran, the Chairman of the Agrarian Party.
Director Václav Binovec and actress Blanka Waleská in the Theatre and Film Department of the Municipal Library in Prague in a segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1942, issue no. 49. Footage of Binovec and actor Vladimir Chinkulov Vladimírov during the shooting of the film Plameny života (The Flames of Life, 1920) at the Weteb Film Studio. Binovec meeting Soviet actor Ivan Mozzhukhin in Prague. Binovec shows off his muscles on Bohumil Veselý's balcony.
Film director Václav Krška accepts a commendation as a model worker in a news segment from Československý filmový týdeník (Czechoslovak Film Weekly Newsreel) 1953, issue no. 5. At a table with actress Zdeňka Gráfová. With actor Eduard Cupák during the shooting of Mladá léta (Youthful Years, dir. Václav Krška, 1952).
Conductor Václav Talich with the Czech Philharmonic recording Petr Ilyich Tchaikovsky´s Pathétique Symphony on a vinyl record in a fragmented segment from Československý filmový týdeník (Czechoslovak Film Weekly Newsreel) 1953, issue no. 36. Talich in a newsreel segment from the documentary Velcí hudebníci a zpěváci (Great Musicians and Singers, 1931), now considered lost. Talich with Stanislav Mojžíž-Lom in his office in the National Theatre. Talich during the opening of the Municipal Theatre in Písek on 1 February 1940 in a segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1940, issue no. 7B. Excerpt from the speech given by Talich during the presentation of National Awards at the Municipal House in Prague on 12 July 1942 in a fragmented segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1942, issue no. 29.
Actor Václav Vydra Jr. with his wife, actress Dana Medřická, during the election of National Front candidates in a segment from Československý filmový týdeník (Czechoslovak Film Weekly Newsreel) 1954, issue no. 21. The film obituary of actor Václav Vydra Sr. in a fragmented segment from Československý filmový týdeník (Czechoslovak Film Weekly Newsreel) 1953, issue no. 17. Actor Václav Vydra Sr. in Zlatý klíček (The Gold Key, dir. Jaroslav Kvapil, 1922).
The Valency Lexicon of Czech Verbs, Version 2.5 (VALLEX 2.5), is a collection of linguistically annotated data and documentation, resulting from an attempt at formal description of valency frames of Czech verbs. VALLEX 2.5 has been developed at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Prague.
VALLEX 2.5 provides information on the valency structure (combinatorial potential) of verbs in their particular senses - there are roughly 2,730 lexeme entries containing together around 6,460 lexical units ("senses"). and LC 536 - Center for Computational Linguistics, 1ET100300517 and 1ET101120503.
VALLEX 3.0 provides information on the valency structure (combinatorial potential) of verbs in their particular senses, which are characterized by glosses and examples. VALLEX 3.0 describes almost 4 600 Czech verbs in more than 10 800 lexical units, i.e., given verbs in the given senses.
VALLEX 3.0 is a is a collection of linguistically annotated data and documentation, resulting from an attempt at formal description of valency frames of Czech verbs. In order to satisfy different needs of different potential users, the lexicon is distributed (i) in a HTML version (the data allows for an easy and fast navigation through the lexicon) and (ii) in a machine-tractable form as a single XML file, so that the VALLEX data can be used in NLP applications.
VALLEX 4.0 provides information on the valency structure (combinatorial potential) of verbs in their particular senses; each sense is by a gloss and examples. VALLEX 4.0 describes almost 4 700 Czech verbs in more than 11 000 lexical units, i.e., given verbs in the given senses. VALLEX 4.0 is a is a collection of linguistically annotated data and documentation, resulting from an attempt at formal description of valency frames of Czech verbs. In order to satisfy different needs of different potential users, the lexicon is distributed (i) in a HTML version (the data allows for an easy and fast navigation through the lexicon) and (ii) in a machine-tractable form, so that the VALLEX data can be used in NLP applications. VALLEX 4.0 provides (in addition to information from previous versions) also characteristics of verbs expressing reciprocity and reflexivity.
The data is provided in two formats: XML and JSON.