Language: English - LINDAT/CLARIAH-CZ Catalog Search Results

Start Over Language English

45351. Woodrow Wilson and a revolutionary world, 1913-1921 /

Type:: text and sborníky
Subject:: Dějiny Spojených států amerických, Wilson, Woodrow,, prezidenti USA, politika zahraniční, USA, panovníci, panovnické rody, dvory, světové dějiny 1789-1918, and zahraniční politika, mezinárodní vztahy
Language:: English
Description:: "Papers ... originally presented at an international symposium at Princeton University from October 10 through October 12, 1979"--Introd
Rights:: unknown

45352. Woodrow Wilson and the American diplomatic tradition :

Creator:: Ambrosius, Lloyd E.,
Type:: text, monografie, and biografie
Subject:: Dějiny Spojených států amerických, Wilson, Woodrow,, politici USA, prezidenti USA, diplomacie, válka první světová (1914-1918), USA, politické dějiny, politici, světové dějiny 1789-1918, and světové dějiny 1918-1945
Language:: English
Rights:: unknown

45353. Woodrow Wilson and the American Diplomatic Tradition. The Treaty Fight in Perspective /

Creator:: Ambrosius, Loyd E.
Publisher:: Cambridge University Prass,
Subject:: Wilson, Woodrow,, politici USA, prezidenti USA, biografie, diplomacie, válka první světová (1914-1918), politické dějiny, politici, světové dějiny 1789-1918, světové dějiny 1918-1945, and USA
Language:: English
Rights:: unknown

45354. Word from the editors

Creator:: Bečka, Jan and Smetana, Vít
Format:: bez média and svazek
Type:: model:article and TEXT
Language:: English
Rights:: http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public

45355. Word Importance Dataset

Creator:: Osuský, Adam and Javorský, Dávid
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: word importance, ranking, and importance ranking
Language:: English
Description:: This dataset comprises a corpus of 50 text contexts, each about 60 words in length, sourced from five distinct domains. Each context has been evaluated by multiple annotators who identified and ranked the most important words—up to 10% of each text—according to their perceived significance. The annotators followed specific guidelines to ensure consistency in word selection and ranking. For further details, please refer to the cited source. --- rankings_task.csv - This csv contains information about the contexts which are to be annotated: - id: A unique identifier for each task. - content: The context to be ranked. --- rankings_ranking.csv - This csv includes ranking information for various assignments. It contains four columns: - id: A unique identifier for each ranking entry. - score: The score assigned to the entry. - word_order: A JSON detailing the order of words positions. It is essentially the selected word positions and their ordering from an annotator. - assignment_id: A reference ID linking to the assignments. --- rankings_assignment.csv - This csv tracks the completion status of tasks by users. It includes four columns: - id: A unique identifier for each assignment entry. - is_completed: A binary indicator (1 for completed, 0 for not completed). - task_id: A reference ID linking to the tasks. - user_id: The identifier for the user who should complete the task (rank the words). --- Known Issues: Please note that each annotator was intended to rank each context only once. However, due to a bug in the deployment of the annotation tool, some entries may be duplicated. Users of this dataset should be cautious of this issue and verify the uniqueness of the annotations where necessary. --- This dataset is a part of work from a bachelor thesis: OSUSKÝ, Adam. Predicting Word Importance Using Pre-Trained Language Models. Bachelor thesis, supervisor Javorský, Dávid. Prague: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics, 2024.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

45356. Word Order Variation in John Malalas' Chronicle /

Creator:: Bočková Loudová, Kateřina,
Type:: text
Subject:: Ióannés Malalas,, jazyk řecký, kroniky byzantské, světové dějiny středověku (do r. 1492), and jazyk, písmo
Language:: English
Rights:: unknown

45357. Word representations for multiple languages

Creator:: Müller, Thomas and Schütze, Hinrich
Publisher:: Center for Information and Language Processing, University of Munich
Type:: text and corpus
Subject:: morphological dictionary, morphological analysis, and PoS tagging
Language:: English, German, Latin, Hungarian, Spanish, and Czech
Description:: Dictionaries with different representations for various languages. Representations include brown clusters of different sizes and morphological dictionaries extracted using different morphological analyzers. All representations cover the most frequent 250,000 word types on the Wikipedia version of the respective language. Analzers used: MAGYARLANC (Hungarian, Zsibrita et al. (2013)), FREELING (English and Spanish, Padro and Stanilovsky (2012)), SMOR (German, Schmid et al. (2004)), an MA from Charles University (Czech, Hajic (2001)) and LATMOR (Latin, Springmann et al. (2014)).
Rights:: Creative Commons - Attribution 3.0 Unported (CC BY 3.0), http://creativecommons.org/licenses/by/3.0/, and PUB

45358. Word-final /s/ durations in spoken German

Creator:: Luef, Eva Maria
Publisher:: Charles University and Universität Hamburg
Type:: text and corpus
Subject:: sibilant, acoustics, word-final s, duration, and German
Language:: English
Description:: German has various homophonous sibilant fricatives of phonemic or morphemic nature that can appear in word-final position. In English, the functional status of a word-final \s\ influences its durational properties, with phonemic \s\ being longer than morphemic types. The data set presented here is a small selection of laboratory-elicited German sentences containing various words with final sibilant phonemes (e.g., "das Haus") and morphemes (plural, genitive, clitic, inflection). Durations of the \s\ types were measured and compared across the conditions. An ANOVA between the \s\ types and post-hoc Tukey pair-wise comparisons are presented that show various significant differences. The submission consists of a csv data file, containing a number of variables, and a PDF document detailing the experiment and variables.
Rights:: Creative Commons - Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0), http://creativecommons.org/licenses/by-nd/4.0/, and PUB

45359. Words in Music - Music in Words /

Creator:: Doubravová, Jarmila,
Subject:: Erben, Karel Jaromír,, Dvořák, Antonín,, Novák, Vítězslav,, hudba česká, literatura česká, vztahy literatura-hudba, české země 1848-1914, and hudba, tanec, hudební nástroje
Language:: English
Rights:: unknown

45360. WordSim353-cs: Evaluation Dataset for Lexical Similarity and Relatedness, based on WordSim353

Creator:: Cinková, Silvie, Straková, Jana, Hajič, Jakub, Hajič, Jan, Hajič, Jan, jr., Janoušková, Jolana, Straka, Milan, and Urešová, Miroslava
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text, wordList, and lexicalConceptualResource
Subject:: lexical semantics, similarity, relatedness, evaluation, and distributional semantics
Language:: Czech and English
Description:: Czech translation of WordSim353. The Czech translation of English WordSim353 word pairs were obtained from four translators. All translation variants were scored according to the lexical similarity/relatedness annotation instructions for WordSim353 annotators, by 25 Czech annotators. The resulting data set consists of two annotation files: "WordSim353-cs.csv" and "WordSim-cs-Multi.csv". Both files are encoded in UTF-8, have a header, text is enclosed in double quotes, and columns are separated by commas. The rows are numbered. The WordSim-cs-Multi data set has rows numbered from 1 to 634, whereas the row indices in the WordSim353-cs data set reflect the corresponding row numbers in the WordSim-cs-Multi data set. The WordSim353-cs file contains a one-to-one mapping selection of 353 Czech equivalent pairs whose judgments have proven to be most similar to the judgments of their corresponding English originals (compared by the absolute value of the difference between the means over all annotators in each language counterpart). In one case ("psychology-cognition"), two Czech equivalent pairs had identical means as well as confidence intervals, so we randomly selected one. The "WordSim-cs-Multi.csv" file contains human judgments for all translation variants. In both data sets, we preserved all 25 individual scores. In the WordSim353-cs data set, we added a column with their Czech means as well as a column containing the original English means and 95% confidence intervals in separate columns for each mean (computed by the CI function in the Rmisc R package). The WordSim-cs-Multi data set contains only the Czech means and confidence intervals. For the most convenient lexical search, we provided separate columns with the respective Czech and English single words, entire word pairs, and eventually an English-Czech quadruple in both data sets. The data set also contains an xls table with the four translations and a preliminary selection of the best variants performed by an adjudicator.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

45351. Woodrow Wilson and a revolutionary world, 1913-1921 /

45352. Woodrow Wilson and the American diplomatic tradition :

45353. Woodrow Wilson and the American Diplomatic Tradition. The Treaty Fight in Perspective /

45354. Word from the editors

45355. Word Importance Dataset

45356. Word Order Variation in John Malalas' Chronicle /

45357. Word representations for multiple languages

45358. Word-final /s/ durations in spoken German

45359. Words in Music - Music in Words /

45360. WordSim353-cs: Evaluation Dataset for Lexical Similarity and Relatedness, based on WordSim353

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Show values starting with

Creator

Show values starting with

Format

Show values starting with

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Date

Original context has metadata only

Harvested from