Harvested from: LINDAT/CLARIAH-CZ repository - LINDAT/CLARIAH-CZ Catalog Search Results

1801. Slavic Forest, Norwegian Wood (scripts)

Creator:: Rosa, Rudolf, Zeman, Daniel, Mareček, David, and Žabokrtský, Zdeněk
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: suiteOfTools and toolService
Subject:: parsing, dependency parser, universal dependencies, and cross-lingual parsing
Language:: Czech, Slovak, Slovenian, Croatian, Danish, Swedish, and Norwegian
Description:: Tools and scripts used to create the cross-lingual parsing models submitted to VarDial 2017 shared task (https://bitbucket.org/hy-crossNLP/vardial2017), as described in the linked paper. The trained UDPipe models themselves are published in a separate submission (https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-1971). For each source (SS, e.g. sl) and target (TT, e.g. hr) language, you need to add the following into this directory: - treebanks (Universal Dependencies v1.4): SS-ud-train.conllu TT-ud-predPoS-dev.conllu - parallel data (OpenSubtitles from Opus): OpenSubtitles2016.SS-TT.SS OpenSubtitles2016.SS-TT.TT !!! If they are originally called ...TT-SS... instead of ...SS-TT..., you need to symlink them (or move, or copy) !!! - target tagging model TT.tagger.udpipe All of these can be obtained from https://bitbucket.org/hy-crossNLP/vardial2017 You also need to have: - Bash - Perl 5 - Python 3 - word2vec (https://code.google.com/archive/p/word2vec/); we used rev 41 from 15th Sep 2014 - udpipe (https://github.com/ufal/udpipe); we used commit 3e65d69 from 3rd Jan 2017 - Treex (https://github.com/ufal/treex); we used commit d27ee8a from 21st Dec 2016 The most basic setup is the sl-hr one (train_sl-hr.sh): - normalization of deprels - 1:1 word-alignment of parallel data with Monolingual Greedy Aligner - simple word-by-word translation of source treebank - pre-training of target word embeddings - simplification of morpho feats (use only Case) - and finally, training and evaluating the parser Both da+sv-no (train_ds-no.sh) and cs-sk (train_cs-sk.sh) add some cross-tagging, which seems to be useful only in specific cases (see paper for details). Moreover, cs-sk also adds more morpho features, selecting those that seem to be very often shared in parallel data. The whole pipeline takes tens of hours to run, and uses several GB of RAM, so make sure to use a powerful computer.
Rights:: GNU General Public License 2 or later (GPL-2.0), http://opensource.org/licenses/GPL-2.0, and PUB

1802. Slávka Procházková (opera singer)

Creator:: Veselý, Bohumil
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: Galerie osobností, Places::Praha::Nové Město::Školská::pavlač domu, People::Procházková Slávka (1912-1978), and People::Hájek Karel (1900-1978)
Language:: No linguistic content
Description:: Opera singer Slávka Procházková with her daughter, husband (photographer Karel Hájek), and an unidentified woman on Bohumil Veselý's balcony.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

1803. Slavonic Colour Lexicon

Publisher:: University of Surrey, Surrey Morphology Group
Type:: lexicalConceptualResource
Description:: Full report on the research activities and results of the project: Predicting the past: reconstructing the Slavonic colour lexicon
Rights:: Not specified

1804. Slovak Demonstration for the Unity of Czechoslovakia

Creator:: Aktualita
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: manifestace za jednotu ČSR, tábor lidu, lidé v krojích, vlajky československé, projev veřejný, přehlídka vojenská, transparenty v průvodu, Mnichovská dohoda, People::Dérer Ivan (1884-1973), People::Hodža Milan (1878-1944), and Český zvukový týdeník Aktualita::1938/24
Language:: Czech
Description:: The segment of Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel), 1938, issue no. 24 captures the demonstration for the unity of Czechoslovakia held on Hviezdoslav Square in Bratislava on 6 June 1938. Prime Minister Milan Hodža speaks at the demonstration (no sound). Ivan Dérer, the Minister of Justice, is also present.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

1805. Slovak Dependency Treebank

Creator:: Gajdošová, Katarína, Šimková, Mária, and et al.
Publisher:: Jazykovedný ústav Ľ. Štúra Slovenskej akadémie vied
Type:: text and corpus
Subject:: dependency, treebank, syntax, and morphology
Language:: Slovak
Description:: Slovak Dependency Treebank (Slovenský závislostný korpus) was created as part of the Slovak National Corpus at the Ľ. Štúr Institute of the Slovak Academy of Sciences. The annotation follows the guidelines of the Prague Dependency Treebank (Czech), slightly modified in the spirit of Slovak grammatical tradition. Morphological tags, lemmas and dependency relations have been assigned manually to every word. The present dataset is a subset of the original treebank. We automatically selected the sentences where the two human annotators 100% agreed on the analysis. This increases the quality and trustworthiness of the data but it also results in selecting short sentences most of the time. An extended version may be published in the future when manually merged and checked annotation is available. The selected sentences have been converted to the CoNLL-X file format (original token IDs are preserved in the FEATS column). This PDT-style annotation will serve as the source for the first Slovak dataset in the Universal Dependencies (to be published separately).
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

1806. Slovak MorphoDiTa Models 170914

Creator:: Straka, Milan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text, mlmodel, and languageDescription
Subject:: MorphoDiTa, Slovak, morphological analysis, morphological generation, and PoS tagging
Language:: Slovak
Description:: Slovak models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex SK 170914 and the PoS tagger is trained on automatically translated Prague Dependency Treebank 3.0 (PDT).
Rights:: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB

1807. Slovene Dependency Treebank

Type:: corpus
Language:: Slovenian
Description:: 3,000 sentences, analytical structure (PDT)
Rights:: Not specified

1808. SMOR - German morphology

Publisher:: University of Stuttgart
Type:: toolService
Language:: German
Description:: SMOR is a wide-coverage German computational morphology with inflection, derivation, and compounding. The SMOR code excepted the stem lexicon are available under the GNU license. SMOR (without a stem lexicon) comes with the SFST tools.
Rights:: Not specified

1809. SnakeCLEF 2021

Creator:: Picek, Lukáš, Bolon, Isabelle, Durso, Andrew M., and Castañeda, Rafael Ruiz de
Publisher:: CEUR Workshop Proceedings (CEUR-WS.org)
Type:: IMAGE and corpus
Subject:: LifeCLEF, SnakeCLEF, global health, epidemiology, snake bite, snake, reptile, benchmark, biodiversity, machine learning, computer vision, and Classification
Language:: No linguistic content
Description:: The dataset with 409,679 images belonging to 772 snake species from 188 countries and all continents (386,006 images with labels targeted for development and 23,673 images without labels for testing). In addition, we provide a simple train/val (90% / 10%) split to validate preliminary results while ensuring the same species distributions. Furthermore, we prepared a compact subset (70,208 images) for fast prototyping. The test set data consists of 23,673 images submitted to the iNaturalist platform within the "first four months of 2021. All data were gathered from online biodiversity platforms (i.e., iNaturalist, HerpMapper) and further extended by data scraped from Flickr. The provided dataset has a heavy long-tailed class distribution, where the most frequent species (Thamnophis sirtalis) is represented by 22,163 images and the least frequent by just 10 (Achalinus formosanus).
Rights:: BSD 3-Clause "New" or "Revised" license, http://opensource.org/licenses/BSD-3-Clause, and PUB

1810. Social Aid of Refugees from the Borders

Creator:: Aktualita
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: uprchlíci z pohraničí, dohoda Mnichovská důsledky, akce charitativní, děti uprchlíků, Sudety 1938, akce Československý Červený kříž, tábor uprchlický, dary pro uprchlíky, sbírka pro uprchlíky, ošetřovatelka, akce České srdce, České srdce, Places::Kladno::tábor pro uprchlíky ze Sudet, Places::Praha::Smíchov::Drtinova::gymnázium, People::Beran Rudolf (1887-1954), People::Klumpar Vladislav (1893-1979), Český zvukový týdeník Aktualita::1938/31, Mnichovská dohoda, and Zdravotní a sociální péče
Language:: Czech
Description:: The segment from the 1938 Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) Issue No. 31 shows a camp of Czech refugees fleeing the German-occupied borders. Charity events organised by the Czechoslovak Red Cross and the charity initiative České srdce (Czech Heart) provided food, clothing, books, and toys for the refugee children. Politicians Rudolf Beran and Vladislav Klumpar visit the camp. The following footage shows items donated through the refugee collection organised in Drtinovo gymnázium (Comprehensive school Drtinova) in Prague's Smíchov district.
Rights:: Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB

1801. Slavic Forest, Norwegian Wood (scripts)

1802. Slávka Procházková (opera singer)

1803. Slavonic Colour Lexicon

1804. Slovak Demonstration for the Unity of Czechoslovakia

1805. Slovak Dependency Treebank

1806. Slovak MorphoDiTa Models 170914

1807. Slovene Dependency Treebank

1808. SMOR - German morphology

1809. SnakeCLEF 2021

1810. Social Aid of Refugees from the Borders

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Show values starting with

Creator

Show values starting with

Format

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Date

Original context has metadata only

Harvested from