Harvested from: LINDAT/CLARIAH-CZ repository - LINDAT/CLARIAH-CZ Catalog Search Results

821. Generator of Czech lyrics according to structure

Creator:: Štěpánková, Barbora
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: tool and toolService
Subject:: Song lyrics generation
Language:: Czech
Description:: Fine-tuned Czech TinyLlama model (https://huggingface.co/BUT-FIT/CSTinyLlama-1.2B) and Czech GPT2 small model (https://huggingface.co/lchaloupsky/czech-gpt2-oscar) for generating lyrics of song sections from the provided syllable counts, keywords and rhyme scheme. The TinyLlama-based model yields better results; the GPT2-based model, however, can run locally. Both models are discussed in the bachelor thesis "Generation of Czech Lyrics to Cover Songs".
Rights:: The MIT License (MIT), http://opensource.org/licenses/mit-license.php, and PUB

822. GerManC : A representative historical corpus of German 1650-1800

Type:: corpus
Language:: German
Description:: The ultimate aim of the project is to compile a representative historical corpus of written German for the years 1650-1800. The complete GerManC corpus will contain 2000-word samples from nine genres.
Rights:: Not specified

823. Gesprächanalytisches Informationssystem (GAIS)

Publisher:: Institut für Deutsche Sprache
Type:: toolService
Language:: German
Description:: Web-based information system covering the scientific community (news, events, persons, job market, mailing list, database on research projects and corpora, bibliography, glossary and links) and recording equipment/software. Disciplinary scope: research on conversation and discourse analysis and spoken language.
Rights:: Not specified

827. Glossa corpus search system

Creator:: Nøklestad, Anders
Publisher:: Department of Linguistics and Nordic Studies, University of Oslo
Type:: toolService
Description:: Glossa is a web-based system for corpus search and results management. It comes with built-in support for CLARIN federated content search as well as corpora encoded with the IMS Corpus Workbench. It also has a plugin architecture that enables other search engines to be used once a wrapper has been created. Glossa can be freely downloaded and installed on the user's server. It currently supports only monolingual written corpora, but support for multilingual corpora is under development, as is support for spoken corpora with audio, video and maps.
Rights:: Not specified

828. Gold Standard Reference Data for Multiword Expression Extraction: Czech Dependency Bigrams from the Prague Dependency Treebank

Creator:: Pecina, Pavel
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text, lexicalConceptualResource, and computationalLexicon
Subject:: multiword expressions
Language:: Czech
Description:: Annotated list of dependency bigrams occurring in the PDT more than five times and having part-of-speech patterns that can possibly form a collocation. Each bigram is assigned to one of the six MWE categories by three annotators.
Rights:: Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0), http://creativecommons.org/licenses/by-nc/3.0/, and PUB

829. Grammatisches Informationssystem (grammis)

Creator:: Strecker, Bruno, Schneider, Roman, and Konopka, Marek
Publisher:: Institut für Deutsche Sprache
Type:: lexicalConceptualResource
Language:: German
Description:: Web information system on German grammar; contains, e.g., a linked terminological knowledge base. Format: XML.
Rights:: Not specified

830. GrandStaff-LMX: Linearized MusicXML Encoding of the GrandStaff Dataset

Creator:: Mayer, Jiří, Straka, Milan, Hajič jr., Jan, and Pecina, Pavel
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: image and corpus
Subject:: GrandStaff, pianoform scores, MusicXML, and Linearized MusicXML
Language:: No linguistic content
Description:: The GrandStaff-LMX dataset is based on the GrandStaff dataset described in the paper "End-to-end optical music recognition for pianoform sheet music" by Antonio Ríos-Vila et al., 2023, https://doi.org/10.1007/s10032-023-00432-z. The GrandStaff-LMX dataset contains MusicXML and Linearized MusicXML encodings of all systems from the original dataset, suitable for evaluation with the TEDn metric. It also contains the official GrandStaff train/dev/test split.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

824. Gestor de diccionaris

825. Géza Včelička, true name Antonín Eduard Včelička (writer)

826. Giuseppe Dalla Torre on Czechoslovakia

