Harvested from: LINDAT/CLARIAH-CZ repository / Language: German - LINDAT/CLARIAH-CZ Catalog Search Results

51. Czech Malach Cross-lingual Speech Retrieval Test Collection

Creator:: Galuščáková, Petra, Pecina, Pavel, Hoffmannová, Petra, Hajič, Jan, Ircing, Pavel, and Švec, Jan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: audio and corpus
Subject:: annotated corpus, corpus, speech corpus, annotation, audio, and multilingual
Language:: Czech, English, French, German, and Spanish
Description:: The package contains Czech recordings from the Visual History Archive, which consists of interviews with Holocaust survivors. The archive consists of audio recordings, four types of automatic transcripts, manual annotations of selected topics, and interview metadata. In total, the archive contains 353 recordings and 592 hours of interviews.
Rights:: Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB

52. Damen Conversations Lexikon

Type:: lexicalConceptualResource
Subject:: Germanistik
Language:: German
Description:: New typesetting and facsimile of the ten-volume edition (Leipzig, 1834-1838); word-accurate page concordance with the printed edition; presentation of the subject areas of social conversation (aimed specifically at a female readership)
Rights:: Not specified

53. DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking

Creator:: Kubeša, David and Straka, Milan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: entity linking, NEL, NER, dataset, and knowledge base
Language:: Afrikaans, Arabic, Armenian, Basque, Belarusian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Maltese, Marathi, Modern Greek (1453-), Northern Sami, Norwegian Nynorsk, Persian, Polish, Portuguese, Romanian, Russian, Scottish Gaelic, Serbian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, Uighur, Ukrainian, Urdu, Vietnamese, and Wolof
Description:: We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains language-agnostic information about entities, including their claims from Wikidata and named entity types (PER, ORG, LOC, EVENT, BRAND, WORK_OF_ART, MANUFACTURED); and Wikipedia texts with entity mentions linked to the knowledge base, along with language-specific text from Wikidata such as labels, aliases, and descriptions, stored separately for each language. The Wikidata QID is used as a persistent, language-agnostic identifier, enabling the combination of the knowledge base with language-specific texts and information for each entity. Wikipedia documents deliberately annotate only a single mention for every entity present; we further automatically detect all mentions of named entities linked from each document. The dataset contains 27.9M named entities in the knowledge base and 12.3G tokens from Wikipedia texts. The dataset is published under the CC BY-SA licence.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

54. Das Deutsche Wörterbuch von Jacob und Wilhelm Grimm

Publisher:: Kompetenzzentrum für elektronische Erschließungs- und Publikationsverfahren in den Geisteswissenschaften
Type:: lexicalConceptualResource
Language:: German
Description:: Online edition of the Grimm brothers' "Deutsches Wörterbuch" (1838). Each entry shows the Grimms' etymological sources. Also available on CD-ROM.
Rights:: Not specified

55. Das virtuelle Preußische Urkundenbuch

Publisher:: Universität Hamburg
Type:: corpus
Subject:: Germanistik
Language:: German
Description:: Register of decrees as well as texts on the history of Prussia and the Teutonic Order
Rights:: Not specified

56. Database of Bavarian Dialects (BayDat)

Creator:: Zimmermann, Ralf, Raaf, Manuel, König, Werner, Eichinger, Ludwig M., Eroms, Hans-Werner, Wolf, Norbert Richard, Munske, Horst Haider, and Hinderling, Robert
Publisher:: Bayerische Akademie der Wissenschaften
Type:: text and corpus
Subject:: Bavarian, Swabian, Germanistik, Dialektologie, dialect variation, dialectology, Bairisch, Fränkisch, Schwäbisch, Bayern, Sprachatlas von Unterfranken, Sprachatlas von Mittelfranken, Sprachatlas von Bayerisch-Schwaben, Sprachatlas von Oberbayern, Bayerischer Sprachatlas, BSA, Sprachatlas von Nordostbayern, and Sprachatlas von Niederbayern
Language:: Bavarian, Swabian, Frankish, and German
Description:: The database contains about 5 million dialectal linguistic records collected in different projects within the Free State of Bavaria, covering the Bavarian, Frankish, and Swabian dialects. In 1984, linguists at the University of Augsburg began to collect dialect data for the research and documentation project "Linguistic Map of Swabia" (German: "Sprachatlas von Bayerisch-Schwaben (SBS)"). In 1986, the University of Bayreuth followed with preparations for the "Linguistic Map of North- and East-Bavaria" (German: "Sprachatlas von Nordostbayern (SNOB)"). In the following years, partner projects in the other regions also started to collect data in their respective regions. All six language projects then formed the "Research Association of the Bavarian Linguistic Map" (German: "Bayerischer Sprachatlas (BSA)"), which was funded by the DFG and the Bavarian State Ministry of Science, Research and the Arts. The first digital publication of BayDat, by Ralf Zimmermann in 2007 at the University of Würzburg (see linked paper), was redesigned in 2019 by Manuel Raaf at the Bavarian Academy of Sciences and Humanities. For detailed information, please see https://baydat.badw.de/info
Rights:: Not specified

57. DDR-Korpus

Publisher:: Berlin-Brandenburg Academy of Sciences and Humanities
Format:: application/tei+xml
Type:: corpus
Language:: German
Description:: 9 million words in 1,150 texts from the GDR, written between 1949 and 1990. Part of the DWDS project.
Rights:: Not specified

58. Deep Universal Dependencies 2.4

59. Deep Universal Dependencies 2.5

60. Deep Universal Dependencies 2.6
