1 - 9 of 9
Number of results to display per page
Search Results
2. Arabic ACL corpus
- Creator:
- Salah Elfahal Elebaed, Hoyam, Kasbi, Mohammed, Nasri, Mohammed, and Bouzoubaa, Karim
- Publisher:
- International Journal of Computer Science Trends and Technology (IJCST)
- Type:
- text and corpus
- Subject:
- Controlled Natural Language, Arabic CNL, ACL, Arabic Corpus, and and TEI.
- Language:
- Arabic
- Description:
- This corpus constitutes all sentences representing the Arabic Controlled Language (ACL). It contains 551 sentences taken from four textbooks and websites dedicated to teach Arabic language to kids such as: a) First grade book, Republic of Sudan (كتاب الصف الاول جمهورية السودان), b) Al Jazeera Educational Site (موقع الجزيرة التعليمي), c) Bella Preparatory School Girls Forum (منتدى مدرسة بيلا الاعدادية بنات), and d) Albahr website (موقع انا البحر). These sentences are respecting 52 ACL rules. The average number of sentences for each rule is 10.6. All sentences in the corpus were analyzed by Farasa syntactic parser to confirm they are correctly analyzed. The validity of the parsing was done manually by linguist experts. The structure of this corpus is made of a header and a body. The header consists of a set of metadata that describe the corpus, such as the corpus name, the authors, the sources and further meta data. While the header is made of metadata, the body contains rules. Each rule has a code, a structure and all sentences respecting that rule. For each sentence, we store an id, the vowelledand unvowelled text as well as the result of parsing using Farasa.
- Rights:
- Not specified
3. Bavaria's Dialects Online
- Creator:
- Raaf, Manuel
- Publisher:
- Bayerische Akademie der Wissenschaften and Bavarian Academy of Sciences and Humanities
- Type:
- text, machineReadableDictionary, and lexicalConceptualResource
- Subject:
- dictionary, web dictionary, Dialektologie, dialect variation, language variation, Dialectology, dialectology, Bavarian, Bavaria, Swabian, Frankish, Franconian Language, and spoken language
- Language:
- German, Bavarian, Swabian, and Frankish
- Description:
- Bavaria's Dialects Online (BDO) is the digital language information system of the three projects "Bavarian Dictionary", "Franconian Dictionary", and "Dialectological Information System of Bavarian Swabia". The database combines the research results of dialect research and presents dictionary articles as well as research data in a freely accessible online tool. BDO is not only aimed at scholars, but also at the lay public interested in the language. Here, the vocabulary of all Bavarian dialects is collected in one place and made accessible. The system shows the richness of the dialects of Bavaria in combination. With the new database, one will be able to compare the dialect vocabulary of Old Bavaria, Franconia and Swabia. Authentic dialect evidence is used to illustrate the dialect words in their variety of meanings and regional distribution, as well as to show their use in idioms, proverbs, and much more. BDO allows a whole new look at the vocabulary of the dialects of all parts of the state of Bavaria.
- Rights:
- Not specified
4. Database of Bavarian Dialects (BayDat)
- Creator:
- Zimmermann, Ralf, Raaf, Manuel, König, Werner, Eichinger, Ludwig M., Eroms, Hans-Werner, Wolf, Norbert Richard, Munske, Horst Haider, and Hinderling, Robert
- Publisher:
- Bayerische Akademie der Wissenschaften
- Type:
- text and corpus
- Subject:
- Bavarian, Swabian, Germanistik, Dialektologie, dialect variation, dialectology, Bairisch, Fränkisch, Schwäbisch, Bayern, Sprachtatlas von Unterfranken, Sprachatlas von Mittelfranken, Sprachatlas von Bayerisch-Schwaben, Sprachatlas von Oberbayern, Bayerischer Sprachatlas, BSA, Sprachatlas von Nordostbayern, and Sprachtatlas von Niederbayern
- Language:
- Bavarian, Swabian, Frankish, and German
- Description:
- The database contains about 5 Million dialectal linguistic evidences collected in differend projects within the Free State of Bavaria to the dialects Bavarian, Frankish, and Swabian. In 1984, linguists at the University of Augsburg began to collect dialect data for the research and documentation project "Linguistic Map of Swabia" (German: "Sprachatlas von Bayerisch-Schwaben (SBS)"). In 1986, the University of Bayreuth followed with preparations for the "Linguistic Map of North- and East-Bavaria" (German: "Sprachatlas von Nordostbayern (SNOB)"). In the following years, partner projects of the other regions also started to collect data in their particular region. All six language projects then formed the "Research Association of the Bavarian Linguistic Map " (German: Bayerischer Sprachatlas (BSA)"), which was funded by the DFG and the Bavarian State Ministry of Science, Research and the Arts. The first digital publication of BayDat by Ralf Zimmermann in 2007 at the University of Würzburg (see linked paper) was re-designed in 2019 by Manuel Raaf at the Bavarian Academy of Sciences and Humanities. For detailed information, please see https://baydat.badw.de/info
- Rights:
- Not specified
5. Dictionary of Bavarian Dialects
- Creator:
- Schamberger-Hirt, Andrea, Erhard, Felicitas, Schnabel, Michael, Funk, Edith, Rowley, Anthony, and Schwab, Vincenz
- Publisher:
- Bayerisches Wörterbuch and Bayerische Akademie der Wissenschaften
- Type:
- text and corpus
- Subject:
- Bavaria, Bayern, Dialektologie, Dialekt, Dialectology, Bavarian, Bairisch, Bayerisch, dialect variation, Germanistik, German, Historical Linguistics, and History of German Language and Literature
- Language:
- Bavarian and German
- Description:
- The database offers access to over 6 million dialectal linguistic evidences of the project "Dictionary of Bavarian Dialects" (German: Das Bayerische Wörterbuch) as image snippets, partly and forthgoing lemmatized. The area covered by the Dictionary of Bavarian Dialects (Bayerisches Wörterbuch) comprises Upper Bavaria, Lower Bavaria, the Upper Palatinate and neighbouring regions of Bavarian Swabia, Middle Franconia and Upper Franconia. Over and above the vernaculars spoken today, Bavaria’s literary tradition since its beginnings in the 8th century is also taken into account. Starting in 1913, language material was collected from all Bavarian-speaking regions in Bavaria. Questionnaires were sent out to local informants throughout Bavaria, and contemporary and historical literary sources were excerpted. Today the collection comprises around nine million dialect examples. With the exception of the “Wörterlisten” (word lists), which can be digitally searched and edited, this material consists of index cards, to which corresponding standard German or quasi-standard German keywords have been added, filed alphabetically (see link below for more information). For detailed information, please see https://www.bwb.badw.de/en/the-project.html and https://www.bwb.badw.de/en/digital-platform.html
- Rights:
- Not specified
6. Dictionnaire de l'occitan médiéval (DOM)
- Creator:
- Claudia, Kraus, Stempel, Wolf-Dieter, Tausend, Monika, and Peter, Renate
- Publisher:
- Bavarian Academy of Sciences and Humanities and Bayerische Akademie der Wissenschaften
- Type:
- text, lexicon, and lexicalConceptualResource
- Subject:
- Emil Levy, Petit Levy, Lexique Roman, DOM, Occitian language, Medieval Occitan, Occitan, Old Occitan, Old Provençal, Romance languages, dictionary, etymology, Middle Ages, troubadours, lexicography, and Supplementwörterbuch
- Language:
- French and Old Provençal (to 1500)
- Description:
- In the Middle Ages, Old Occitan (formerly "Old Provençal"), the language of the troubadours, was a literary and cultural language, the influence of which extended far beyond the frontiers of Southern France. The only comprehensive portrayal of the Old Occitan vocabulary to have appeared up to now is the "Lexique roman" by François Raynouard (6 vols., 1836–1845). It was supplemented by Emil Levy’s "Provenzalisches Supplementwörterbuch" (8 vols., 1894–1924). An updated dictionary, taking account of progress in research over the last 100 years, has been the desideratum of literary scholars, linguists, and historians ever since. Under the direction of Wolf-Dieter Stempel, the publication of a new dictionary of Old Occitan, the "Dictionnaire de l'occitan médiéval (DOM)", began in 1996. This appeared in print until 2013, directed from 2012 on by Maria Selig. Since then it has been available as an alphabetically complete digital dictionary, the "DOM en ligne". This comprises the newly written articles of the DOM together with the articles from the dictionaries of Raynouard and Levy for those parts of the alphabet not yet covered by the new work and is enriched by entries for words absent till now from Old Occitan lexicography. Its content is available for free at https://dom-en-ligne.de/dom.php
- Rights:
- Not specified
7. The Diorisis Ancient Greek Corpus
- Creator:
- Vatri, Alessandro and McGillivray, Barbara
- Publisher:
- Figshare
- Type:
- text and corpus
- Subject:
- annotated corpus, ancient world, lemmatization, and part of speech
- Language:
- Ancient Greek (to 1453)
- Description:
- An annotated corpus of literary Ancient Greek sourced from the Perseus Canonical Greek Lit repository (https://github.com/PerseusDL/canonical-greekLit), “The Little Sailing” digital library (http://www.mikrosapoplous.gr/en/texts1en.html), and the Bibliotheca Augustana digital library (http://www.hs-augsburg.de/~harsch/augustana.html#gr). The corpus consists of 820 texts spanning between the beginnings of the AG literary tradition (Homer) and the fifth century AD, and it counts 10,206,421 words. In addition to referring to this resource, please use the following citation when citing the corpus: Vatri, A., & McGillivray, B. (2018). The Diorisis Ancient Greek Corpus, Research Data Journal for the Humanities and Social Sciences, 3(1), 55-65. doi: https://doi.org/10.1163/24523666-01000013
- Rights:
- Not specified
8. The Franconian Dictionary
- Creator:
- König, Almut, Klepsch, Alfred, Beyschlag, Siegfried, Habermann, Mechthild, Werner, Ottmar, Straßner, Erich, Wagner, Eberhard, and Grimm, Reinhold
- Publisher:
- Bayerische Akademie der Wissenschaften
- Type:
- text and corpus
- Subject:
- Frankish, German, Fränkisch, Dialekt, Dialektologie, Bayern, Bavaria, Germanistik, Dialectology, dialect variation, Franconian Language, and Ostfränkisch
- Language:
- Frankish, German, and Mainfränkisch
- Description:
- The database currently contains about 1 million dialectal linguistic evidences of the project "The Franconian Dictionary" (German: Das Fränkische Wörterbuch), each of which lemmatized, annotated, and linked to the original questionnaire. The database is work in progress, so there will be more data available regularly. The Franconian Dictionary was initiated by the Munich office of the Bavarian Dictionary project, sending questionnaires for a dialect survey in Franconia. In the wake of this survey an office in Erlangen was established in 1933 (see link below for more information). During the course of 90 years thousands of volunteers helped to compile a considerable collection of vernacular examples of usage, drawn from the Bavarian districts of Upper, Middle and Lower Frankonia. For the most part they represent the East Franconian dialect, to the lesser extent also Rhine-Franconian, Swabian and North-Bavarian vernaculars. Between 2007 and 2008 a small selection of the research results was published in three editions of one printed volume by Eberhard Wagner and Alfred Klepsch: “Handwörterbuch von Bayerisch-Franken” (see link below for more information). Since 2012 the Franconian Dictionary, a project of the Bavarian Academy of Sciences and Humanities, has been entrusted to the Friedrich-Alexander-University in Erlangen and Nuremberg (FAU). The project is supervised by Prof. Dr. Mechthild Habermann, Chair of the Faculty of German Linguistics at the FAU. For detailed information, please see http://www.wbf.badw.de/en/the-project.html and http://www.wbf.badw.de/en/wbf-digital.html
- Rights:
- Not specified
9. Thesaurus linguae Latinae
- Creator:
- Ammann, Andreas, Blundell, John, Gitner, Adam, Hillen, Michael, Hajdú, István, Holmes, Nigel, Kuper, Charles, van Leijenhorst, Cornelis G., Marchionni, Roberta, Meusel, Eduard, Ottink, Marijke, Pieroni, Paolo, Ramminger, Johann, Schrickx, Josine, Spoth, Friedrich, and Wick, Claudia
- Publisher:
- Bavarian Academy of Sciences and Humanities and Bayerische Akademie der Wissenschaften
- Type:
- text, thesaurus, and lexicalConceptualResource
- Subject:
- thesaurus, latin, thesaurus linguae latinae, dictionary, roman, ancient world, TLL, ThlL, and written language
- Language:
- Latin
- Description:
- The Thesaurus linguae Latinae is the first comprehensive dictionary of ancient Latin; • it is compiled on the basis of all Latin texts surviving from antiquity (until AD 600), both literary and non-literary • for less common words it cites every attestation, for the rest (those marked with an asterisk) an instructive and representative sample • it records all meanings (including technical usages) and all constructions • it documents peculiarities of inflection, spelling, and prosody • it supplies information about the etymology of the Latin words and their survival in the Romance languages, contributed by recognised authorities in the fields of Indo-European and Romance studies • it collects the comments of ancient sources on the word in question The Thesaurus therefore offers for every Latin word a comprehensive, richly documented picture of its possibilities and history – not only for Latin scholars, but also for scholars of the various branches of ancient studies and for related disciplines.
- Rights:
- Not specified