The contribution includes the data frame and the R script (Markdown file) belonging to the paper "Who Benefits from an Imperative? Assessment of Directives on a Benefit-Scale" submitted to the journal Pragmatics on September 2024.
The book [1] contains spelling rules classified into ten categories, each category containing many rules. This XML file presents our implemented rules classified with six category tags, as is the case in the book. We implemented 24 rules since the remaining rules require diacritical and morphological analysis that are outside the scope of our present work.
References:
[1] Dr.Fahmy Al-Najjar, 'Spelling rules in ten easy lessons', Al Kawthar Library,2008. Available: https://www.alukah.net/library/0/53498/%D9%82%D9%88%D8%A7%D8%B9%D8%AF-%D8%A7%D9%84%D8%A5%D9%85%D9%84%D8%A7%D8%A1-%D9%81%D9%8A-%D8%B9%D8%B4%D8%B1%D8%A9-%D8%AF%D8%B1%D9%88%D8%B3-%D8%B3%D9%87%D9%84%D8%A9-pdf/
This corpus consists of full transcriptions of both Democratic and Republican 2016 presidential candidate debates, with a special focus on the idiolects of Hillary Clinton and Donald Trump against the background of the speeches of other candidates for the post of president of the United States.
The transcriptions are sourced from the American Presidency Project at the University of California, Santa Barbara. Any use of the material requires a prior and explicit written permission by the project administrator (contact policy@ucsb.edu). This corpus material is now being shared with their kindly permission.
Indonesian text corpus from web. Crawling done by SpiderLing in 2017. Filtering by JusText and Onion (see http://corpus.tools/ for details). Tagged and lemmatized by MorphInd (http://septinalarasati.com/morphind/).
Environmental impact assessment (EIA) is the formal process used to predict the environmental consequences of a plan. We present a rule-based extraction system to mine Czech EIA documents. The extraction rules work with a set of documents enriched with morphological information and manually created vocabularies of terms supposed to be extracted from the documents, e.g. basic information about the project (address, ID company, ...), data on the impacts and outcomes (waste substances, endangered species, ...), a final opinion. The documents Notice of Intent contains the section BI2 with the information on the scope (capacity) of the plan.
A corpus of spontaneous discussions and read-aloud performances from native speakers of different ages. Parallel corpus in Russian, Finnish, and Dutch.
The segment captures the reverential act of depositing the remains of forty-two Italian legionnaires who were executed for deserting from the Austrian army to join the Italian legions in the summer of 1918. The coffins with their bodies were temporarily placed at the military cemetery in Milovice and later unearthed and transported to Prague, where a day-long funeral ceremony was held on 24 April 1921. The camera focuses on military troops lined up on Old Town Square and Italian and Czechoslovak officers. The ceremony is witnessed by Minister of National Defence Otakar Husák and the General Inspector of the Czechoslovak Army, the poet Josef Svatopluk Machar. Shots of speeches given by Josef Rotnágl, a member of the Revolutionary National Assembly, and General Otakar Husák, who delivers a message from the President of Czechoslovakia (silent). This is followed by speeches given by the Senator of the National Assembly, Václav Klofáč, Deputy of the National Assembly František Udržal, and the President of the Italian-Czechoslovak League, Prince Pietro Lanza di Scalea, whose speech is interpreted by diplomat Jan Šeba. Shot of the commander of the funeral procession, General Karel Voženílek, on horseback. General Otakar Husák and Josef Svatopluk Machar receive Italian military honours. After the solemn ceremony on Old Town Square, the coffins with the remains of the executed legionnaires were taken to the military burial ground at Olšany Cemetery.