Zobrazit minimální záznam

 
dc.contributor.author Salah Elfahal Elebaed, Hoyam
dc.contributor.author Kasbi, Mohammed
dc.contributor.author Nasri, Mohammed
dc.contributor.author Bouzoubaa, Karim
dc.date.accessioned 2022-05-31T20:08:34Z
dc.date.available 2022-05-31T20:08:34Z
dc.date.issued 2021
dc.identifier.uri http://hdl.handle.net/11372/LRT-4768
dc.description This corpus constitutes all sentences representing the Arabic Controlled Language (ACL). It contains 551 sentences taken from four textbooks and websites dedicated to teach Arabic language to kids such as: a) First grade book, Republic of Sudan (كتاب الصف الاول جمهورية السودان), b) Al Jazeera Educational Site (موقع الجزيرة التعليمي), c) Bella Preparatory School Girls Forum (منتدى مدرسة بيلا الاعدادية بنات), and d) Albahr website (موقع انا البحر). These sentences are respecting 52 ACL rules. The average number of sentences for each rule is 10.6. All sentences in the corpus were analyzed by Farasa syntactic parser to confirm they are correctly analyzed. The validity of the parsing was done manually by linguist experts. The structure of this corpus is made of a header and a body. The header consists of a set of metadata that describe the corpus, such as the corpus name, the authors, the sources and further meta data. While the header is made of metadata, the body contains rules. Each rule has a code, a structure and all sentences respecting that rule. For each sentence, we store an id, the vowelledand unvowelled text as well as the result of parsing using Farasa.
dc.language.iso ara
dc.publisher International Journal of Computer Science Trends and Technology (IJCST)
dc.relation.isreferencedby http://www.ijcstjournal.org/volume-9/issue-6/IJCST-V9I6P8.pdf
dc.source.uri http://arabic.emi.ac.ma/alelm/?q=Resources
dc.subject Controlled Natural Language
dc.subject Arabic CNL
dc.subject ACL
dc.subject Arabic Corpus
dc.subject and TEI.
dc.title Arabic ACL corpus
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files no
branding LRT + Open Submissions
demo.uri http://arabic.emi.ac.ma/alelm/?q=Resources
contact.person Hoyam Salah Elfahal Elebaed hoyam090@hotmail.com College of Post-graduate Studies, Sudan University of Science and Technology, Khartoum -Sudan
sponsor No No No ownFunds
size.info 197 kb
files.size 0
files.count 0


Zobrazit minimální záznam