dc.contributor.author | Cinková, Silvie |
dc.contributor.author | Chromý, Jan |
dc.contributor.author | Šamánková, Jana |
dc.contributor.author | Hořeňovská, Karolína |
dc.contributor.author | Kettnerová, Václava |
dc.contributor.author | Kolářová, Veronika |
dc.contributor.author | Kubištová, Hana |
dc.contributor.author | Panevová, Jarmila |
dc.date.accessioned | 2023-10-10T07:55:21Z |
dc.date.available | 2023-10-10T07:55:21Z |
dc.date.issued | 2023-01-01 |
dc.identifier.uri | http://hdl.handle.net/11234/1-5225 |
dc.description | LiFR-Law is a corpus of Czech legal and administrative texts with measured reading comprehension and a subjective expert annotation of diverse textual properties based on the Hamburg Comprehensibility Concept (Langer, Schulz von Thun, Tausch, 1974). It has been built as a pilot data set to explore the Linguistic Factors of Readability (hence the LiFR acronym) in Czech administrative and legal texts, modeling their correlation with actually observed reading comprehension. The corpus is comprised of 18 documents in total; that is, six different texts from the legal/administration domain, each in three versions: the original and two paraphrases. Each such document triple shares one reading-comprehension test administered to at least thirty readers of random gender, educational background, and age. The data set also captures basic demographic information about each reader, their familiarity with the topic, and their subjective assessment of the stylistic properties of the given document, roughly corresponding to the key text properties identified by the Hamburg Comprehensibility Concept. Changes to the previous version and helpful comments • File names of the comprehension test results (self-explanatory) • Corrected one erroneous automatic evaluation rule in the multiple-choice evaluation (zahradnici_3, TRUE and FALSE had been swapped) • Evaluation protocols for both question types added into Folder lifr_formr_study_design • Data has been cleaned: empty responses to multiple-choice questions were re-inserted. Now, all surveys are considered complete that have reader’s subjective text evaluation complete (these were placed at the very end of each survey). • Only complete surveys (all 7 content questions answered) are represented. We dropped the replies of six users who did not complete their surveys. • A few missing responses to open questions have been detected and re-inserted. • The demographic data contain all respondents who filled in the informed consent and the demographic details, with respondents who did not complete any test survey (but provided their demographic details) in a separate file. All other data have been cleaned to contain only responses by the regular respondents (at least one completed survey). |
dc.language.iso | ces |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation.replaces | http://hdl.handle.net/11234/1-5020 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ |
dc.source.uri | https://ufal.mff.cuni.cz/grants/lifr |
dc.subject | readability |
dc.subject | legal texts |
dc.subject | legal domain |
dc.subject | reading comprehension |
dc.subject | corpus |
dc.subject | survey |
dc.title | LiFR-Law. Corpus of Paraphrased Czech Administrative Texts with Reading Comprehension for Readability Studies (2023-10-08) |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Silvie Cinková cinkova@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
sponsor | GAČR 19-19191S Linguistic Factors of Readability in Czech Administrative and Educational Texts nationalFunds |
size.info | 17601 tokens |
size.info | 18 texts |
size.info | 769 items |
files.size | 4862319 |
files.count | 1 |
Soubory tohoto záznamu
Licenční kategorie:
Licence: Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
Licence: Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Název
- LIFRLawRELEASE2.0.zip
- Velikost
- 4.64 MB
- Formát
- application/zip
- Popis
- zip file with text files
- MD5
- 61eda2fcd85efbc2a73289b29f69962e
- LIFRLawRELEASE2.0
- lifr_tsv_OdpovediTestCteni
- LiFRLawEvalOpenQuestionsJustCompleteSurveys.tsv607 kB
- LiFRLawEvalMultipleChoiceJustCompleteSurveys.tsv2 MB
- README_LiFR_admin.pdf821 kB
- lifr_tsv_Demograficke
- .~lock.Demograficke_Age.tsv#97 B
- suspect_respondents.tsv4 kB
- regular_respondents.tsv16 kB
- Demograficke_ReaderFreq.tsv18 kB
- Demograficke_Eyesight.tsv18 kB
- Demograficke_WriterFreq.tsv18 kB
- Demograficke_CzechL2.tsv1 kB
- Demograficke_CzechL1.tsv17 kB
- .~lock.Demograficke_Education.tsv#97 B
- Demograficke_Sex.tsv16 kB
- Demograficke_Disorder.tsv17 kB
- lifr_texts
- txt
- stavarska-1_kusv.txt13 kB
- zastoupeni-2_jasa.txt4 kB
- stavarska-2_orig.txt14 kB
- ockovani-2_jasa.txt5 kB
- knihovna-2_kusv.txt7 kB
- knihovna-3_orig.txt7 kB
- zahradnici-3_kusv.txt7 kB
- zahradnici-1_jasa.txt5 kB
- zastoupeni-3_orig.txt6 kB
- ockovani-3_orig.txt7 kB
- zahradnici-2_orig.txt7 kB
- knihovna-1_jasa.txt6 kB
- stavarska-3_jasa.txt10 kB
- zaloba-2_kusv.txt6 kB
- zastoupeni-1_kusv.txt4 kB
- ockovani-1_kusv.txt8 kB
- zaloba-3_jasa.txt5 kB
- zaloba-1_orig.txt6 kB
- pdf
- zastoupeni-1_kusv.pdf143 kB
- ockovani-1_kusv.pdf138 kB
- zaloba-3_jasa.pdf183 kB
- zaloba-1_orig.pdf136 kB
- stavarska-1_kusv.pdf155 kB
- zastoupeni-2_jasa.pdf152 kB
- stavarska-2_orig.pdf146 kB
- ockovani-2_jasa.pdf153 kB
- knihovna-2_kusv.pdf139 kB
- knihovna-3_orig.pdf132 kB
- zahradnici-3_kusv.pdf211 kB
- zahradnici-1_jasa.pdf139 kB
- zastoupeni-3_orig.pdf130 kB
- ockovani-3_orig.pdf131 kB
- zahradnici-2_orig.pdf144 kB
- knihovna-1_jasa.pdf139 kB
- stavarska-3_jasa.pdf199 kB
- zaloba-2_kusv.pdf139 kB
- html
- zaloba-2_kusv.html6 kB
- zaloba-3_jasa.html6 kB
- ockovani-2_jasa.html7 kB
- zahradnici-1_jasa.html6 kB
- zahradnici-3_kusv.html9 kB
- stavarska-2_orig.html16 kB
- knihovna-3_orig.html8 kB
- ruzenka.html1 kB
- stavarska-1_kusv.html15 kB
- ockovani-1_kusv.html9 kB
- zaloba-1_orig.html7 kB
- zastoupeni-1_kusv.html5 kB
- zastoupeni-2_jasa.html6 kB
- ockovani-3_orig.html9 kB
- zastoupeni-3_orig.html7 kB
- knihovna-2_kusv.html8 kB
- stavarska-3_jasa.html13 kB
- knihovna-1_jasa.html7 kB
- zahradnici-2_orig.html9 kB
- docx
- zaloba-2_kusv.docx30 kB
- zaloba-3_jasa.docx30 kB
- ockovani-2_jasa.docx29 kB
- zahradnici-1_jasa.docx39 kB
- zahradnici-3_kusv.docx39 kB
- stavarska-2_orig.docx43 kB
- knihovna-3_orig.docx24 kB
- ruzenka.docx14 kB
- stavarska-1_kusv.docx48 kB
- ockovani-1_kusv.docx42 kB
- zaloba-1_orig.docx31 kB
- zastoupeni-1_kusv.docx23 kB
- zastoupeni-2_jasa.docx29 kB
- ockovani-3_orig.docx37 kB
- zastoupeni-3_orig.docx14 kB
- knihovna-2_kusv.docx29 kB
- stavarska-3_jasa.docx59 kB
- knihovna-1_jasa.docx32 kB
- zahradnici-2_orig.docx36 kB
- txt
- lifr_hamburg_comprehensibility_annotation
- lifr_hamburg_comprehensibility_annotation.tsv5 kB
- lifr_tsv_Subjektivni
- Subjektivni_wideformat.tsv90 kB
- Subjektivni_longformat.tsv458 kB
- lifr_formr_study_design
- REVISED_MULTICHOICE_CLUES.tsv8 kB
- lifr_formr_run.json1 MB
- REVISED_OPEN_QUESTION_EVALUATION.tsv246 kB
- lifr_tsv_CoVedeliPredem
- CoVedeliPredem_JednotliveOtazky.tsv301 kB
- CoVedeliPredem_PerDokument.tsv87 kB
- lifr_tsv_OdpovediTestCteni