This is an open dataset of sentences from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains a corpus for language modeling and human annotations for named entity recognition (NER).
This is an open dataset of sentences from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains a corpus for language modeling and human annotations for named entity recognition (NER).
This is an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification.
These are supplementary materials for an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification and is available at http://hdl.handle.net/11234/1-4615. These supplementary materials contain OCR texts from different OCR engines for book pages for which we have both high-resolution scanned images and annotations for OCR evaluation.
This study deals with a short but little researched episode in the life of Henry of Isernia, an Italian master of ars dictaminis, who came to the court of King Ottokar II of Bohemia in the early 1270s. Henry’s letter collection contains nine letters relating to his temporary stay at the Premonstratensian monastery in Strahov. These letters are impressive, but hardly interpretable, historical sources, and are also the only ones describing the circumstances of the election of a new Abbot of Strahov that probably took place in 1274. The reliability and credibility of Henry’s sometimes exaggerated and emotionally charged narratives were assessed by comparing their historical and biographical content with existing documents and memorial sources, such as monastery necrologies and annals.