Short description of the record: Ground truth of some parts of ÖNB Cod. 473 and Cod. 1239 (including glosses): automatically read pages with Transkribus
ATTENTION: To clone this repo you need to have Git LFS installed and then clone the repository like this:
git lfs clone [email protected]:htr-school-vienna/[your_repository_name].git https://github.com/HTR-School-Vienna/2023--carolingian-latin
- partially Liber Pontificalis (Vienna, Austrian National Library, Cod. 473) and Pauline letters with glosses (Vienna, Austrian National Library, Cod. 1239)
- Files: 3
- Lines: 34.296
- Latin, Carolingian Minuscule (9th century)
- source of images: Austrian National Library (Cod. 473: https://manuscripta.at/hs_detail.php?ID=12335), Cod. 1239 not digitized yet
- Description or citation of transcription guidelines:
- we used the transcription guidelines prepared by Tim Geelhaar according to his public model Latin - Carolingian Minuscule on Transkribus (https://readcoop.eu/de/modelle/latin-carolingian-minuscule/)
- we have three folders, one with the images and transcription of Cod. 473, the second with the images and transcription of both main text and glosses of Cod. 1239 and the last with images and transcription of only the main text of Cod. 1239
This dataset was created by Cinzia Grifoni, Leon Pürstinger, William Weiss, Evangelos Adam, Tatiana Tommasi, Veronika Wladika, Theresa Halbertschlager, Alice Morandy, Essi Nuutinen, Evina Stein, and Richard M. Pollard. The digitisation is not copyright free, but the transcription is. However, properly annotating a corpus takes time and is a task that should be recognised. If you use any item from this corpus as ground truth, cite the dataset using the following information
Copy citation BibTeX from Zenodo 10.5281/zenodo.10589561
This dataset was created as part of the Winter School of Handwritten Text Recognition of Medieval Manuscripts 2023/2024, Vienna at the Österreichische Akademie der Wissenschaften, Institut für Mittelalterforschung, all transcriptions are licensed under the Creative Commons 4 licence. Images were provided by the Austrian National Library (ÖNB) and are licensed under Creative Commons 4 licence.