Skip to content

Transcriptions of 15th-century Latin manuscripts (ÖNB Cod. 4680 and 4135) from the 2024/2025 HTR Winter School, following CATMuS guidelines.

License

Notifications You must be signed in to change notification settings

HTR-School-Vienna/2024--late-medieval-latin

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HTR Winter School 2024/2025 - Late Medieval Latin, ONB 4135; ONB 4680

Transcriptions from codd. 4680 and 4135, Wien, Österreichische Nationalbibliothek (ÖNB). Both are saec. XV, Latin.



ATTENTION: To clone this repo you need to have Git LFS installed and then clone the repository like this:

git lfs clone [email protected]:HTR-School-Vienna/2024--late-medieval-latin.git

# or you can use https://
git lfs clone https://github.com/HTR-School-Vienna/2024--late-medieval-latin

Description

Cod. 4680

Miscellanea by Thomas Ebendorfer (1388-1464) as found in MS Wien, Österreichische Nationalbibliothek (ÖNB), Cod. 4680 (autograph copy). See the description of the manuscript at Manuscripta.at. Writing: Latin, cursive, saec. XV. Number of files: Number of lines:

Cod.4135

Transcriptions from: Tractatus contra Hussitas (ed. Traxler, 2020) fol.1r-47r; fol. 55r-125v and fol.148r-279v (various texts). Wien, Österreichische Nationalbibliothek (ÖNB), Cod. 4135. Writing: Latin, cursive, saec. XV. Number of files: Number of lines:

Transcription guidelines

The guidelines followed are those prescribed by CATMuS (Consistent Approaches for Transcribing Manuscripts). [Ariane Pinche, Thibault Clérice, Alix Chagué, Jean-Baptiste Camps, Malamatenia Vlachou- Efstathiou, et al., CATMuS-Medieval: Consistent Approaches to Transcribing ManuScripts: A generalized set of guidelines and models for Latin scripts from Middle Ages (8th–16th century). 2023. hal-04346939]

Due to the level of complexity in the manuscripts studied, significant interpretation of the guidelines was required. It was necessary to agree on general conventions in order to produce a consistent approach in transcribing; nevertheless, some minor variation will be found based on individual interpretation.

Source of images: Austrian National Library

Origin of the data

Cod. 4135 https://digital.onb.ac.at/RepViewer/viewer.faces?doc=DTL_3262558&order=1&view=SINGLE

Cod. 4680 https://digital.onb.ac.at/RepViewer/viewer.faces?doc=DTL_8098140&order=1&view=SINGLE

Data organisation

The data are organised in two folders, one for each manuscript. Each folder contains two subfolders: 'Final' and 'Done'. The 'Final folder contains transcribed pages that have been checked by the organisers, while the 'Done' folder contains finished pages that have not been checked. The transcriptions are in the form of XML files in format of PageXML, with one file per page.

Each manuscript folder also contains an image folder, which contains the images of the manuscript pages. The images are in the form of JPEG files, with one file per page. The images are named according to the folio number of the manuscript page.

How to cite

This dataset was created by: Jan Odstrčilík, Ivana Lukáč Labancová, Martin Roček, Ignacio Arias Philippi, Giorgio Di Michino, Erika Dell'Aquila, Jakub Fiala, Nico Faltin, Jakub Fiala, Annamária Kovács, Miroslav Kotrha, Dorota Labašová, Lorenzo Livorsi, Zuzana Čermáková Lukšová, Jan Maliszewski, Claudia Meneghetti, Courtney Van de Mosselaer, J.P. Osmond, Alexandre Parent, Štefan Pacola, Abigail Provan, Eva Rutšeková, Katharina Strika, Michal Ťapák, Viktor Tuček, Ballo Harald, Davide Obili, Jan Odstrcilik, Cerys Warwick, Peter Buch Hvarregaard, Emmarae Stein. The digitisation is not copyright free, but the transcription is. However, properly annotating a corpus takes time and is a task that should be recognised. If you use any item from this corpus as ground truth, cite the dataset using the following information:

BibTeX from Zenodo 10.5281/zenodo.10589561

Copyright and licence

This dataset was created as part of the Winter School of Handwritten Text Recognition of Medieval Manuscripts 2024, Vienna at the Österreichische Akademie der Wissenschaften, Institut für Mittelalterforschung, all transcriptions are licensed under the Creative Commons 4 licence. Images were provided by the Austrian National Library (ÖNB) and are licensed under Creative Commons 4 licence.

About

Transcriptions of 15th-century Latin manuscripts (ÖNB Cod. 4680 and 4135) from the 2024/2025 HTR Winter School, following CATMuS guidelines.

Topics

Resources

License

Stars

Watchers

Forks