In addition to ALTO (`text/xml`), we should support PAGE (`application/vnd.prima.page+xml`) files.
(One scenario could be OCR-D processed material.)
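One way to tell the two formats apart before dispatching to a parser is to look at the XML root element. The following is only a minimal sketch, not code from this project: the function name `detect_ocr_format` and the idea of returning the MIME type are assumptions for illustration. It relies on PAGE documents having a `PcGts` root and ALTO documents having an `alto` root, and ignores the namespace version.

```python
import xml.etree.ElementTree as ET

# MIME types as named in this issue
PAGE_MIME = "application/vnd.prima.page+xml"
ALTO_MIME = "text/xml"


def local_name(tag):
    """Strip the '{namespace}' prefix ElementTree puts in front of tag names."""
    return tag.rsplit("}", 1)[-1]


def detect_ocr_format(xml_data):
    """Hypothetical helper: return the MIME type for a PAGE or ALTO document.

    Accepts str or bytes. Returns None if the root element is neither
    'PcGts' (PAGE) nor 'alto' (ALTO).
    """
    root = ET.fromstring(xml_data)
    name = local_name(root.tag)
    if name == "PcGts":
        return PAGE_MIME
    if name == "alto":
        return ALTO_MIME
    return None


if __name__ == "__main__":
    sample = (
        '<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/'
        'pagecontent/2019-07-15"><Page/></PcGts>'
    )
    print(detect_ocr_format(sample))
```

Matching on the local root name rather than the full namespace keeps the check independent of the PAGE or ALTO schema version in use.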
Workaround in the meantime: convert with https://github.com/kba/page-to-alto, which is included in ocrd-fileformat-transform as the `page alto` transformation (but you may have to pass `script-args` for page-to-alto, e.g. `--dummy-word --no-check-words --no-check-border`).
For inspiration: https://github.com/dariok/page2tei/blob/master/page2tei-0.xsl
EDIT: but we would have to coordinate that with https://www.deutsches-textarchiv.de/doku/basisformat/