-
Notifications
You must be signed in to change notification settings - Fork 13
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
This is a bugfix release targeting mainly the MiniOCR and ALTO implementations. **Bufgfixes:** - ALTO: Fix handling of empty words. Previously any words after a word element with no text **would be skipped entirely during indexing** 😱😱. - MiniOCR: Fix handling of empty words, Previously a word element with no text would make the parser crash. - MiniOCR: Make the `wh` attribute on `<p>` page elements actually optional. The documentation said it was optional, but the parser would crash when attempting to handle elements without the attribute **Other Changes:** - A warning will now be logged if none of the fields requested with `hl.ocr.fl` exist or are defined as stored fields. Previously highlighting would just not work, with no indications to users as to why this was the case.
- Loading branch information
Showing
6 changed files
with
30 additions
and
8 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters