Skip to content

Commit

Permalink
Release 0.8.2
Browse files Browse the repository at this point in the history
Bugfix release for an edge-case in hOCR parsing.

**Bugfixes:**

- hOCR: Fix stack overflow when handling empty words in combination with a partially
  hyphenated word

**Other Changes:**

- Improved error message in case of errors during highlighting, the message now
  includes the source pointer of the failed document, or if storing OCR in the index,
  the beginning of the broken content. Also included is the internal Lucene document
  identifier.  [By adding the `[docid]` field to the returned fields for the failing
  query](https://solr.apache.org/guide/solr/latest/query-guide/document-transformers.html#docid-docidaugmenterfactory),
  the internal id is added to very document in the result set for a failing query,
  which should allow quick identification of the documents that cause issues during
  highlighting.
  • Loading branch information
jbaiter committed Sep 22, 2022
1 parent 2564a04 commit 9f4201a
Show file tree
Hide file tree
Showing 2 changed files with 5 additions and 2 deletions.
5 changes: 4 additions & 1 deletion docs/changes.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,7 @@
## 0.8.2 (unreleased)
## 0.8.2 (2022-09-22)
[GitHub Release](https://github.com/dbmdz/solr-ocrhighlighting/releases/tag/0.8.2)

Bugfix release for an edge-case in hOCR parsing.

**Bugfixes:**

Expand Down
2 changes: 1 addition & 1 deletion pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@

<groupId>de.digitalcollections</groupId>
<artifactId>solr-ocrhighlighting</artifactId>
<version>0.8.2-SNAPSHOT</version>
<version>0.8.2</version>

<name>Solr OCR Highlighting Plugin</name>
<description>
Expand Down

0 comments on commit 9f4201a

Please sign in to comment.