Include Groundtruth Alignment #7

M3ssman · 2022-11-08T09:48:37Z

Description

Given the case, that groundtruth has been created once upon a time and the original(=referenced) image has been scanned again with different parameters (resolution, channels, size, angle, ... ) , the existing groundtruth would become unusable because of different geometric features.

Currently, there's a clumsy way of achieving a somehow similar effect implemented by passing an extra argument ignore_geometry. But this make only the data "forget" it's coordinates, which on the other side requires additional manual efforts to extract the appropriate image frame from the evaluation candidate an use only this part for ocr-ing as well.

Typical examples include newspaper digitalization when an image is first taken from microfilm/microfiche from which initial groundtruth got created and then shall be compared with a scan from original paper.

Requires the actual image to align existing groundtruth to it's actual geometry by calculating affine transformation matrix which handles translation, rotation and scaling.

For sake of simple implementation see as classical optimization task, cf. remarks from @bertsky on https://github.com/OCR-D/ocrd_segment/wiki/TemplateDrivenSegmentation) and try out the proposed algorithms.

In real world scenarios like different image sizes the major problem is scaling, so the implementation should favor an approach that puts emphasis on this issue.

The text was updated successfully, but these errors were encountered:

bertsky · 2022-11-08T15:50:54Z

Agreed! See also https://github.com/hnesk/ocr-experiments for a first demo

M3ssman added enhancement New feature or request good first issue Good for newcomers labels Nov 8, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include Groundtruth Alignment #7

Include Groundtruth Alignment #7

M3ssman commented Nov 8, 2022

bertsky commented Nov 8, 2022

Include Groundtruth Alignment #7

Include Groundtruth Alignment #7

Comments

M3ssman commented Nov 8, 2022

Description

bertsky commented Nov 8, 2022