Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include Groundtruth Alignment #7

Open
M3ssman opened this issue Nov 8, 2022 · 1 comment
Open

Include Groundtruth Alignment #7

M3ssman opened this issue Nov 8, 2022 · 1 comment
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@M3ssman
Copy link
Member

M3ssman commented Nov 8, 2022

Description

Given the case, that groundtruth has been created once upon a time and the original(=referenced) image has been scanned again with different parameters (resolution, channels, size, angle, ... ) , the existing groundtruth would become unusable because of different geometric features.

Currently, there's a clumsy way of achieving a somehow similar effect implemented by passing an extra argument ignore_geometry. But this make only the data "forget" it's coordinates, which on the other side requires additional manual efforts to extract the appropriate image frame from the evaluation candidate an use only this part for ocr-ing as well.

Typical examples include newspaper digitalization when an image is first taken from microfilm/microfiche from which initial groundtruth got created and then shall be compared with a scan from original paper.

Requires the actual image to align existing groundtruth to it's actual geometry by calculating affine transformation matrix which handles translation, rotation and scaling.

For sake of simple implementation see as classical optimization task, cf. remarks from @bertsky on https://github.com/OCR-D/ocrd_segment/wiki/TemplateDrivenSegmentation) and try out the proposed algorithms.

In real world scenarios like different image sizes the major problem is scaling, so the implementation should favor an approach that puts emphasis on this issue.

@M3ssman M3ssman added enhancement New feature or request good first issue Good for newcomers labels Nov 8, 2022
@bertsky
Copy link

bertsky commented Nov 8, 2022

Agreed! See also https://github.com/hnesk/ocr-experiments for a first demo

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

2 participants