-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cut multiple line segments #262
Comments
Hi,
Line segmentation in OCR4all isn't really implemented optimally at the moment and while – as you said – one can often improve the results with parameter tweaking this doesn't work always. The upcoming release of OCR4all will feature refactored code for the line segmentation and will hopefully improve the line segmentation.
Would the subtract rectangle / subtract polygon work for your use case (see video)? Peek.2021-06-24.17-25.mp4I just quickly looked into adding a cut-from-line (instead of rectangle / polygon) feature into LAREX but Paper.js doesn't seem to like intersecting / dividing open paths like lines and closed paths like polygons (the cut function in
Great idea, I guess adding a toggle for that would make a lot of sense for the current subtraction features as well.
Ordering the newly created segments (through subtraction or division) by lowest x or y coordinate (determined by the state of the added toggle) might probably work for most vertical / horizontal segmentation, wouldn't it? |
Hello!
This could work for series of use cases, I guess. Thanks for this idea! One will probably have to adjust the two resulting polygons, but that is not terribly cumbersome. Maybe one might add a polygon reduce function. If not closed, you could automatically add for each point
Very much in favour of this idea!
I'm not sure how this would work out on a skewed page with two-column layout. In any case, one could also think of a possibility of moving multiple lines in the reading order batchwise (select group of lines, move them to a specific position in the reading order). More generally, however, I would advocate for a "redo reading order" function. When I add several new lines, it would be easier to have the reading order recognized once again instead of manually adding the new lines. |
Hello!
This is a small feature request originating from my work with OCR4all/LAREX:
Line segmentation isn't always perfect. For some reason (maybe this can be avoided tweaking the preferences) a bunch of lines gets not segmented properly, for example:
As this happens rather often, drawing new rectangles and adding them to the reading order can become time-consuming, so I was wondering if you could add something like the cut line function () in the Segments mode to the Lines mode as well.
Possible behaviour:
A similar function for vertical segmentation would be useful as well, but reorganizing the reading order is definitely more difficult.
Thank you!
The text was updated successfully, but these errors were encountered: