Adding segmentation as way to save ton of time #120

maximka1812 · 2022-11-17T00:27:13Z

One option is to use Tesseract OCR API, see https://github.com/maximka1812/Segmentation-Demo-Using-Tesseract-API
Note that you can use Tesseract API also for automatic page turning on initial stage, they have feature to determinate angle (90, 180, 270).

This can be also all be done via external call and just use JSON or similar way to return information to ST.

Another huge help is to be able to use image regions data for mixed output mode, as presently ST automatic mode frequently make errors or miss stuff.

Another option is to have tool that changes ScanTailor project file adding all information, here saved Finereader project files can be also used, they have regions data in binary (and their segmentation quality is better!)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding segmentation as way to save ton of time #120

Adding segmentation as way to save ton of time #120

maximka1812 commented Nov 17, 2022

Adding segmentation as way to save ton of time #120

Adding segmentation as way to save ton of time #120

Comments

maximka1812 commented Nov 17, 2022