A box editor and trainer for Tesseract OCR, providing editing of box data of both Tesseract 2.0x and 3.0x formats and full automation of Tesseract training. It can read images of common image formats, including multi-page TIFF. The program requires Java Runtime Environment 8 or later.
Note: LSTM Training for Tesseract 4.0x is not supported.
jTessBoxEditor is released and distributed under the Apache License, v2.0.
- Tesseract Windows training executable 5.3.3 bundled
Java.
java -Xms128m -Xmx1024m -jar jTessBoxEditor.jar