single digits not getting recognized #2389

Open
Shreeshrii opened this issue Apr 15, 2019 · 4 comments

Comments

@Shreeshrii
Collaborator

Shreeshrii commented Apr 15, 2019

tesseract -v
tesseract 4.1.0-rc1-255-g332a1
leptonica-1.76.0
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.4 : libopenjp2 2.3.0

Please see the issue opened by @jandier, which includes a number of images that are either not being recognized or are being recognized incorrectly.
Shreeshrii/tessdata_shreetest#5 (comment)

@Shreeshrii
Collaborator Author

Shreeshrii commented Apr 15, 2019

Using the fine-tuned digits traineddata gives slightly better results in some cases, but recognition still fails with the default --psm 3.

This non-recognition of small images has also been reported elsewhere. @stweil @bertsky Any suggestions for improving this?

@Shreeshrii
Collaborator Author

Shreeshrii commented Apr 15, 2019

Here is the output for 0-9.png and 06.jpg (different style and size of 6).

The digits config file, which uses the whitelist feature, improves the result. Thanks, @bertsky.

*****  num/06.jpg OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
5
**** with digits config ****
5

*****  num/0.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
Co
**** with digits config ****
0

*****  num/1.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
IE
**** with digits config ****

*****  num/2.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
2
**** with digits config ****
2

*****  num/3.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
3
**** with digits config ****
3

*****  num/4.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
Ce
**** with digits config ****

*****  num/5.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
Cs
**** with digits config ****
5

*****  num/6.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
Ce
**** with digits config ****
6

*****  num/7.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
7
**** with digits config ****
7

*****  num/8.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
Cs
**** with digits config ****
8

*****  num/9.png OEM 1 LANG eng TESSDATA tessdata_best
**** PSM 3 ****
Empty page!!
Empty page!!
**** with digits config ****
Empty page!!
Empty page!!
**** PSM 8 ****
Cs
**** with digits config ****

@rexlow

rexlow commented Apr 17, 2019

I noticed the same behavior when a single digit is placed far away from other blocks of characters. Interestingly, Google Cloud Vision sometimes suffers from the same problem.

@Shreeshrii
Collaborator Author

The empty page issue is also reported in #1362.
