[Bug]: Textbox detection seems to be performing poorly with ctd #710

Open
torgabor opened this issue Sep 29, 2024 · 3 comments
Labels
bug Something isn't working

Comments

@torgabor

torgabor commented Sep 29, 2024

Issue

Hi!
I've tried this program, but text box detection with ctd seems very unreliable: it only picks up roughly 10-50% of the text boxes.

The strange thing is that the hosted version (https://cotrans.touhou.ai/) seems much better, and BallonsTranslator (https://github.com/dmMaze/BallonsTranslator), which is supposedly based on this project, also works well.

In all cases, I'm using the ctd detector with a detection size of 1024, and mit-48px for OCR (where I can specify it).

Does anyone have an idea what the difference between these setups could be?
I tested this with commit 37bb4cd.

Command Line Arguments

This is the one I used last, but I tried many combinations of arguments, with no success:
python -m manga_translator --translator=none --force-horizontal -l=ENG --detector=ctd --detection-size=1024 --ocr=48px --overwrite --use-gpu --save-text --inpainter none -i somepage.jpg

No response

Console logs

No response

torgabor added the bug label on Sep 29, 2024
@grassmedico

Try upscaling the image first.
You can also scale it back down afterwards if you don't want the resolution to become too big.

@zyddnys
Owner

zyddnys commented Oct 11, 2024

Use a higher detection resolution with the default detector, e.g. --detection-size=2048 --detector default
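
As a rough sketch of how this suggestion maps onto the command from the original report: the snippet below only swaps the detector and detection size and keeps every other flag exactly as posted. The helper name `build_command` is hypothetical, and keeping the remaining flags unchanged is an assumption, since the suggestion only mentions `--detector` and `--detection-size`.

```python
import shlex


def build_command(image, detector="default", detection_size=2048):
    """Assemble the manga_translator CLI call as an argument list.

    Hypothetical helper for illustration; only the flags that appear
    in the original report are used here.
    """
    return [
        "python", "-m", "manga_translator",
        "--translator=none",
        "--force-horizontal",
        "-l=ENG",
        f"--detector={detector}",
        f"--detection-size={detection_size}",
        "--ocr=48px",
        "--overwrite",
        "--use-gpu",
        "--save-text",
        "--inpainter", "none",
        "-i", image,
    ]


# The command as originally reported (ctd detector, detection size 1024).
original = build_command("somepage.jpg", detector="ctd", detection_size=1024)

# The suggested variant: default detector, detection size 2048.
suggested = build_command("somepage.jpg")

# Print a shell-ready command line for the suggested variant.
print(shlex.join(suggested))
```

The list can be passed to `subprocess.run(suggested, check=True)` to run both variants on the same page and compare how many text boxes each one finds.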

@DuckDuckGo-prog

> use higher detection resolution with default detector like --detection-size=2048 --detector default

Can you tell me which detectors give accurate results? It seems to me that ctd paired with manga_ocr is failing.

So the default detector is now more accurate?
