-
Notifications
You must be signed in to change notification settings - Fork 9.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tesseract fails to recognize big text #3480
Comments
I think this may be the issue with the text being white-on-black. Can you check it by manually inverting colors and trying again? |
Tried it with with tesseract version 5 and it works perfect:
|
@wollmers Did you use tessdata |
@nocun Yes, I tried and it works. But I have some pictures with the same text size and on some of them Tesseract fails. |
None of Just downloaded today for a fresh compile/installation on another server
|
Images that small (33 KB) should be in the issue thread. If I test an issue, I try to record the case in an own github repository like this: https://github.com/wollmers/ocr-tess-issues/tree/main/issues/issue_3304_big_text. |
So, no issue recognizing the image you provided. Getting different results with different models is normal.
So provide 2-3 examples. Use the latest 5.0.0 codebase. |
No feedback from the OP. |
Environment
Current Behavior:
Tesseract can't recognize text on this image.
Suggested Fix:
I reduced size of the image and it works, but shouldn't Tesseract recognize it without any extra transformations?
The text was updated successfully, but these errors were encountered: