Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Box_padding is shift on the fisrt row/column #1908

Open
zwwlouis opened this issue Sep 14, 2018 · 2 comments
Open

Box_padding is shift on the fisrt row/column #1908

zwwlouis opened this issue Sep 14, 2018 · 2 comments
Labels

Comments

@zwwlouis
Copy link

Tesseract Open Source OCR Engine v4.0.0-beta.4-18-g4370
text2image 4.0.0-beta.4-18-g4370

Current Behavior:

Since I'v found that model is sensitive on img padding, I'm trying to add some random box_padding when rendering img
But the words on the first row or column are wrong in padding, as the picture below.
eng_box_padding

Some words are always on the first line, So they are always geting wrong box info which will likely lead to overfiting as I know.

Expected Behavior:

I expect box_padding to be uniformly distribute no matter the word is on the edge or inside

@amitdo
Copy link
Collaborator

amitdo commented Sep 14, 2018

The bounding boxes of characters are not very reliable when the lstm engine is being used for text recognition.

Do you get the right text output?

@zwwlouis
Copy link
Author

Nope, I use eng training text to make an example. For chinese word it always box up character by character.
This box shift influence my model more or less.

The bounding boxes of characters are not very reliable when the lstm engine is being used for text
recognition.

Do you get the right text
output?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants