Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bugs #2

Open
joeywen0830 opened this issue May 27, 2023 · 2 comments
Open

Bugs #2

joeywen0830 opened this issue May 27, 2023 · 2 comments

Comments

@joeywen0830
Copy link

Along with DLL missing issue same as the other user, the ocr function sometimes automatically ignores some words, and the final txt file is showing wrong sequences of sentences compared to the original image text. Would you be kind enough to explain what kind of image should we use to receive the most accurate outcome? Thanks.

@ahnaf-zamil
Copy link
Owner

Hello there, thanks for making the issue!

Are you using the version from the Releases page? Shouldn't say DLL missing if you used that. Although, Windows can be a b*itch sometimes. What specific DLL is it missing? It should say that in a popup prompt.

Also, please send the input images and I'll try to test them out myself. I personally didn't train the JP_VERT detection model that is used for the text detection, it was trained by the guys who made Tesseract. But completely ignoring some words is very strange, since it has never happened.

The sequencing/ordering of sentences may be an issue caused by my ordering algorithm, I'll have to take a look at that.

Thanks for making the issue though :)

@joeywen0830
Copy link
Author

Hello there, thanks for making the issue!

Are you using the version from the Releases page? Shouldn't say DLL missing if you used that. Although, Windows can be a b*itch sometimes. What specific DLL is it missing? It should say that in a popup prompt.

Also, please send the input images and I'll try to test them out myself. I personally didn't train the JP_VERT detection model that is used for the text detection, it was trained by the guys who made Tesseract. But completely ignoring some words is very strange, since it has never happened.

The sequencing/ordering of sentences may be an issue caused by my ordering algorithm, I'll have to take a look at that.

Thanks for making the issue though :)

Yeah I'm using the ones from the releases. The DLL issue is just like what the other user's image shows. About the missing word issue, I'm not sure why, sometimes it happens

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants