Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Assertion failure when using the legacy ara.traineddata in latest tesseract version. #117

Closed
girikum opened this issue Oct 17, 2023 · 2 comments
Labels
bug Something isn't working

Comments

@girikum
Copy link

girikum commented Oct 17, 2023

I downloaded the legacy arabic model file from your documentation here: https://tesseract-ocr.github.io/tessdoc/tess3/Data-Files.html

When I use the model in the latest tesseract version with OEM mode as 0, I see an assertion failure.

tesseract inputImage.jpg outputText -l ara --oem 0
mgr->GetComponent(TESSDATA_INTTEMP, &fp):Error:Assert failed:in file src/classify/adaptmatch.cpp, line 539
Illegal instruction

Is this a bug in the tessdata file or the latest tesseract version?

@stweil
Copy link
Contributor

stweil commented Oct 17, 2023

It's a bug that the latest Tesseract raises an assertion instead of reporting an old format which is obviously no longer supported. Why do you want to use a legacy model for Arabic? I don't expect that it will work good.

@stweil stweil added the bug Something isn't working label Oct 17, 2023
@amitdo
Copy link
Collaborator

amitdo commented Oct 18, 2023

Duplicate of tesseract-ocr/tesseract#3929.

@amitdo amitdo closed this as completed Oct 18, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants