Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using certain traindata files causes Tesseract crash. #249

Open
oldominion opened this issue Jun 2, 2019 · 4 comments
Open

Using certain traindata files causes Tesseract crash. #249

oldominion opened this issue Jun 2, 2019 · 4 comments

Comments

@oldominion
Copy link

oldominion commented Jun 2, 2019

I've cross posted this on the github page of the traindata files.

Shreeshrii/tessdata_shreetest#7

Using any of these lang files (see: https://github.com/Shreeshrii/tessdata_shreetest) cases tesseract via EmguCV 4.1 it to fail with:

"DotProductAVX can't be used on Android"

I'm building my application for Windows x64 target platform.

This isn't a new issue by any means:

tesseract-ocr/tesseract#1113
tesseract-ocr/tesseract#1151

@oldominion
Copy link
Author

Someone else having this same problem: http://www.emgu.com/forum/viewtopic.php?t=21084

@oldominion
Copy link
Author

Found a workaround is to convert the traineddata file to integer type with --convert_to_int

This reduces accuracy as far as I understand, so it would be preferable if EmguCV could fixed to work with float type traindata files too.

@maxpeacock
Copy link

Does anybody know how to fix the bug? I can not update to Emgu 4.x because i need tesseract to run with float type traindata files. But I would like to work with the newes version.

@NkhilKajrekar
Copy link

@oldominion can you please provide file which you converted to integer type also let us know how you have used. we are having same issue with following code we are getting OOM Killed error string result; using (Tesseract ocr = new Tesseract(TessdataPath, "eng", OcrEngineMode.TesseractLstmCombined)) { ocr.PageSegMode = PageSegMode.SingleBlock; ocr.SetImage(gemImg); result = ocr.GetUTF8Text().Trim().TrimStart('-'); result = Regex.Replace(result, @"[^0-9a-zA-Z-]+", ""); } gemImg.Dispose(); return result;

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants