Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OEM_LSTM_ONLY very slow #632

Closed
rudrabhoj opened this issue Jan 1, 2017 · 3 comments
Closed

OEM_LSTM_ONLY very slow #632

rudrabhoj opened this issue Jan 1, 2017 · 3 comments

Comments

@rudrabhoj
Copy link

rudrabhoj commented Jan 1, 2017

Using Tesseract 4 (libtesseract) in my project decipher_text.
When I use OEM_TESSERACT_ONLY, it takes little over 7-8 seconds to complete the recognization of a page in English. But when I try to instead use OEM_LSTM_ONLY, it takes 10-15 minutes, if not more, to do the same. I am using i5 2400. Ubuntu 16.04 x86_64.

Part of code where this is used:
if (ocrUnit->process->Init(tessDataPath, languageArg, tesseract::OEM_LSTM_ONLY)){
//Again, handle error here
//
//
//
//
}

Entire file in which it was used could be seen here. (Note: File is from an older, stabler version)

@amitdo
Copy link
Collaborator

amitdo commented Jan 1, 2017

Hi,

Did you test it with the Tesseract command line?
How did you compile it? Did you use debug mode?
Can you provide an image?

@Shreeshrii
Copy link
Collaborator

Shreeshrii commented Jan 1, 2017 via email

@theraysmith
Copy link
Contributor

Debug mode is known to be ~50x slower than optimized mode.
Please re-open if you are seeing this slow-down in optimized mode.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants