Skip to content
This repository has been archived by the owner on Jan 6, 2024. It is now read-only.

Add support for GNU Parallel #6

Closed
popnt opened this issue Aug 26, 2016 · 6 comments
Closed

Add support for GNU Parallel #6

popnt opened this issue Aug 26, 2016 · 6 comments

Comments

@popnt
Copy link

popnt commented Aug 26, 2016

The batch and service modes of pmOCR seem like good candidates to integrate Parallel

I'm not sure which would be the best way to do it but I'd be willing collaborate on it

@deajan
Copy link
Owner

deajan commented Aug 26, 2016

I already have a parallel job control function which is portable to bsd and mac.
It should be easy to send OCR function to background without using gnu parallel.

What's the purpose here ?
Is the purpose is to use tesseract on all cores (abbyyocr11 is already threaded) ?
Is the purpose to run on multiple computers ?

@popnt
Copy link
Author

popnt commented Aug 26, 2016

Yes, Tesseract does not support all cores by default, but your point that it running on multiple computers sounds very useful too!

@deajan
Copy link
Owner

deajan commented Aug 26, 2016

It's called "poor man" ocr service, running on multiple cores should be enough :)

I've commited some quick and dirty parallel execution work.
Number of processes is set in config header.

Please test it and tell me if it works for you.

@popnt
Copy link
Author

popnt commented Aug 26, 2016

Just gave it a try and seems to work just fine..! I set the process limit to 3 on a system with 4 cores, I'll monitor if it uses more than 3 but so far it looks stable.

@deajan
Copy link
Owner

deajan commented Aug 27, 2016

Commited what should become v1.5.
Fixes an issue introduced with parallel processing on filenames with spaces and adds a couple of options for tesseract. Feel free to update and test.
Also, feel free to tell me if you have other ideas for this script :)

@popnt
Copy link
Author

popnt commented Sep 5, 2016

tested v1.5 and runs well. Also integrated instructions to run a new instance as outlined in Issue-7, it all seems stable!

@popnt popnt closed this as completed Sep 5, 2016
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

2 participants