Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Run do_ocr.py automatically when pages are not equal #66

Open
ravidreams opened this issue Feb 17, 2016 · 3 comments
Open

Run do_ocr.py automatically when pages are not equal #66

ravidreams opened this issue Feb 17, 2016 · 3 comments

Comments

@ravidreams
Copy link

Run do_ocr.py automatically when pages are not equal at the end of first do_ocr.py run. Right now, it waits for user input.

@jayantanth
Copy link
Contributor

It will create endless loop, because we are using third party tool (Google drive) and ocr depends on scan page quality, so manual input necessary. so after first run completed next three/four time can be set re-run automatically. For next run should be done by user and there may be two option

  1. re-run
  2. skipped the undone page

skipped page as describe #38 can be added here for complete the full ocr process.

@ravidreams
Copy link
Author

I see. How about limiting the iteration to 1 or 2 times only and then request manual input? This way, endless loop can be avoided.

But, this automatic feature is necessary if we are going to run batch of files together without the need for editing config.ini every time for new file. When the tool moves to the cloud, this might be necessary.

@bodhisattwawiki
Copy link
Contributor

This is needed. Happening to me almost every time. At least run do_ocr.py second time automatically, if some pages are not OCRed. After that we can do it manually.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants