Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exclude document from OCR #598

Closed
thndrbck opened this issue Feb 16, 2024 · 3 comments
Closed

Exclude document from OCR #598

thndrbck opened this issue Feb 16, 2024 · 3 comments
Assignees

Comments

@thndrbck
Copy link

Forms filled in by hand don't need Optical Character Recognition. The OCR database would fill up with form field labels. Also, disk storage will fill up with unnecessary OCR duplicates.

If you could include a check box when uploading a file so that it is marked for no OCR, that would be helpful.
A toggle to turn off OCR when batch uploading documents would also be helpful.

@ciur
Copy link
Owner

ciur commented Feb 17, 2024

Thank you for opening this ticket!

This feature makes perfect sense and it is relatively easy to implement.
Will be implemented as part of next release 3.1, which will be out in couple of weeks.

@thndrbck
Copy link
Author

Re: Did you meant here exclude entire document from being OCRed - which is exactly as #598 ?

Or did you really meant to exclude specific pages from being OCRed ?
In last case, i.e. when you mean to exclude specific pages from OCRed - it is not possible to implement. It is either entire document (i.e. all pages in the document) or nothing.


I meant not OCRing the entire document.

@ciur
Copy link
Owner

ciur commented Feb 23, 2024

Added
PR#332

Feature will be part of the 3.1.0 release.

@ciur ciur closed this as completed Feb 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants