[Paperless-NGX] OCR not working with additional languages #3012
Replies: 3 comments 6 replies
-
in lxc restart paperless service |
Beta Was this translation helpful? Give feedback.
-
I started from scratch, no dice. With default settings, PNGX complains (even with notification in GUI) about Uncommenting and adding results in the file stuck in Queued (NO GUI warning)
in log. installing I don't think this is me. CPU should not be an issue, the OCR process is not starting with Czech language at all. |
Beta Was this translation helpful? Give feedback.
-
It seems just installing the tesseract-ocr-ces package and not touching the conf file at all at the very least allows for the scanning to start and finish. The scanned document looks fine. Let's see what happens when I add english PDF. |
Beta Was this translation helpful? Give feedback.
-
I need to add Czech (CES) to OCR because I'm getting
MissingDependencyError: OCR engine does not have language data for the following requested languages: CES
PNGX documentation says:
If you run paperless on docker, paperless.conf is not used. Rather, configure paperless by copying necessary options to docker-compose.env
LXC is not Docker, so I'm editing /opt/paperless/paperless.conf
Adding
PAPERLESS_OCR_LANGUAGE=eng ces
makes zero difference.anything else? thanks!
Beta Was this translation helpful? Give feedback.
All reactions