workflow not finishing #53
Comments
Hi @github-cli, yes, basically this command is issued. To be precise, it's [...]. The first thing I'd do is set a more verbose log level in your NC config and then paste the results here if possible.
This is the relevant output. As for the error messages in lines 9 and 10, the same ones appear if I run ocrmypdf manually, but it still finishes and creates the file correctly.
Thanks for this. Well, the relevant line is [...]. If possible, please paste the mentioned PDF file here; maybe @bahnwaerter could have a look at it and say what's wrong?
I have an example I can send. It's not exactly confidential, but can I still share it in private?
OK, then it would be nice if you could send me an email with both files attached. @bahnwaerter FYI
Hey @github-cli, thanks for sharing your original PDF files with @R0Wi and me. I've taken a look at those PDF files and analyzed them.

The PDF file of the low quality scan is compliant with the PDF 1.7 standard, whereas the PDF file of the high quality scan is not syntactically well-formed. Therefore, the PDF file of the high quality scan does not conform to any of the available PDF standards. Furthermore, I noticed that both PDF files were created by the HP scan tool. This scan tool seems to create faulty PDF files, as the analysis of issue #42 shows.

To solve this issue, you can repair your faulty PDF files before uploading them to your Nextcloud server. Please follow the solution described in #42 (comment). I will close this issue as it is related to the HP scan tool. But feel free to reopen it if we can help somehow.
Duplicate of #42
It seems this workflow does not finish on larger, higher-resolution files. If I manually run "ocrmypdf --redo-ocr input.pdf output.pdf", it works fine, but running "sudo -u www-data php cron.php" only updates smaller files. The job does seem to start, since cron.php takes quite some time when new large scans have been added, but those files are never updated.
Is there any way to debug this? Isn't this the exact same command that the workflow uses?
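One way to compare the manual run with the cron run is to execute both as the same user and record the exit status and duration of each. The following is only a minimal sketch: `run_and_report` is a hypothetical helper, not part of the workflow app or of ocrmypdf, and the file names in the usage comments are placeholders.

```shell
# Hypothetical helper: run any command, then report its exit status
# and wall-clock time (in whole seconds) on stderr.
run_and_report() {
  start=$(date +%s)
  "$@"
  status=$?
  end=$(date +%s)
  echo "exit=${status} elapsed=$((end - start))s" >&2
  # Pass the wrapped command's exit status back to the caller.
  return "${status}"
}

# Usage (file names are placeholders):
#   run_and_report sudo -u www-data ocrmypdf --redo-ocr input.pdf output.pdf
#   run_and_report sudo -u www-data php cron.php
```

Running ocrmypdf as www-data rather than as a login user brings the manual test closer to the conditions of the cron job, and a non-zero exit status on the large file would point to where the two runs differ.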