New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error while indexing PDF files #140
Comments
This file does not cause problems in my test environment. I tested it with Which version of pdftotext are you using? |
I'am using |
Errors from pdftotext and pdfinfo are now logged to the ke_search error log in order to make it easier to find the problematic files.
Errors from pdftotext and pdfinfo will now be logged to the ke_search error log. That should make it at least easier to find the problematic files. |
I got the following errors in my cronjob:
Syntax Error: Marked object is wrong type (boolean) Syntax Error: Marked object is wrong type (boolean) Syntax Error: Marked object is wrong type (boolean) Syntax Error: Invalid object stream Syntax Error: Invalid object stream Syntax Error: Invalid object stream Syntax Error: Marked object is wrong type (boolean) Syntax Error: Marked object is wrong type (boolean) Syntax Error: Marked object is wrong type (boolean) Syntax Error: Marked object is wrong type (boolean) Syntax Error: Marked object is wrong type (boolean)
These errors are coming from pdftotext which apperantly has a problem reading the PDF files. At least the error handling could be improved so that in the ke_search log it is shown which files are problematic.
An example pdf file which causes an error is attached to the issue.
nvs-seminarprogramm-2023-1.pdf
The text was updated successfully, but these errors were encountered: