-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when Cleanup Scans / OCR #1288
Comments
Hey can you check if this works with 0.24.5 instead? |
it does not work with 0.24.5 either,it throws the same error |
the 0.22's ocr is ok |
Are you using the tag from docker hub or building it yourself ? |
for me 0.22.0 from docker hub does not work either here my stack
and here the stack trace from 0.22.0
|
^ please answer Also does it work on public instances like pdf.adminforge.de I can use it fine, maybe it's the pdf? |
docker run -d -i -t |
0.22.0 had known issues with OCR and really old |
Is /location/of/trainingData A actual location for you or badly copied? |
I'm not sure if you mean me my location of trainingData from the
and the location from
|
your docker run command says
|
That was another user with this path I posted my path above |
CAn you try delete all files other than deu.traineddata and restart |
i have deleted all files other than deu.traineddata and restarted = it does NOT work same error permissions have changed after i restarted the container, here from the
and here from
EDIT: if i do OCR clenup with eng.traineddata on an english pdf it works without problems, german pdf with deu.traineddata does NOT i also changed the permissions of deu.traieddata to root:root and tested = does NOT work either |
does it work on |
yes it works on https://pdf.adminforge.de/ocr-pdf with have you read my edit from my last post? EDIT: i deleted all folders and files and deployed the container from scratch and downloaded the deu.traieddata same problem german pdf with deu.traineddata does not work english pdf with eng.traineddata works |
Which deu.trainingdata are you using? |
Problem solved!!! THANK YOU VERY MUCH and sorry for creating a problem for you that you are not responsible for i can´t belive it, what the hell???? if i do a if i download it via browser and then upload it from my pc to my docker host the file is bigger and OCR works fine |
do you have an idea what i did wrong?? |
At a guess you downloaded the html page not the raw file |
you are right, what a dumb mistake i am really sorry this is the right download link and this did i download https://github.com/tesseract-ocr/tessdata/blob/main/deu.traineddata i downloaded |
np glad we got it figured out |
hello,
i get an error when trying to
Cleanup Scans / OCR
withgerman language
i got my deu.traindata from here https://github.com/tesseract-ocr/tessdata_fast/blob/main/deu.traineddata
App-Version: 0.24.6
here my stack file
and here the tessdata folder
here is the Stack-Trace
The text was updated successfully, but these errors were encountered: