ValueError: zero-size array to reduction operation maximum which has no identity #64
Comments
Hello, the problem is caused by the following faulty text-line in the PAGE XML which for some reason only has two coordinate points:

<TextLine id="l6">
  <Coords points="911,399 911,459"/>
  <TextEquiv>
    <Unicode/>
  </TextEquiv>
  <TextEquiv index="1">
    <Unicode>VANAMTatione fieri dicam Au-</Unicode>
  </TextEquiv>
</TextLine>

Was this PAGE XML created by using OCR4all only or were any changes done to the PAGE XML manually?
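For anyone who wants to check their own files for the same problem, here is a minimal sketch that lists text lines whose `Coords` polygon has fewer than three points. It is not part of OCR4all: the function name `find_degenerate_lines`, the `min_points` threshold and the second line `l7` in the sample are assumptions made up for illustration, and only the Python standard library is used.

```python
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Strip an optional '{namespace}' prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def find_degenerate_lines(page_xml: str, min_points: int = 3):
    """Return (line id, point count) for each TextLine with too few Coords points.

    Hypothetical helper, not an OCR4all function. A polygon needs at least
    three points, so min_points defaults to 3.
    """
    root = ET.fromstring(page_xml)
    bad = []
    for elem in root.iter():
        if local_name(elem.tag) != "TextLine":
            continue
        # Look only at the direct Coords child, not at Word/Glyph coordinates.
        coords = next((c for c in elem if local_name(c.tag) == "Coords"), None)
        points = coords.get("points", "").split() if coords is not None else []
        if len(points) < min_points:
            bad.append((elem.get("id"), len(points)))
    return bad


# "l6" is the line quoted above; "l7" is an invented, valid line for contrast.
sample = """<TextRegion id="r1">
  <TextLine id="l6">
    <Coords points="911,399 911,459"/>
  </TextLine>
  <TextLine id="l7">
    <Coords points="0,0 10,0 10,10 0,10"/>
  </TextLine>
</TextRegion>"""

print(find_degenerate_lines(sample))  # [('l6', 2)]
```

To check a file on disk, its contents can be read into a string and passed to the function. Matching on the local tag name avoids hard-coding a particular PAGE schema namespace.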
Oh, great, thank you! Maybe it's useful for you to know that the pagexml is the result of the new function that converts legacy data into pagexml.
That's actually pretty useful, thank you.
I hope this helps.
Thank you for the files.
Hello! During the recognition process the following error is thrown and recognition stops making progress, although the status still reads "Status: ERROR: The process is still running" (OCR4all ver 0.3.0, LAREX ver 0.3.1):
I have attached the PAGEXML and image files where the error occurs. I couldn't find anything suspicious here.
0125.txt