You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
https://github.com/UB-Mannheim/Fibeln also has 11 files which contain text regions without text. Those files where also created using Transkribus. This indicates that it might be a general problem of that software.
The PAGE XML files contain lots of text regions without text in their
TextEquiv
and a few text files without text in theirTextEquiv
:Text from regions with text in lines but without text in the region gets lost when the PAGE XML file is converted to pure text using
ocr-transform
.The text was updated successfully, but these errors were encountered: