Semi-automatic structuring in metadata editor using OCR #60
Comments
@markusweigelt The linked feature sounds great. But I think it would be very useful to have the following functionality, which is in a way also the basis of the quite fancy functionality described there:
The main use case for me would be to allow the OCR to be done at the beginning of a workflow, even before people have done some quality assurance (missing pages etc.), so that the OCR does not have to wait, and to allow people to use the OCR results while structuring. And if people then make corrections in Kitodo, the OCR could be run only for newly added pages, for example. I am not quite sure whether those features could be addressed in the KITODO-OCRD project or whether they are something for the Kitodo development fund. What do you think?
Most of the things which kitodo/kitodo-production#5476 describes are new Kitodo UI features – out of scope for our OCR-D integration project, so yes, that would mean Kitodo development fund.
What we can do here is previewing OCR results with OCR-D browser. On the Kitodo side, for the intended extension, I think you're right – a simple plain text editor would suffice (one line per
Already possible (see
That's also something we (as integration project) have little control over, since it's a genuine UI feature. All we can do is ensure the filesystem side (
Yes, these are valid use cases, too. But renaming pages adds the difficulty of ensuring consistency (as long as OCR is still running). I'll try to reformulate under kitodo/kitodo-production#5476. For ocrd_kitodo, IMO we can already close (as it's already supported from our side).
Except perhaps the feature that we should skip pages which have already been processed earlier (an ALTO file exists).
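To make the skipping idea concrete, here is a minimal sketch of how already processed pages could be filtered out. It is not how ocrd_kitodo does (or will do) it: the function name `pages_needing_ocr`, the two directory arguments, the list of image suffixes, and the convention that an ALTO file shares its file name stem with the page image are all assumptions for illustration.

```python
from pathlib import Path


def pages_needing_ocr(image_dir, alto_dir,
                      image_suffixes=(".tif", ".tiff", ".jpg", ".png")):
    """Return the page images that have no ALTO file yet, sorted by name.

    Assumption: an ALTO result is named after its page image,
    e.g. 00000001.tif -> 00000001.xml (hypothetical convention).
    """
    image_dir = Path(image_dir)
    alto_dir = Path(alto_dir)

    # Stems of all pages that already have an OCR result.
    # A missing ALTO directory simply means nothing was processed yet.
    done = {p.stem for p in alto_dir.glob("*.xml")} if alto_dir.is_dir() else set()

    # Keep only image files whose stem has no matching ALTO file.
    return sorted(
        p for p in image_dir.iterdir()
        if p.is_file()
        and p.suffix.lower() in image_suffixes
        and p.stem not in done
    )
```

Matching purely by file name is exactly what becomes fragile once pages get renamed or reordered while OCR is still running, so a real implementation would need a more stable page identity than the stem.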
Great, thanks for your detailed answer!
In the Kitodo.Production repository, a detailed ticket for integrating automatic structuring was created:
kitodo/kitodo-production#5476