Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

a proposition to help hocr-tools become ZE best #174

Open
evanescente-ondine opened this issue Feb 2, 2022 · 2 comments
Open

a proposition to help hocr-tools become ZE best #174

evanescente-ondine opened this issue Feb 2, 2022 · 2 comments
Labels

Comments

@evanescente-ondine
Copy link

It would simplify people's life A LOT, if you could write a version of hocr-pdf that does everything on its own:
create the hOCR for all of a pdf's pages, merge them, then merge the resulting file with the pdf. and VOILÀ, no loss in the conversion, no mess, no fuss...
Perhaps allowing for changing the engine too.

@stweil
Copy link
Collaborator

stweil commented Feb 3, 2022

Why not simply use ocrmypdf?

@isspid
Copy link

isspid commented Mar 30, 2023

@stweil one reason could be that ocrmypdf only allows for using tesseract as an engine, out of the box.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants