Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hocr-check complain assert doc.xpath("//meta[@name='ocr-id']")!=[] #101

Closed
CharlesNepote opened this issue Sep 15, 2016 · 2 comments
Closed

Comments

@CharlesNepote
Copy link
Contributor

I was trying to play with hocr-tools -- https://github.com/tmbdev/hocr-tools -- to merge hOCR and jpeg in a single PDF and notice that the hocr-check tools complains when I feed it with an hOCR file produced by gImageReader.

$ hocr-check 20160815.html
Traceback (most recent call last):
  File "/usr/local/bin/hocr-check", line 85, in <module>
    assert doc.xpath("//meta[@name='ocr-id']")!=[]
AssertionError

I'm not sure it's an issue because hOCR spec doesn't talk about 'ocr-id' tag...

@manisandro
Copy link
Owner

Well I suppose I could add it - what value to other OCR applications set for ocr-id?

@CharlesNepote
Copy link
Contributor Author

I close the issue as hocr-check developpers agreed ocr-id is not in the spec and should not be used.
They are going to correct hocr-check code.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants