Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Construct SVG output files by putting together the recognition results for each page #139

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

kba
Copy link
Collaborator

@kba kba commented Nov 4, 2016

By @jze in #68

@kba kba mentioned this pull request Nov 4, 2016
@zuphilip zuphilip changed the title Construct SVG output files by putting together the recognition resuts for each page Construct SVG output files by putting together the recognition results for each page May 7, 2017
@zuphilip
Copy link
Collaborator

zuphilip commented May 7, 2017

Here is an example of the output (converted into a png file for easier handling).

I think it is more interesting to continue hocrjs for visualizing the hocr output. Thus, I would suggest to not take this PR in the repo. @kba What do you think about this?

@kba
Copy link
Collaborator Author

kba commented May 7, 2017

SVG is a nice format, portable, can be displayed in the browser and could also be scripted. I considered it for hocrjs but decided against it because hOCR is already HTML and easier to manipulate. If you want to share/compare a visual representation of the hOCR output, it seems like a good option. Github supports diffing SVGs, so you could use that to see what affects positioning etc.

@zuphilip
Copy link
Collaborator

zuphilip commented May 7, 2017

Okay, I see that there is some more value into this script. However, I am not sure we should have another top-level command ocropus-svg for that...

If you want to share/compare a visual representation of the hOCR output, it seems like a good option.

Wouldn't we need then something like a hocr2svg tool? Which I guess is possible by rewriting the lines here and could be saved in the hocr-tools...

We could also think about integrating these svg lines into ocropus-visualize-results. These could actually be some replacement for the nowadays obsolete csegs parts in this script.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants