Reusing HTML: Some text is missing in the first paragraph #96

amitdo opened this issue Oct 29, 2016 · 1 comment


amitdo commented Oct 29, 2016

Reusing HTML

This document describes a representation of various aspects of OCR output in an XML-like format. That is, we define as set of tags containing text and other tags, together with attributes of those tags. However, since the content we are representing is formatted text,

However, we are not actually using a new XML for the representation; instead embed the representation in

Some text is missing in the first paragraph.

define as set => define a set
instead embed => instead we embed


kba commented Nov 4, 2016

Some text is missing in the first paragraph

That's still from the Google Doc. I'm open to suggestions, and I'll mark it in the spec.

The typos I'll fix right away, thanks.

kba changed the title from "Reusing HTML" to "Reusing HTML: Some text is missing in the first paragraph" on Nov 4, 2016

kba added a commit that referenced this issue Nov 4, 2016