Add a parameter to output or not object coordinates #69

kermitt2 opened this Issue Aug 20, 2015 · 2 comments



kermitt2 commented Aug 20, 2015

We have started to output the coordinates, in the original PDF, of some TEI elements (e.g. figure markers) as bounding boxes. As we will output coordinates for more objects, it would be nice to have a parameter in the full text processing APIs to choose whether or not to output these coordinates.

@kermitt2 kermitt2 self-assigned this Aug 20, 2015

@kermitt2 kermitt2 modified the milestones: 0.3.6, 0.3.9 Aug 20, 2015

@kermitt2 kermitt2 modified the milestones: 0.4.0, 0.3.9 Nov 19, 2015



nooralahzadeh commented Sep 13, 2016

I am wondering whether you provide this information for TEI elements like author, title, and so on?




kermitt2 commented Sep 13, 2016

Not at the moment, but area coordinates can be provided for any field or structure of the document, including those where the text is normalized. Currently, coordinates appear for bibliographical references and reference markers, which are used in the PDF annotation demos.

We're really open to suggestions on how to specify which coordinates to get in the output, given that outputting too many of them would make the result very heavy. We can also provide the coordinates as JSON annotations, independently of the XML TEI (see the web service referenceAnnotations). We're also still exploring how best to use those annotations to create a dynamic layer on top of a PDF rendered in the browser.
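To make the selection idea concrete, here is a minimal sketch of one possible approach: the caller names the element types it wants coordinates for, and everything else is left out. This is only an illustration. The names `BoundingBox` and `filter_coordinates`, the field layout of a box, and the element labels in the sample data are all hypothetical and are not taken from the existing API.

```python
from typing import Dict, Iterable, List, NamedTuple


class BoundingBox(NamedTuple):
    # Hypothetical layout: a page number plus a rectangle on that page.
    page: int
    x: float
    y: float
    width: float
    height: float


def filter_coordinates(
    coords_by_element: Dict[str, List[BoundingBox]],
    requested: Iterable[str],
) -> Dict[str, List[BoundingBox]]:
    """Return only the coordinates for the requested element types.

    An empty request yields an empty result, so coordinates stay off
    unless the caller asks for them explicitly.
    """
    wanted = set(requested)
    return {
        element: boxes
        for element, boxes in coords_by_element.items()
        if element in wanted
    }


# Hypothetical sample data, for illustration only.
sample = {
    "figure": [BoundingBox(2, 72.0, 100.0, 200.0, 150.0)],
    "biblStruct": [BoundingBox(9, 72.0, 300.0, 450.0, 24.0)],
    "ref": [BoundingBox(1, 310.0, 412.5, 12.0, 9.0)],
}

# Ask only for reference markers and bibliographical references.
selected = filter_coordinates(sample, ["ref", "biblStruct"])
```

Defaulting to no coordinates keeps the output light, which addresses the concern about heaviness, and a list of element names would let new object types be added later without changing the shape of the parameter.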
