Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Answer bounding box #31

Closed
horiacristescu opened this issue Aug 19, 2022 · 2 comments
Closed

Answer bounding box #31

horiacristescu opened this issue Aug 19, 2022 · 2 comments

Comments

@horiacristescu
Copy link

Hi,

I appreciate very much this simple and effective approach to information extraction. My question is - can the model produce the bounding box for the extracted text?

As a workaround I am thinking of fuzzy matching the text an OCR with bounding boxes, but if the data is replicated on the page in multiple locations then it becomes difficult to know where the answer was copied from.

Thanks

@gwkrsrch
Copy link
Collaborator

Hi :)

#16 would be helpful to you.

donut does not require any bounding box annotation/supervision during the model training. But, as a result, there are no actual boxes in the model output. Instead, you can get an attention heatmap that could be used for your purpose.

Or, you may try your fuzzy matching logic with the attention heatmap.

I hope this comment is useful to you. Please let me know if you are still confused.

@SamSamhuns
Copy link
Contributor

I've found some updates at #45

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants