Answer bounding box #31

horiacristescu · 2022-08-19T14:59:33Z

Hi,

I appreciate very much this simple and effective approach to information extraction. My question is - can the model produce the bounding box for the extracted text?

As a workaround I am thinking of fuzzy matching the text an OCR with bounding boxes, but if the data is replicated on the page in multiple locations then it becomes difficult to know where the answer was copied from.

Thanks

gwkrsrch · 2022-08-21T08:44:14Z

Hi :)

#16 would be helpful to you.

donut does not require any bounding box annotation/supervision during the model training. But, as a result, there are no actual boxes in the model output. Instead, you can get an attention heatmap that could be used for your purpose.

Or, you may try your fuzzy matching logic with the attention heatmap.

I hope this comment is useful to you. Please let me know if you are still confused.

SamSamhuns · 2022-09-09T07:09:33Z

I've found some updates at #45

gwkrsrch closed this as completed Aug 21, 2022

leitouran mentioned this issue Sep 5, 2022

How to get the bounding boxes of the extracted entities? #16

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Answer bounding box #31

Answer bounding box #31

horiacristescu commented Aug 19, 2022

gwkrsrch commented Aug 21, 2022

SamSamhuns commented Sep 9, 2022

Answer bounding box #31

Answer bounding box #31

Comments

horiacristescu commented Aug 19, 2022

gwkrsrch commented Aug 21, 2022

SamSamhuns commented Sep 9, 2022