You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I appreciate very much this simple and effective approach to information extraction. My question is - can the model produce the bounding box for the extracted text?
As a workaround I am thinking of fuzzy matching the text an OCR with bounding boxes, but if the data is replicated on the page in multiple locations then it becomes difficult to know where the answer was copied from.
Thanks
The text was updated successfully, but these errors were encountered:
donut does not require any bounding box annotation/supervision during the model training. But, as a result, there are no actual boxes in the model output. Instead, you can get an attention heatmap that could be used for your purpose.
Or, you may try your fuzzy matching logic with the attention heatmap.
I hope this comment is useful to you. Please let me know if you are still confused.
Hi,
I appreciate very much this simple and effective approach to information extraction. My question is - can the model produce the bounding box for the extracted text?
As a workaround I am thinking of fuzzy matching the text an OCR with bounding boxes, but if the data is replicated on the page in multiple locations then it becomes difficult to know where the answer was copied from.
Thanks
The text was updated successfully, but these errors were encountered: