Finetuning Donut on FUNSD dataset #3

satheeshkatipomu · 2022-07-25T10:03:06Z

Hi,

Thank you for open sourcing DONUT and SynthDoG. I have two requests.

After Pre-Training("how to read"/pseudo-OCR task), is there a documentation about how to finetune("how to understand") on a different dataset like FUNSD?
Can we generate synthetic documents resembling forms/invoices using SyntheticDoG, if yes can you provide hints whether we need a template or something?

gwkrsrch · 2022-07-26T02:59:05Z

Hi, thank you for your interest on our work :)

For (1), I hope the following links would be helpful,

https://github.com/clovaai/donut#for-document-information-extraction
https://arxiv.org/abs/2111.15664
- especially, Section 2.4 and Appendix A.4 and Figure E.

For (2), I would like to say that the SynthDoG's purpose is not to create synthetic data resembling actual forms/invoices. The purpose is just to create a simple synthetic document with texts. The link below seems to be helpful to you.

Hope this helps. Please let me know if you are still confused.

gwkrsrch closed this as completed Jul 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Finetuning Donut on FUNSD dataset #3

Finetuning Donut on FUNSD dataset #3

satheeshkatipomu commented Jul 25, 2022

gwkrsrch commented Jul 26, 2022

Finetuning Donut on FUNSD dataset #3

Finetuning Donut on FUNSD dataset #3

Comments

satheeshkatipomu commented Jul 25, 2022

gwkrsrch commented Jul 26, 2022