Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finetuning Donut on FUNSD dataset #3

Closed
satheeshkatipomu opened this issue Jul 25, 2022 · 1 comment
Closed

Finetuning Donut on FUNSD dataset #3

satheeshkatipomu opened this issue Jul 25, 2022 · 1 comment

Comments

@satheeshkatipomu
Copy link

Hi,

Thank you for open sourcing DONUT and SynthDoG. I have two requests.

  1. After Pre-Training("how to read"/pseudo-OCR task), is there a documentation about how to finetune("how to understand") on a different dataset like FUNSD?
  2. Can we generate synthetic documents resembling forms/invoices using SyntheticDoG, if yes can you provide hints whether we need a template or something?
@gwkrsrch
Copy link
Collaborator

Hi, thank you for your interest on our work :)

For (1), I hope the following links would be helpful,

For (2), I would like to say that the SynthDoG's purpose is not to create synthetic data resembling actual forms/invoices. The purpose is just to create a simple synthetic document with texts. The link below seems to be helpful to you.

Hope this helps. Please let me know if you are still confused.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants