Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finetuning on DONUT-proto #22

Closed
Veason-silverbullet opened this issue Aug 14, 2022 · 5 comments
Closed

Finetuning on DONUT-proto #22

Veason-silverbullet opened this issue Aug 14, 2022 · 5 comments

Comments

@Veason-silverbullet
Copy link

Hi, @gwkrsrch ,

It works well in the case of DONUT-base, but DONUT-proto does not. Could you please provide the finetuning YAML configuration file of DONUT-proto? Many thanks for your effort!

@logan-markewich
Copy link

+1!

Also, if you ever have results for docvqa with donut-proto, I'd love to seem them 🥇

@Veason-silverbullet
Copy link
Author

Hi, @gwkrsrch @logan-markewich . Seems that when the input size is set as [1024, 768], donut-proto works on finetuning, observed from the DONUT-base config that pretrain[2560x1920] -> finetuning[1280x960]. Does it make sense?

@gwkrsrch
Copy link
Collaborator

Hi @Veason-silverbullet, yes it make sense :) The window size of donut-proto is 8, hence, there should be no problem if the size of each axis is set to a multiple of 256 (e.g., 768, 1024, 2048, etc).

@gwkrsrch
Copy link
Collaborator

@Veason-silverbullet I made a toy example notebook about training and testing donut-proto on CORD. You can see the colab notebook at this link. Hope this helps :)

@Veason-silverbullet
Copy link
Author

Veason-silverbullet commented Aug 16, 2022

@gwkrsrch , yes the above example notebook works well!

BTW, do you plan to release the donut-proto pretraining YAML config file? I would like to train donut-proto from scratch on other datasets, so the example pretraining config file would help (e.g., optimizer settings). Thanks a lot!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants