-
Notifications
You must be signed in to change notification settings - Fork 466
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Finetuning on DONUT-proto #22
Comments
+1! Also, if you ever have results for docvqa with donut-proto, I'd love to seem them 🥇 |
Hi, @gwkrsrch @logan-markewich . Seems that when the input size is set as [1024, 768], donut-proto works on finetuning, observed from the DONUT-base config that pretrain[2560x1920] -> finetuning[1280x960]. Does it make sense? |
Hi @Veason-silverbullet, yes it make sense :) The window size of donut-proto is 8, hence, there should be no problem if the size of each axis is set to a multiple of 256 (e.g., 768, 1024, 2048, etc). |
@Veason-silverbullet I made a toy example notebook about training and testing |
@gwkrsrch , yes the above example notebook works well! BTW, do you plan to release the |
Hi, @gwkrsrch ,
It works well in the case of DONUT-base, but DONUT-proto does not. Could you please provide the finetuning YAML configuration file of DONUT-proto? Many thanks for your effort!
The text was updated successfully, but these errors were encountered: