Where do classes get added as special tokens? #10
Comments
Hi, @NielsRogge For (1), For (2), I found 5 donut repositories at Kind regards,
Hi, Thanks for updating that :) Regarding uploading the checkpoints, I can open up PRs on your repos. I'll open a PR on the Transformers repository today to add the model to the library. Will update you.
Great! I will wait for the updates :)
Hi, I'll open PRs on the repos today. When these are merged, I can also merge the Donut PR linked above. Kind regards, Niels
Hi @gwkrsrch, I've opened PRs on all 8 repos. Feel free to review and merge them :)
Hi @NielsRogge, I have just checked that all model repositories got the PR and merged it successfully. And I see the huggingface update is now available. Awesome :) I will close this issue since the related updates are completed and merged.
Hi,
I've implemented Donut in a fork of HuggingFace Transformers, and I'll add it to the library soon. The model is implemented as an instance of VisionEncoderDecoderModel, which allows combining any vision Transformer encoder (like ViT, Swin) with any text Transformer as decoder (like BERT, GPT-2, etc.). Since Donut does exactly that, it was straightforward to implement it that way.
Here's a notebook that shows inference with it.
I do have 2 questions though:
When creating an instance of DonutDataset with this dataset, it seems only "<s_class>", "</s_class>" and "<s_rvlcdip>" are added as special tokens. But looking at this file, it seems that special tokens are also defined for each class. Looking at the code, it seems only the keys of the dictionaries are added, not the values.
Let me know what you think!
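For illustration, here is a minimal sketch of the behavior in question. It is not the Donut code: the function collect_special_tokens, its include_values flag, the "<value/>" token format, and the sample ground truth are all assumptions made up for this example. It only shows the difference between collecting tokens from dictionary keys and also collecting them from values; the task start token "<s_rvlcdip>" is not covered.

```python
from typing import Any, List


def collect_special_tokens(obj: Any, include_values: bool = False) -> List[str]:
    """Walk a ground-truth JSON object and collect candidate special tokens.

    Hypothetical helper for illustration only, not part of Donut.
    """
    tokens: List[str] = []
    if isinstance(obj, dict):
        for key, value in obj.items():
            # Every key contributes an opening and a closing token.
            tokens += [f"<s_{key}>", f"</s_{key}>"]
            tokens += collect_special_tokens(value, include_values)
    elif isinstance(obj, list):
        for item in obj:
            tokens += collect_special_tokens(item, include_values)
    elif include_values:
        # Assumed format: a leaf value (e.g. a class name) becomes one token.
        tokens.append(f"<{obj}/>")
    # Deduplicate while preserving first-seen order.
    return list(dict.fromkeys(tokens))


# Made-up ground truth for a document classification sample.
ground_truth = {"class": "invoice"}

keys_only = collect_special_tokens(ground_truth)
with_values = collect_special_tokens(ground_truth, include_values=True)
print(keys_only)
print(with_values)
```

With keys only, the class name stays ordinary text that the tokenizer splits as usual; with values included, each class would get a dedicated token.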
Kind regards,
Niels
ML Engineer @ HuggingFace