Replies: 2 comments 9 replies
-
Hey @OriAlpha! I'm not sure to what exactly you are referring to with model definition. Could you please elaborate on what information you need? Then I can help you finding that information. :) |
Beta Was this translation helpful? Give feedback.
8 replies
-
I have tried whole process, but distilled model results in accurcay drop (i.e., around 10%), i tried for training for more epochs and dont see any improvements. any suggestion to improve distilled model performance. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi all,
I am trying to create a distill version of gelectra-base model, as known we need model defination. I could not find in paper not at least full? So is there any way i could achieve this?? or can i import from transformers as config
Beta Was this translation helpful? Give feedback.
All reactions