-
Notifications
You must be signed in to change notification settings - Fork 112
performance after fine-tuning #100
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
What is the learning rate set to? how do the RNA-only training and test data construct? |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hello,
This model was pre-trained on Protein, DNA, and RNA data collectively. Therefore, I initially expected that fine-tuning with RNA-only data would improve its performance specifically for RNA structure prediction. However, the results did not align with my expectations.
Despite adjusting parameters such as lowering the learning rate and tuning ema_decay, the more I fine-tuned the model, the worse its performance became on the test RNA dataset. In the end, the pretrained model remained the best-performing version.
Do you have any advice on why this might be happening or how I could improve the fine-tuning process?
The text was updated successfully, but these errors were encountered: