-
Notifications
You must be signed in to change notification settings - Fork 320
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Aboat training #50
Comments
|
I wanted to make sure I understood is correctly. Is this how the complete training goes:
If that is the case; can you please answer the following questions:
|
Hi, by looking at the provided log files; it feels you are first training for 600K iterations woTSA and then using this pre-trained model, further training for another 600K iterations with TSA module in place. Is this correct? Why fine-tuning based on TSA required equal amount of training? |
|
In the training .yml file,what does the "ft_tsa_only" mean?
when I doesn't use it, prompted WARNING: Offset mean is XX, larger than 100.
and the loss is larger than use the "ft_tsa_only" to train
The text was updated successfully, but these errors were encountered: