Hi, I have some suggestions for features:

1. Can you add an initial-warmup variant, so that the warmup steps apply only to the first cycle, like the one shown here?
2. Can you infer the `max_lr` value from the optimizer's base learning rate for each parameter group? Currently, if the optimizer has multiple groups with different learning rates, all of them get overridden by `max_lr`.

BTW, very awesome implementation!
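To make both requests concrete, here is a rough sketch of the behaviour I have in mind. It is plain Python rather than a patch against the scheduler, and every name in it (`lr_factor`, `group_lrs`, `cycle_steps`, `warmup_steps`, `min_factor`) is hypothetical, not something from this repo. It assumes fixed-length cycles, linear warmup, and cosine decay; the key idea is to compute a multiplier and scale each group's own base learning rate by it, instead of overwriting every group with one `max_lr`.

```python
import math


def lr_factor(step, cycle_steps, warmup_steps, min_factor=0.0):
    """Return a multiplier in [min_factor, 1] for the given global step.

    Hypothetical helper: warmup is applied in the first cycle only;
    every later cycle restarts directly at the peak and decays.
    Assumes 0 <= warmup_steps < cycle_steps.
    """
    if step < warmup_steps:
        # Linear warmup, first cycle only.
        return min_factor + (1.0 - min_factor) * step / warmup_steps
    if step < cycle_steps:
        # Remainder of the first cycle: cosine decay after the warmup.
        progress = (step - warmup_steps) / (cycle_steps - warmup_steps)
    else:
        # Later cycles: no warmup, cosine decay over the whole cycle.
        progress = ((step - cycle_steps) % cycle_steps) / cycle_steps
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return min_factor + (1.0 - min_factor) * cosine


def group_lrs(base_lrs, step, cycle_steps, warmup_steps):
    """Scale each group's own base LR, so the peak is inferred per group."""
    factor = lr_factor(step, cycle_steps, warmup_steps)
    return [base * factor for base in base_lrs]


# Two parameter groups with different base learning rates.
base_lrs = [1e-3, 1e-4]
for step in (0, 5, 10, 100, 105):
    print(step, group_lrs(base_lrs, step, cycle_steps=100, warmup_steps=10))
```

If I understand PyTorch correctly, a multiplier like this could also be passed to `torch.optim.lr_scheduler.LambdaLR`, which scales each group's initial learning rate by the returned value, so the per-group peaks would be preserved without a separate `max_lr` argument.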