Skip to content
This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

fix bug with SlantedTriangular LR scheduler #4443

Merged
merged 2 commits into from Jul 6, 2020
Merged

Conversation

epwalsh
Copy link
Member

@epwalsh epwalsh commented Jul 6, 2020

There is a bug related to gradual unfreezing with the Slanted Triangular Learning Rate schedule. During the first epoch, only the top layer is trained (as expected) but, during the 2nd epoch, the top 3 layers are trained when it should just be the top 2:

image

This also causes training to crash on the last "freezing" epoch:

image

@epwalsh epwalsh requested a review from matt-gardner July 6, 2020 17:51
@epwalsh epwalsh merged commit 8482f02 into master Jul 6, 2020
@epwalsh epwalsh deleted the slanted-triangular-fix branch July 6, 2020 18:04
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants