Hi, thanks for your great work.
According to the paper, the predecessor module's weights are frozen after it has been fine-tuned on the task data (including the embedding layer and the output classifier).
In the code, however, if my understanding is correct, the fine-tuned predecessor weights are not frozen; instead, the loss can back-propagate into those parameters.
So which behavior is the intended one? Thanks in advance.
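For reference, this is what I would expect the freezing described in the paper to look like — a minimal PyTorch sketch, where `freeze_module` and `predecessor` are hypothetical names of my own, not identifiers from this repo:

```python
import torch.nn as nn

def freeze_module(module: nn.Module) -> None:
    """Freeze all parameters of a module so the loss cannot
    back-propagate into them."""
    for param in module.parameters():
        param.requires_grad = False
    # Also switch to eval mode so dropout is disabled and
    # batch-norm running statistics are not updated.
    module.eval()

# Hypothetical usage: `predecessor` stands in for the fine-tuned
# predecessor model, including its embedding and output classifier.
# freeze_module(predecessor)
```

I couldn't find an equivalent of this in the training code, which is why I'm asking.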