Hello. I ran the code in this repository and have some questions about it:
I used the script to run fine-tuning on the CoNLL task and noticed that the weights of the 2-layer MLP (denoted h_phi in the paper) do not change as training progresses. The reason is that the per-layer learning rates for h_phi are initialized to 0 and set to untrainable.
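Roughly, the pattern looks like the following (a minimal PyTorch sketch of what I mean; the variable names here are my own illustration, not the repo's exact code):

```python
import torch
import torch.nn as nn

num_warp_layers = 2  # e.g. the two layers of the h_phi MLP

# Per-layer inner-loop learning rates for h_phi: created at zero and
# frozen (requires_grad=False), so fine-tuning never updates them and,
# through them, never updates h_phi itself.
warp_layer_lrs = nn.Parameter(torch.zeros(num_warp_layers), requires_grad=False)
```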
This means the task-specific weights of h_phi are never adapted in the inner loop, which seems inconsistent with what the paper says.
The learning rates for the 2-layer MLP denoted g_psi in the paper seem redundant, since they are not used during the adaptation phase.
Hi. As you can see from the code snippet you shared, these are treated as warp layers, and hence they are not adapted during fine-tuning. What you are seeing is the intended behavior.
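To illustrate the distinction, here is a minimal PyTorch sketch of the general pattern (the layer names and sizes are made up for illustration, not taken from this repo): only the task parameters receive inner-loop gradient steps, while the warp layers stay fixed per task and are updated in the outer (meta) loop.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Task parameters (theta) are adapted per task; the warp layer (h_phi)
# is held fixed during adaptation and meta-learned across tasks.
task_layer = nn.Linear(8, 8)   # adapted in the inner loop
warp_layer = nn.Linear(8, 8)   # warp layer: frozen during fine-tuning

x, y = torch.randn(4, 8), torch.randn(4, 8)

# Inner loop: the optimizer only sees the task parameters, so the
# warp layer's weights are untouched no matter how many steps run.
inner_opt = torch.optim.SGD(task_layer.parameters(), lr=0.1)
for _ in range(5):
    loss = F.mse_loss(warp_layer(task_layer(x)), y)
    inner_opt.zero_grad()
    loss.backward()
    inner_opt.step()

# An outer (meta) loop would update warp_layer across tasks instead.
```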