
How to measure the embedding/representation difference between layers? #16

Closed
LifangD opened this issue Oct 9, 2019 · 2 comments
LifangD commented Oct 9, 2019

Can anyone explain how to draw such a figure? L2 distance or cosine similarity is usually computed between vectors, not between embedding matrices. Also, how is it measured over the whole dataset?
Thanks for any suggestions or references~

[attached figure]

brightmart (Owner) commented

Maybe you can take the first position, the [CLS] token, which represents the sequence at that layer. That gives you a hidden-state vector to represent that specific layer.
After you train the model, or load a checkpoint from a pre-trained model, you can measure it, since all the parameters are fixed. It need not relate to a downstream dataset.
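To make the suggestion above concrete, here is a minimal sketch of the measurement, assuming you have already extracted each layer's [CLS] hidden state for a batch of examples (the function name and array shapes are illustrative, not part of this repo):

```python
import numpy as np

def layer_similarity(hidden_states):
    """Compare consecutive layers' [CLS] vectors across a dataset.

    hidden_states: array of shape (num_layers, num_examples, hidden_dim),
    i.e. one [CLS] vector per layer for each example.
    Returns (cosine_sims, l2_dists), each of length num_layers - 1,
    averaged over the examples.
    """
    cos, l2 = [], []
    for prev, curr in zip(hidden_states[:-1], hidden_states[1:]):
        # per-example cosine similarity between adjacent layers,
        # then averaged over the dataset
        num = (prev * curr).sum(axis=1)
        den = np.linalg.norm(prev, axis=1) * np.linalg.norm(curr, axis=1)
        cos.append(float((num / den).mean()))
        # per-example L2 distance between the same two layers, averaged
        l2.append(float(np.linalg.norm(prev - curr, axis=1).mean()))
    return cos, l2

# toy usage: 4 "layers", 8 examples, 16-dim hidden states
states = np.random.default_rng(0).normal(size=(4, 8, 16))
cos, l2 = layer_similarity(states)
print(len(cos), len(l2))  # one value per pair of adjacent layers
```

With a real model you would fill `hidden_states` from the per-layer outputs over your dataset (e.g. each layer's output at position 0), then plot the resulting curves per layer index.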


LifangD commented Oct 11, 2019

> Maybe you can take the first position, the [CLS] token, which represents the sequence at that layer. That gives you a hidden-state vector to represent that specific layer.
> After you train the model, or load a checkpoint from a pre-trained model, you can measure it, since all the parameters are fixed. It need not relate to a downstream dataset.

OK, thanks~
