
Attention Weight Bug in Embedding #2

Closed
kandorm opened this issue Apr 1, 2019 · 1 comment

Comments

kandorm commented Apr 1, 2019

https://github.com/xhuang31/KEQA_WSDM19/blob/f923a15cc732e8844c26e4c653c306c90e067734/embedding.py#L56

There is a bug in this line: `self.attn(torch.cat((x, outputs), 1))` generates a tensor of size (seq_len*batch_size, 1), so the softmax over that size-1 dimension returns 1.0 for every position.
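
A minimal sketch of the reported behavior (the shapes and the random tensor standing in for `self.attn(...)` are assumptions, not the actual embedding.py code):

```python
import torch
import torch.nn.functional as F

seq_len, batch_size = 4, 2
# Stand-in for self.attn(torch.cat((x, outputs), 1)):
# one score per position, shape (seq_len * batch_size, 1).
scores = torch.randn(seq_len * batch_size, 1)

# Softmax over the last dimension normalizes each row against itself,
# so every attention weight comes out as exactly 1.0.
buggy_weights = F.softmax(scores, dim=1)
print(buggy_weights.squeeze())  # tensor([1., 1., 1., 1., 1., 1., 1., 1.])
```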


xhuang31 commented Apr 2, 2019

Yes, you are right. It should be

`attn_weights = F.softmax(self.attn(torch.cat((x, outputs), 1)), dim=0)`

Then we would need different hyperparameters. I will leave this issue open and fine-tune the updated model when I get time.

Thanks for your comments.
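
A hedged sketch of how the corrected line might sit in a forward pass; the names `x`, `outputs`, and `self.attn` come from the thread, while the module structure and shapes are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttnSketch(nn.Module):
    """Illustrative attention module, not the actual KEQA_WSDM19 code."""

    def __init__(self, hidden_size: int):
        super().__init__()
        # Maps concatenated (x, outputs) features to one score per position.
        self.attn = nn.Linear(hidden_size * 2, 1)

    def forward(self, x, outputs):
        # x, outputs: (seq_len * batch_size, hidden_size)
        # Corrected line: normalize over dim=0 (the positions), not over
        # the size-1 score dimension. Note that dim=0 here spans
        # seq_len * batch_size, exactly as discussed in the issue.
        attn_weights = F.softmax(self.attn(torch.cat((x, outputs), 1)), dim=0)
        # Weighted sum of outputs using the attention weights.
        return (attn_weights * outputs).sum(dim=0)

model = AttnSketch(hidden_size=8)
x = torch.randn(4 * 2, 8)
outputs = torch.randn(4 * 2, 8)
context = model(x, outputs)  # shape: (8,)
```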
