
difference between implementation and LCF-paper #10

Closed
fhamborg opened this issue Aug 6, 2020 · 3 comments

Comments


fhamborg commented Aug 6, 2020

There is a difference between LCF-BERT as described in the paper and the implementation here. Specifically, the paper describes three self-attention layers (see, for example, Figure 3 in https://www.mdpi.com/2076-3417/9/16/3389/htm): one on the output of local BERT, one on the output of global BERT, and one in the feature interactive learning layer.

However, in the implementation here there is only one self-attention, and it is applied only to the local BERT output. So the self-attention on the global BERT output and the self-attention in the feature interactive learning layer are currently missing from the code. Could you let me know why that is?
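For reference, here is a minimal sketch (my own illustration, not code from either repository) of where the three MHSA blocks sit in the paper's Figure 3 versus the single MHSA here. The use of `torch.nn.MultiheadAttention`, the hidden size, and the dummy tensor shapes are assumptions made purely for illustration:

```python
# Hypothetical sketch of the two variants; not the authors' actual code.
import torch
import torch.nn as nn

class SelfAttention(nn.Module):
    """Thin wrapper: multi-head self-attention over a (batch, seq, hidden) tensor."""
    def __init__(self, hidden=768, heads=12):
        super().__init__()
        self.mha = nn.MultiheadAttention(hidden, heads, batch_first=True)

    def forward(self, x):
        out, _ = self.mha(x, x, x)
        return out

hidden = 768
sa_local, sa_global, sa_interactive = SelfAttention(), SelfAttention(), SelfAttention()
linear_double = nn.Linear(hidden * 2, hidden)  # "feature interactive learning" projection

def lcf_paper(local_feat, global_feat):
    # Paper (Fig. 3): MHSA on the CDM/CDW-weighted local features, MHSA on the
    # global features, concatenate, project, then a third MHSA.
    local_feat = sa_local(local_feat)
    global_feat = sa_global(global_feat)
    cat = torch.cat([local_feat, global_feat], dim=-1)
    return sa_interactive(linear_double(cat))

def lcf_this_repo(local_feat, global_feat):
    # This repository (per the linked lcf_bert.py): only one MHSA, applied to the
    # local features before linear_double; no MHSA on global output or after projection.
    local_feat = sa_local(local_feat)
    cat = torch.cat([local_feat, global_feat], dim=-1)
    return linear_double(cat)

# Example with dummy BERT-sized features:
local = torch.randn(2, 80, hidden)  # output of BERT-local after CDM/CDW weighting
glob = torch.randn(2, 80, hidden)   # output of BERT-global
print(lcf_paper(local, glob).shape, lcf_this_repo(local, glob).shape)
```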


fhamborg commented Aug 6, 2020

Not sure if it's relevant, but the implementation at https://github.com/songyouwei/ABSA-PyTorch/blob/master/models/lcf_bert.py differs both from the implementation here and from the description in the paper. It also uses only one self-attention (compared to the three mentioned in the paper), but applies it after linear_double (https://github.com/songyouwei/ABSA-PyTorch/blob/b925abcdf43c51e475a64ed04984d4911b88676d/models/lcf_bert.py#L116), whereas in this repository the self-attention is applied before linear_double (https://github.com/yangheng95/LC-ABSA/blob/c945a94e0f86116c5578245aa9ad36c46c7b9c4a/models/lc_apc/lcf_bert.py#L56).
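For comparison, continuing the hypothetical sketch from my previous comment (same assumed modules, not their actual code), the ABSA-PyTorch ordering would look roughly like this:

```python
def lcf_absa_pytorch(local_feat, global_feat):
    # ABSA-PyTorch variant (per the linked line): the single self-attention is
    # applied AFTER linear_double, i.e. to the projected concatenation, rather
    # than to the local features as in this repository.
    cat = torch.cat([local_feat, global_feat], dim=-1)
    return sa_interactive(linear_double(cat))
```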

@yangheng95
Owner

Thank you for your interest. I modified the model when I cleaned and refactored the code, because I found that the model can achieve better results in some cases without the last MHSA, and removing it also reduces computation. Since the paper has already been published, we are only able to revise minor issues in it.

@fhamborg
Copy link
Author

fhamborg commented Aug 7, 2020

Alright, thanks for letting me know. Since you mentioned that the current implementation seems to perform better than the one described in the paper, I'll also stick with the current implementation. FYI: note the other issue opened in the ABSA-PyTorch repository, where there is also a difference between their implementation and your implementation here.

@fhamborg fhamborg closed this as completed Sep 1, 2020