Performances of BERT on ACE2005 and MAVEN #4

Closed
alderpaw opened this issue May 17, 2021 · 4 comments

@alderpaw

Hi, thanks for your work on this dataset.
I notice that you compare the performance of BiLSTM and BERT on both ACE 2005 and MAVEN, and it seems that BiLSTM outperforms BERT on ACE 2005. However, some papers report different results. For example, in https://www.aclweb.org/anthology/P19-1522/ they report an F1 score above 80 with BERT, and in https://www.aclweb.org/anthology/2020.emnlp-main.435/ the results with BERT+MLP are better than DMBERT (76.2 vs. 74.9). What do you think of these results?

@Bakser
Member

Bakser commented Aug 16, 2021

Hi, thanks for your interest in our work. I didn't reproduce the two works, so I can only offer some intuitions here.

  1. PLMEE adds its own sophisticated mechanisms on top of BERT, so its results may reasonably be much higher than those of vanilla DMBERT.
  2. BERT+MLP is more similar to DMBERT, but still not the same model. Also, I am not sure whether they ran the experiments multiple times and reported averaged results; given the large variance on the small ACE 2005 dataset, I would not be surprised if the difference comes from randomness alone (see the sketch below).
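For illustration, a minimal sketch of reporting results averaged over several random seeds, which is what the variance argument above suggests; the F1 values below are placeholders, not actual numbers from any of these papers:

```python
# Sketch: report mean ± standard deviation of F1 over several runs,
# since a single run on the small ACE 2005 test set can be misleading.
import statistics

# Hypothetical F1 scores from runs with different random seeds (placeholders).
f1_scores = [74.1, 75.8, 73.5, 76.4, 74.9]

mean_f1 = statistics.mean(f1_scores)
std_f1 = statistics.stdev(f1_scores)  # sample standard deviation
print(f"F1 = {mean_f1:.1f} ± {std_f1:.1f} over {len(f1_scores)} runs")
```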

@Bakser Bakser closed this as completed Aug 16, 2021
@alderpaw
Author

alderpaw commented Sep 1, 2021

Thanks for your reply!
I wonder whether you use the same split as HMEAE. It seems to be different from the one used in https://github.com/nlpcl-lab/ace2005-preprocessing and leads to different results.
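(As a quick way to compare two splits, a minimal sketch along these lines could be used; it assumes each split is stored as a plain-text list of ACE 2005 document IDs, one per line, and the file names are hypothetical placeholders.)

```python
# Sketch: check whether two ACE 2005 split files contain the same documents.
def load_doc_ids(path):
    with open(path, encoding="utf-8") as f:
        return {line.strip() for line in f if line.strip()}

split_a = load_doc_ids("hmeae_test_split.txt")      # hypothetical path
split_b = load_doc_ids("nlpcl_lab_test_split.txt")  # hypothetical path

print("only in A:", sorted(split_a - split_b))
print("only in B:", sorted(split_b - split_a))
print("identical:", split_a == split_b)
```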

@Bakser
Member

Bakser commented Sep 1, 2021

Hi,
We use the same split as HMEAE, which is in fact also the same as the split you mentioned. However, I found that Ziqi uploaded a wrong split file with the example logs in the HMEAE repo, and we hadn't noticed this for a long time... We have now fixed the split file in the HMEAE repo.
Thanks for bringing this to our attention, and sorry for the inconvenience caused.

@alderpaw
Author

alderpaw commented Sep 1, 2021

Thanks for your help!
