Why is BLEU greater than 1? #3

LiHui1116 · 2022-01-03T10:50:19Z

According to Tables 4 and 6 of your paper, Towards Expressive Communication with Internet Memes: A New Multimodal Conversation Dataset and Benchmark, the reported BLEU scores are greater than 1. This contradicts the definition of BLEU, which requires values between 0 and 1.
I also checked the file task1_score.py and did not find any scaling factor applied in the BLEU calculation.
I look forward to your reply.

Tuan-Lee-23 · 2022-04-26T11:15:31Z

> According to Tables 4 and 6 of your paper, Towards Expressive Communication with Internet Memes: A New Multimodal Conversation Dataset and Benchmark, the reported BLEU scores are greater than 1. This contradicts the definition of BLEU, which requires values between 0 and 1. I also checked the file task1_score.py and did not find any scaling factor applied in the BLEU calculation. I look forward to your reply.

I noticed that the authors implemented the BLEU score incorrectly in task1_score.py. Specifically, they compute corpus-level BLEU (corpus_bleu from NLTK) on each single (reference, hypothesis) pair and then average the results over the corpus (dividing by the number of samples). That is wrong: corpus-level BLEU pools n-gram counts and lengths over the whole corpus rather than averaging per-pair scores.
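To illustrate why the two procedures differ, here is a minimal sketch in plain Python. It is not the code from task1_score.py and not NLTK's implementation: `corpus_bleu_simple` is a hypothetical, simplified BLEU (one reference per hypothesis, uniform weights up to bigrams, no smoothing), and the toy sentences are made up for the example.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu_simple(references, hypotheses, max_n=2):
    """Simplified corpus-level BLEU (illustrative assumption, not NLTK's code).

    Clipped n-gram matches and lengths are pooled over all pairs before the
    precisions and the brevity penalty are computed.
    """
    matches = [0] * max_n
    totals = [0] * max_n
    ref_len = hyp_len = 0
    for ref, hyp in zip(references, hypotheses):
        ref_len += len(ref)
        hyp_len += len(hyp)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp, n)
            ref_counts = ngrams(ref, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
            totals[n - 1] += sum(hyp_counts.values())
    if hyp_len == 0 or min(matches) == 0:
        return 0.0  # no smoothing: any zero precision gives a score of 0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity_penalty * math.exp(log_precision)


# Toy data, invented for this example.
references = [["the", "cat", "sat", "on", "the", "mat"], ["hello", "world"]]
hypotheses = [["the", "cat", "sat", "on", "a", "mat"], ["hello", "there"]]

# Corpus-level BLEU: statistics pooled over the whole corpus.
pooled = corpus_bleu_simple(references, hypotheses)

# The procedure described above: score each pair alone, then average.
per_pair = [corpus_bleu_simple([r], [h]) for r, h in zip(references, hypotheses)]
averaged = sum(per_pair) / len(per_pair)

print(f"pooled corpus BLEU:     {pooled:.4f}")
print(f"average of pair scores: {averaged:.4f}")
```

On this toy data the two numbers differ, and both stay within 0 and 1, so averaging changes the metric but cannot by itself push a score above 1.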

By the way, I think the BLEU scores reported in the paper are multiplied by 100 to express them as percentages.
