
Reproducing the results of the paper: there may be something wrong with my pre-processing steps #1

Open
lingxiaoxue opened this issue Oct 13, 2023 · 0 comments

Comments


lingxiaoxue commented Oct 13, 2023

I used the L=3, K=6 Transformer-Base configuration on WMT15 De-En to reproduce the results of the paper, but my BLEU score is 1.64 points lower than the one reported.
[screenshot: BLEU results from the paper vs. my reproduction]

As shown in the screenshot above, the paper reports BLEU 29.29, while my reproduced result is BLEU 27.65.
My training and inference steps are the same as those on GitHub, so I suspect something is wrong with my pre-processing steps.

I started from prepare-wmt14en2de.sh (https://github.com/ictnlp/HMT/blob/main/examples/translation/prepare-wmt14en2de.sh), changed it to WMT15, and deleted lines 114-118. As in the paper, I use newstest2013 (3,000 pairs) as the validation set and newstest2015 (2,169 pairs) as the test set; a sketch of the modified valid/test section is shown below.
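Roughly, my modified valid/test extraction looks like the following. This is only a sketch in the style of the original script; the SGM file names and paths ($orig/dev/..., newstest2015-deen-*) are assumptions about my local copies of the WMT dev/test tarballs, not part of the original script.

echo "pre-processing valid/test data..."
for l in $src $tgt; do
    if [ "$l" == "$src" ]; then
        t="src"
    else
        t="ref"
    fi
    # newstest2013 -> validation set (3,000 pairs); file names are assumptions
    grep '<seg id' $orig/dev/newstest2013-$t.$l.sgm | \
        sed -e 's/<seg id="[0-9]*">\s*//g' \
            -e 's/\s*<\/seg>\s*//g' \
            -e "s/\’/\'/g" | \
        perl $TOKENIZER -threads 8 -a -l $l > $tmp/valid.$l
    # newstest2015 -> test set (2,169 pairs); file names are assumptions
    grep '<seg id' $orig/test/newstest2015-deen-$t.$l.sgm | \
        sed -e 's/<seg id="[0-9]*">\s*//g' \
            -e 's/\s*<\/seg>\s*//g' \
            -e "s/\’/\'/g" | \
        perl $TOKENIZER -threads 8 -a -l $l > $tmp/test.$l
done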

I also set BPE_TOKENS=32000; the rest of the steps remain the same:
python $BPEROOT/learn_bpe.py -s $BPE_TOKENS < $TRAIN > $BPE_CODE
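After learning the codes, I apply BPE to all splits following the same pattern as the original script. Treat this as a sketch of my step (the bpe.* naming mirrors the script's conventions) rather than an exact copy of what I ran:

for L in $src $tgt; do
    for f in train.$L valid.$L test.$L; do
        echo "apply_bpe.py to ${f}..."
        # apply the learned codes to every split
        python $BPEROOT/apply_bpe.py -c $BPE_CODE < $tmp/$f > $tmp/bpe.$f
    done
done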


Finally, I perform length filtering:
perl $CLEAN -ratio 1.5 $tmp/bpe.train $src $tgt $prep/train 1 250
perl $CLEAN -ratio 1.5 $tmp/bpe.valid $src $tgt $prep/valid 1 250
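For completeness, this is roughly how I then binarize the data with fairseq-preprocess. The flags below (in particular whether to use --joined-dictionary) are my assumptions and may not match what the authors used:

# test.* gets no length filtering, so it is copied over directly, as in the original script
for L in $src $tgt; do
    cp $tmp/bpe.test.$L $prep/test.$L
done

fairseq-preprocess \
    --source-lang de --target-lang en \
    --trainpref $prep/train --validpref $prep/valid --testpref $prep/test \
    --destdir data-bin/wmt15_de_en \
    --joined-dictionary --workers 20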

Before pre-processing:
[screenshot: corpus statistics before filtering]
After pre-processing:
[screenshot: corpus statistics after filtering]

Can you help analyze the problem, or provide a pre-processing script?
