
The problems of mismatched evaluation metrics #27

Closed
Root970103 opened this issue Jul 20, 2021 · 4 comments

Comments

@Root970103

Hi, thank you for your excellent work. I tried to reproduce your results with the config file named default.yaml, but I cannot get the same result (BLEU = 0.74). I also found that the training loss increases after a few epochs. Can you give some advice?

[screenshot: training loss curve]

@lukas-blecher
Owner

Hi, thank you for your interest in the project.
I've had the same problem and I think it is because of the learning rate scheduler. I've created a discussion post here: #11
Restarting the training is not an optimal solution but it seems to work. As stated, maybe another scheduler is a better fit. Any ideas?

Also, I'm using config.yaml right now.

@Root970103
Author

Thanks for your reply. I have also been looking into possible causes of this problem. I suspect either the optimizer or the lr scheduler could be responsible, so I will try different configurations for both (sketched below) to find the best result.

Also, are you currently using the approach of manually stopping the training and then retraining with the previous model as the starting point?
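A minimal sketch of that experiment in plain PyTorch (the build_optimization helper and the concrete hyperparameter values are illustrative placeholders, not taken from the repository's configs):

```python
import torch
from torch import nn, optim

def build_optimization(model: nn.Module, optimizer_name: str, scheduler_name: str):
    # Vary only the component under suspicion; keep the base learning rate fixed.
    if optimizer_name == "adam":
        opt = optim.Adam(model.parameters(), lr=1e-4)
    elif optimizer_name == "adamw":
        opt = optim.AdamW(model.parameters(), lr=1e-4)
    else:
        raise ValueError(f"unknown optimizer: {optimizer_name}")

    if scheduler_name == "steplr":
        sched = optim.lr_scheduler.StepLR(opt, step_size=10, gamma=0.5)
    elif scheduler_name == "cosine":
        sched = optim.lr_scheduler.CosineAnnealingLR(opt, T_max=50)
    else:
        raise ValueError(f"unknown scheduler: {scheduler_name}")
    return opt, sched
```

Running the same training once per combination should make it clearer whether the loss increase follows the optimizer or the scheduler.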

@lukas-blecher
Owner

I interrupt the training after some time (I keep an eye on the progress in wandb and interrupt when the loss stagnates) and then resume from the last saved checkpoint (a checkpoint is saved on KeyboardInterrupt once the first epoch has finished).

python train.py --config path-to-checkpoints/model-name/config.yaml --resume

something like that.
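For illustration, a rough sketch of that interrupt-and-resume pattern (the run_one_epoch stub, the checkpoint path, and the save format here are placeholders, not necessarily what train.py actually does):

```python
import torch
from torch import nn, optim

def run_one_epoch(model, optimizer):
    # Stand-in for the real training loop over the data loader.
    pass

def train(model, optimizer, scheduler, epochs, checkpoint_path="checkpoint.pth"):
    epoch = 0
    try:
        for epoch in range(epochs):
            run_one_epoch(model, optimizer)
            scheduler.step()
    except KeyboardInterrupt:
        # Save everything needed to resume training later, then stop cleanly.
        torch.save({
            "epoch": epoch,
            "model": model.state_dict(),
            "optimizer": optimizer.state_dict(),
            "scheduler": scheduler.state_dict(),
        }, checkpoint_path)
        print(f"Interrupted, checkpoint written to {checkpoint_path}")
```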

@lukas-blecher
Owner

With the StepLR scheduler this does not happen anymore.
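For anyone who wants to try the same change, here is a sketch of a StepLR setup (torch.optim.lr_scheduler.StepLR is the standard PyTorch class; the concrete step_size and gamma values below are guesses, not the ones used in this repo):

```python
import torch
from torch import nn, optim

model = nn.Linear(10, 10)  # stand-in for the real model
optimizer = optim.Adam(model.parameters(), lr=1e-4)

# Multiply the learning rate by gamma every `step_size` scheduler steps,
# giving a monotonically decreasing, non-cyclic schedule.
scheduler = optim.lr_scheduler.StepLR(optimizer, step_size=30, gamma=0.1)

for epoch in range(100):
    # ... training and validation for one epoch ...
    scheduler.step()
```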
