Resetting of high scores, scheduler and optimizer for fine-tuning/domain adaptation #75

juliakreutzer · 2019-11-25T20:07:03Z

In the case of fine-tuning or domain adaptation, one might want to overwrite the existing scheduler or optimizer or the previously tracked high score.
This is specifically relevant when fine-tuning with a different metric or a data where the validation score is in a different range than the loaded model (e.g. pre-training on BLEU and fine-tuning on PPL would result in never storing a new checkpoint). When the data is much smaller than the previous training data, one might want to reduce the patience.

We can also think about modeling this as "load_x_from_ckpt" rather than "reset_x", but for now, it seems that in the default case we want to load everything.

dwhitena

This works for me. When the flags are set, I can see the checkpoints being saved in the filesystem and logged properly. Thanks for updating this!

juliakreutzer · 2019-11-25T20:56:40Z

Awesome, thanks for confirming this so quickly @dwhitena! And an official welcome as collaborator! 🎉

added resetting of highscores, scheduler and optimizer

3faa155

juliakreutzer requested a review from bastings November 25, 2019 20:07

reverted eval metric

19e3e1c

joeynmt requested review from dwhitena and removed request for bastings November 25, 2019 20:36

dwhitena approved these changes Nov 25, 2019

View reviewed changes

juliakreutzer merged commit 1f51b40 into master Nov 25, 2019

juliakreutzer deleted the reset_loading branch November 25, 2019 23:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resetting of high scores, scheduler and optimizer for fine-tuning/domain adaptation #75

Resetting of high scores, scheduler and optimizer for fine-tuning/domain adaptation #75

juliakreutzer commented Nov 25, 2019

dwhitena left a comment

juliakreutzer commented Nov 25, 2019

Resetting of high scores, scheduler and optimizer for fine-tuning/domain adaptation #75

Resetting of high scores, scheduler and optimizer for fine-tuning/domain adaptation #75

Conversation

juliakreutzer commented Nov 25, 2019

dwhitena left a comment

Choose a reason for hiding this comment

juliakreutzer commented Nov 25, 2019