Automatically save best model during training #44

chriscyyeung · 2023-05-11T19:24:45Z

I think instead of (or maybe in addition to) saving after a certain number of epochs, we can save the best model based on the validation loss?

ungi · 2023-05-11T22:27:54Z

Finding the best model is not as simple as saving the one that has minimum loss function. Sometimes when training for extremely long (thousands of epochs), the model can learn a better (more general) representation of the data without decreasing the loss function. I couldn't find where I learned this, but it was related to this theory: https://medium.com/@MITIBMLab/estimating-information-flow-in-deep-neural-networks-b2a77bdda7a7

Saving the model regularly is generally a good practice. We could add an option like "model_save_frequency". E.g. if it's 5 then the model would be saved after every 5 epochs using names like model_005, model_010, etc. And we could save on the wandb report all the metrics for each saved model.

I also had positive experience in the past training for a few hundred more epochs after it seemed like the metrics did not improve.

…utput folder structure

chriscyyeung added a commit that referenced this issue May 18, 2023

Re #44: move save_frequency to command-line argument and reorganize o…

dcd588f

…utput folder structure

chriscyyeung added a commit that referenced this issue May 18, 2023

Re #44: move save_frequency to command-line argument and reorganize o…

f690be4

…utput folder structure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatically save best model during training #44

Automatically save best model during training #44

chriscyyeung commented May 11, 2023

ungi commented May 11, 2023

Automatically save best model during training #44

Automatically save best model during training #44

Comments

chriscyyeung commented May 11, 2023

ungi commented May 11, 2023