Best model in ModelCheckpoint #1960

mpariente · 2020-05-26T15:09:00Z

🚀 Feature

Add a best_model attribute in ModelCheckpoint.

Motivation

After training, it would be nice to easily have the path to the checkpoint with the best val loss.

Or is there an argument in the trainer to resume best state after training and I missed it? (totally possible)


mpariente · 2020-05-26T20:52:42Z

BTW, this is my solution but I don't think this should be necessary

best_k = checkpoint.best_k_models  # maps checkpoint path -> monitored value
best_path = [b for b, v in best_k.items() if v == min(best_k.values())][0]

rohitgr7 · 2020-05-26T22:18:24Z

@mpariente A better condition would be:

best_k = checkpoint.best_k_models
best_path = [b for b, v in best_k.items() if v == checkpoint.best][0]

checkpoint.mode can be max though.
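To make the point about mode concrete, here is a minimal sketch of a mode-aware lookup. It works on a plain dict shaped like best_k_models (checkpoint path mapped to monitored value); the helper name best_checkpoint_path is hypothetical and not part of Lightning, and the values are assumed to be comparable numbers.

```python
def best_checkpoint_path(best_k_models, mode="min"):
    """Return the path whose monitored value is best under `mode`.

    `best_k_models` is assumed to map checkpoint path -> monitored value,
    like ModelCheckpoint.best_k_models. This helper is hypothetical.
    """
    if not best_k_models:
        raise ValueError("no checkpoints recorded")
    if mode not in ("min", "max"):
        raise ValueError("mode must be 'min' or 'max'")
    # Pick the smallest value for 'min' and the largest for 'max'.
    pick = min if mode == "min" else max
    return pick(best_k_models, key=best_k_models.get)


# Made-up paths and scores, for illustration only.
scores = {"a.ckpt": 0.9, "b.ckpt": 0.58, "c.ckpt": 0.7}
print(best_checkpoint_path(scores, mode="min"))
print(best_checkpoint_path(scores, mode="max"))
```

With a real callback this would presumably be called as best_checkpoint_path(checkpoint.best_k_models, checkpoint.mode), assuming mode holds 'min' or 'max'.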

mpariente · 2020-05-27T07:35:16Z

True, thanks, but this still doesn't change the fact that a best_model_path attribute should be integrated in the checkpoint IMO

HansBambel · 2020-05-27T08:35:28Z

The save_top_k parameter keeps the k best models. When set to -1, it saves all models. The latest master version also has a save_last parameter that additionally keeps the checkpoint from the latest epoch.
E.g.:

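import os
from pytorch_lightning.callbacks import ModelCheckpoint

# `net` is the model being trained; its class name becomes the filename prefix.
checkpoint_callback = ModelCheckpoint(
    filepath=os.path.join(os.getcwd(), "<your-folder>", "{epoch}-{val_loss:.6f}"),
    save_top_k=1,  # keep only the single best checkpoint
    verbose=False,
    monitor='val_loss',
    mode='min',  # lower val_loss is better
    prefix=net.__class__.__name__ + "_"
)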

This results in a file called: UNet_epoch=5-val_loss=0.581735.ckpt
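Since the monitored value ends up in the filename, it can be read back from there. The following is only a sketch: the helper monitored_value_from_filename is hypothetical, and it assumes filenames follow the "{name}={value}" pattern shown above.

```python
import re


def monitored_value_from_filename(filename, monitor="val_loss"):
    """Extract the number following '<monitor>=' in a checkpoint filename.

    Hypothetical helper; assumes names like
    'UNet_epoch=5-val_loss=0.581735.ckpt'.
    """
    pattern = re.escape(monitor) + r"=(-?\d+(?:\.\d+)?)"
    match = re.search(pattern, filename)
    if match is None:
        raise ValueError(f"{monitor!r} not found in {filename!r}")
    return float(match.group(1))


print(monitored_value_from_filename("UNet_epoch=5-val_loss=0.581735.ckpt"))
```

Given a list of such filenames, min(filenames, key=monitored_value_from_filename) would pick the one with the lowest val_loss.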

mpariente · 2020-05-27T10:33:12Z

Thanks for the example, I also use it like that.
Still, asking the user to reverse the dict or look for the best loss in the filenames is not very user-friendly. It only takes a few additional lines of code, but they shouldn't be necessary, don't you think?

williamFalcon · 2020-05-27T10:35:58Z

this is already in master.

#1799.

@kepler can you add to docs? (on the checkpoint page)

mpariente · 2020-05-27T10:38:26Z

Oh ok, sorry for bothering then, I should have checked master
Thanks for the pointer William

williamFalcon · 2020-05-28T18:07:48Z

it's a great idea :) just someone beat you to it haha

mpariente added the labels "feature" (Is an improvement or enhancement) and "help wanted" (Open to be worked on) on May 26, 2020

mpariente closed this as completed May 27, 2020
