Trainer should run the test loop with the best weights when ModelCheckpoint is used #2046

yukw777 · 2020-06-01T22:59:24Z

🚀 Feature

Motivation

I noticed that even when ModelCheckpoint is used, Trainer by default runs the test loop with the last weights, not the best weights saved by ModelCheckpoint. I believe the sensible default here is to run the test loop with the best weights saved by ModelCheckpoint.

Pitch

Now that ModelCheckpoint has a pointer to the best weights, Trainer can replace the last weights with the best weights before running the test loop automatically.

Alternatives

Possibly, this could be another option on Trainer. I don't like this as much because running the test loop with the best weights is the behavior most users would expect.

Additional context


awaelchli · 2020-06-02T06:08:37Z

Something like this?
trainer.test(model, load_best_checkpoint=True)

Borda · 2020-06-02T13:11:55Z

I would make the load_best_checkpoint=True as default...

yukw777 · 2020-06-02T14:48:33Z

Yeah this should definitely be the default behavior.

Another question is, can we only do this when ModelCheckpoint is used since Trainer itself doesn’t keep track of the best weights? What if someone writes their own ModelCheckpoint? It seems like there needs to be a common interface that Trainer uses to retrieve the best weights for the test loop. The best weights could then be whatever the checkpoint_callback defines it to be. In this way, I don’t think we’d need to have yet another option on Trainer.
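A minimal sketch of what such a common interface could look like. Every name here (`BestCheckpointProvider`, `best_checkpoint_path`, `MinLossCheckpoint`, `checkpoint_for_test`) is hypothetical and chosen only for illustration; none of it is existing Lightning API. The idea is that Trainer only asks the callback for a path, and the callback decides what "best" means.

```python
from typing import Dict, Optional, Protocol


class BestCheckpointProvider(Protocol):
    """Hypothetical interface: anything that can report its best checkpoint."""

    def best_checkpoint_path(self) -> Optional[str]:
        ...


class MinLossCheckpoint:
    """Hypothetical custom callback that defines 'best' as lowest validation loss."""

    def __init__(self) -> None:
        self._scores: Dict[str, float] = {}

    def record(self, path: str, val_loss: float) -> None:
        # Remember the validation loss associated with each saved checkpoint.
        self._scores[path] = val_loss

    def best_checkpoint_path(self) -> Optional[str]:
        # No checkpoint saved yet: nothing to report.
        if not self._scores:
            return None
        return min(self._scores, key=self._scores.get)


def checkpoint_for_test(callback: BestCheckpointProvider) -> Optional[str]:
    # The caller relies only on the interface, not on a concrete callback class.
    return callback.best_checkpoint_path()
```

With this shape, a user-written checkpoint callback would work with the test loop as long as it exposes the same method.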

williamFalcon · 2020-06-02T15:31:07Z

why not make it:

# default
test(..., checkpoint='best')

test(..., checkpoint=PATH/CKPT)

with the option for a string 'best'

and make this the default

Borda · 2020-06-02T17:55:43Z

very good, and test(..., checkpoint=None) uses the last weights...

yukw777 · 2020-06-02T21:30:51Z

Nice! I like that idea. To summarize:

Add an option to test() called checkpoint whose default value is 'best'.
If it's None, use the weights from the last epoch.
If it's any other string, treat it as a path to a checkpoint.
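The three cases above can be sketched as a small resolution step. This is only an illustration under stated assumptions: `resolve_test_checkpoint` and its `best_path` argument are hypothetical names, and `best_path` stands in for whatever path the checkpoint callback reports as its best.

```python
from typing import Optional


def resolve_test_checkpoint(
    checkpoint: Optional[str], best_path: Optional[str]
) -> Optional[str]:
    """Return the checkpoint path to load before testing.

    Returns None when the weights currently in memory (last epoch) should be used.
    """
    if checkpoint is None:
        # Disabled: keep the last-epoch weights, which is the current behavior.
        return None
    if checkpoint == "best":
        # Proposed default: use the best checkpoint tracked by the callback.
        if not best_path:
            raise ValueError("checkpoint='best' requested but no best checkpoint is known")
        return best_path
    # Any other string is treated as an explicit path.
    return checkpoint
```

Raising an error when no best checkpoint is known is one possible choice; falling back to the last-epoch weights with a warning would be another.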

williamFalcon · 2020-06-02T22:17:39Z

ummm. i prefer None to disable it.
there will 100% be cases where people need to disable that haha.

yukw777 · 2020-06-03T15:08:47Z

ah yeah since using the last epoch weights is the current behavior, setting it to None (and using the last epoch weights) would effectively disable it. Let me know if my understanding is incorrect.

williamFalcon · 2020-06-03T15:27:34Z

current behavior is equivalent to None

new default behavior should be “best”

yukw777 added the feature and help wanted labels Jun 1, 2020

Borda added this to the 0.9.0 milestone Jun 2, 2020

yukw777 mentioned this issue Jun 15, 2020

Add ckpt_path option to LightningModule.test() #2190

Merged

5 tasks

williamFalcon closed this as completed in #2190 Jun 15, 2020

Borda modified the milestones: 0.9.0, 0.8.0 Jun 18, 2020
