Hello, and thanks @mszulc913 for bringing this up. It seems that when a huggingface model is loaded with from_pretrained, its training state defaults to False, and it stays that way during training_step. That's indeed not ideal and could have negative consequences, especially for components like Dropout in the pretrained model.
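A minimal sketch of why this matters, using plain torch (no transformers or Lightning assumed): a module left in eval mode silently disables Dropout, so training still runs, just without the regularization the user expects.

```python
import torch
from torch import nn

torch.manual_seed(0)
drop = nn.Dropout(p=0.5)
x = torch.ones(1000)

drop.train()
train_out = drop(x)  # roughly half the entries zeroed, the rest scaled by 2

drop.eval()
eval_out = drop(x)   # identity pass-through: nothing is zeroed

assert (eval_out == x).all()       # eval mode: Dropout is a no-op
assert (train_out == 0).any()      # train mode: Dropout actually drops
```

Nothing errors out in either mode, which is exactly what makes the bug silent.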
I'd prefer not to add a warning. It could confuse users who fine-tune models in which parts are frozen (and thus have to remain in eval mode); that was the main motivation of #18951. The model summary update (#19468) should give some more visibility, and it could also be made even more explicit there.
Thank you @awaelchli for the reply. Indeed, the updated model summary will help a lot, but I'm not sure it's enough. My main concern is that pre-trained models are ubiquitous today, and this behavior risks turning users away from PL because of a regression they don't have the time or will to investigate.
Description & Motivation
I feel like #18951 is not visible enough. The change made it super easy to have silent correctness bugs in the codebase when using libraries like huggingface.
Pitch
Add a warning about modules that are set to eval mode before training.
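To make the pitch concrete, here is a hypothetical sketch of such a check (this is not a real Lightning API; the helper name and message are made up): before training starts, list every submodule whose .training flag is False and emit a warning.

```python
import warnings
from torch import nn

def warn_if_eval_before_training(model: nn.Module) -> list:
    # Hypothetical helper sketching the pitched warning: collect named
    # submodules that are in eval mode (skipping the unnamed root).
    offenders = [name for name, mod in model.named_modules()
                 if name and not mod.training]
    if offenders:
        warnings.warn(
            f"These submodules are in eval mode before training starts: {offenders}. "
            "If this is unintentional, call model.train(); if they are "
            "deliberately frozen, this warning can be ignored."
        )
    return offenders

# Example: the Dropout submodule was left in eval mode.
model = nn.Sequential(nn.Linear(2, 2), nn.Dropout())
model[1].eval()
warn_if_eval_before_training(model)  # warns and returns ['1']
```

A check like this would also need an opt-out for the intentional-freezing case that motivated #18951 in the first place.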
Alternatives
Update the documentation and tutorials.
Additional context
No response
cc @Borda