
load_from_checkpoint support for LightningCLI when using dependency injection #18105

Merged
merged 10 commits into Lightning-AI:master from mauvilsa:cli-load-from-checkpoint on Feb 23, 2024

Conversation


@mauvilsa mauvilsa commented Jul 18, 2023

What does this PR do?

For now this pull request is intended to start the discussion on how to add full support for load_from_checkpoint for models trained using LightningCLI.

LightningCLI is designed to allow the use of dependency injection, which is a good programming pattern. Since this means that class instances are given to the module's __init__, the current implementation of save_hyperparameters is not enough: there is no reliable way to know how the objects were instantiated, which prevents load_from_checkpoint from working correctly.
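To make the problem concrete, here is a minimal, torch-free sketch (class names are hypothetical, not from this PR) of what dependency injection into __init__ looks like, and why a plain save_hyperparameters cannot record how the injected object was built:

```python
# Hypothetical sketch: an Encoder *instance* is injected into the model's
# __init__, so the hyperparameters alone cannot describe how to rebuild it.
class Encoder:
    def __init__(self, hidden_dim: int):
        self.hidden_dim = hidden_dim


class MyModel:  # stands in for a LightningModule
    def __init__(self, encoder: Encoder, learning_rate: float = 1e-3):
        # A plain save_hyperparameters() would see only the Encoder *object*,
        # not the class path and init args used to construct it.
        self.encoder = encoder
        self.learning_rate = learning_rate


model = MyModel(encoder=Encoder(hidden_dim=64))
```

Restoring this model from a checkpoint requires knowing that `encoder` was built as `Encoder(hidden_dim=64)`, which only the CLI knows at instantiation time.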

The main idea is that LightningCLI provides to save_hyperparameters, through a context variable, the actual parameters used for instantiation. These get saved as normal in the hparams.yaml file. Then load_from_checkpoint uses a custom instantiator that makes use of jsonargparse to support the configuration of subclasses (class_path and init_args) in configs.
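The mechanism above can be sketched roughly as follows (a simplified illustration under assumed names — `given_hyperparameters_context` and the `_instantiator` key are stand-ins, not the exact implementation): the CLI sets a context variable with the resolved parameters in class_path/init_args form, and save_hyperparameters reads it when active:

```python
from contextlib import contextmanager
from contextvars import ContextVar
from typing import Optional

# Context variable through which the CLI provides the actual instantiation
# parameters, plus the import path of the instantiator to use on load.
_given_hparams: ContextVar[Optional[dict]] = ContextVar("_given_hparams", default=None)


@contextmanager
def given_hyperparameters_context(hparams: dict, instantiator: str):
    token = _given_hparams.set({**hparams, "_instantiator": instantiator})
    try:
        yield
    finally:
        _given_hparams.reset(token)


class MyModel:
    def __init__(self, encoder_cfg: Optional[dict] = None):
        # Stand-in for save_hyperparameters(): prefer the CLI-provided
        # parameters (class_path/init_args style) when the context is active.
        given = _given_hparams.get()
        self.hparams = given if given is not None else {"encoder_cfg": encoder_cfg}


params = {"encoder": {"class_path": "my_pkg.Encoder", "init_args": {"hidden_dim": 64}}}
with given_hyperparameters_context(params, "lightning.pytorch.cli.instantiate_module"):
    model = MyModel()
```

The saved hparams then carry enough structure (class_path and init_args) for a jsonargparse-based instantiator to re-create the injected objects on load.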

The issue #15427 asks for a way to instantiate a model from a config file. However, load_from_checkpoint does both instantiation and loading of weights. Is there really a need to only instantiate from config? Like having a load_from_config which would get the path to the hparams.yaml file?

Also #15427 asks about dataloaders. From what I understand save_hyperparameters can also be used for data modules. However, I don't know how that is intended to work.

Fixes #15427
Fixes #13279

Before submitting
  • Was this discussed/agreed via a GitHub issue? (not for typos and docs)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests? (not for typos and docs)
  • Did you verify new and existing tests pass locally with your changes?
  • Did you list all the breaking changes introduced by this pull request?
  • Did you update the CHANGELOG? (not for typos, docs, test updates, or minor internal changes/refactors)

PR review

Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines. In short, see the following bullet-list:

Reviewer checklist
  • Is this pull request ready for review? (if not, please submit in draft mode)
  • Check that all items from Before submitting are resolved
  • Make sure the title is self-explanatory and the description concisely explains the PR
  • Add labels and milestones (and optionally projects) to the PR so it can be classified

@github-actions github-actions bot added the pl Generic label for PyTorch Lightning package label Jul 18, 2023
requirements/pytorch/extra.txt Outdated Show resolved Hide resolved
src/lightning/pytorch/cli.py Outdated Show resolved Hide resolved
tests/tests_pytorch/test_cli.py Outdated Show resolved Hide resolved
@mauvilsa mauvilsa marked this pull request as ready for review July 18, 2023 05:32
@awaelchli awaelchli added lightningcli pl.cli.LightningCLI feature Is an improvement or enhancement community This PR is from the community labels Jul 19, 2023
@mergify mergify bot removed the has conflicts label Aug 8, 2023
@Borda Borda left a comment


Can we make this optional, so that users can still use an older version of jsonargparse and we don't end up with too narrow a version range?

requirements/pytorch/extra.txt Outdated Show resolved Hide resolved
@carmocca carmocca left a comment


Interesting solution, but I think we should iterate on the UX for this first.

tests/tests_pytorch/test_cli.py Outdated Show resolved Hide resolved
@Borda Borda changed the title load_from_checkpoint support for LightningCLI when using dependency injection load_from_checkpoint support for LightningCLI when using dependency injection Aug 21, 2023
@mergify mergify bot removed the has conflicts label Aug 22, 2023
@mauvilsa mauvilsa requested a review from Borda October 3, 2023 09:43
@carmocca carmocca left a comment


Overall looks fine. Given the complexity and the lack of docs, I don't expect users to try this yet. Will you add those later?

tests/tests_pytorch/test_cli.py Outdated Show resolved Hide resolved
src/lightning/pytorch/core/mixins/hparams_mixin.py Outdated Show resolved Hide resolved
src/lightning/pytorch/core/mixins/hparams_mixin.py Outdated Show resolved Hide resolved
tests/tests_pytorch/test_cli.py Outdated Show resolved Hide resolved
@awaelchli awaelchli added this to the 2.3 milestone Feb 13, 2024
@github-actions github-actions bot added the dependencies Pull requests that update a dependency file label Feb 13, 2024
codecov bot commented Feb 13, 2024

Codecov Report

Merging #18105 (c71406f) into master (39a86f8) will decrease coverage by 35%.
Report is 3 commits behind head on master.
The diff coverage is 100%.

Additional details and impacted files
@@            Coverage Diff             @@
##           master   #18105      +/-   ##
==========================================
- Coverage      84%      48%     -35%     
==========================================
  Files         450      442       -8     
  Lines       38154    38050     -104     
==========================================
- Hits        31893    18430   -13463     
- Misses       6261    19620   +13359     

@github-actions github-actions bot added the docs Documentation related label Feb 20, 2024
@carmocca carmocca left a comment


@Borda you have changes requested for this PR. Can you check it again please?

requirements/pytorch/extra.txt Show resolved Hide resolved
src/lightning/pytorch/CHANGELOG.md Outdated Show resolved Hide resolved
@mergify mergify bot removed the has conflicts label Feb 21, 2024
@mergify mergify bot added the ready PRs ready to be merged label Feb 23, 2024
@carmocca carmocca merged commit 623ec58 into Lightning-AI:master Feb 23, 2024
103 of 104 checks passed
@mauvilsa mauvilsa deleted the cli-load-from-checkpoint branch February 23, 2024 12:43
def __call__(self, class_type: Type[ModuleType], *args: Any, **kwargs: Any) -> ModuleType:
    with _given_hyperparameters_context(
        hparams=self.cli.config_dump.get(self.key, {}),
        instantiator="lightning.pytorch.cli.instantiate_module",
    ):
        # instantiate the module while the context provides the CLI hyperparameters
        return class_type(*args, **kwargs)
Contributor Author

@carmocca @awaelchli @Borda I changed it to this to avoid having to provide an instantiator during the call to load_from_checkpoint. The tests run fine, but now I am wondering what happens when the pytorch-lightning package is installed instead of lightning. This line could be changed so that the import path is different for each package. But then, how would this be tested? Also, if someone saves a checkpoint using pytorch-lightning, should the load work when using lightning?

Member


We have a CI step where lightning.pytorch gets rewritten to pytorch_lightning for the CI jobs that build pytorch_lightning. If tests pass (they did) it should be a reliable signal that this worked as expected.

Also, if someone saves a checkpoint using pytorch-lightning, should the load work when using lightning?

This most likely doesn't work, but mixing the two packages is strongly discouraged anyway. It could be a problem for different users who share checkpoints, though.

Member


If it becomes an issue, we could write a checkpoint migration that will rewrite the instantiator import path on the fly:
https://github.com/Lightning-AI/pytorch-lightning/blob/master/src/lightning/pytorch/utilities/migration/migration.py

Checkpoint migrations are applied automatically when lightning loads a checkpoint.
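A minimal sketch of what such a migration might look like (function name, checkpoint keys, and the `_instantiator` field are assumptions for illustration, not Lightning's actual migration API):

```python
# Hypothetical migration: rewrite the stored instantiator import path so a
# checkpoint saved with `lightning` can be loaded under `pytorch_lightning`.
def migrate_instantiator_path(checkpoint: dict, target_pkg: str = "pytorch_lightning") -> dict:
    hparams = checkpoint.get("hyper_parameters", {})
    instantiator = hparams.get("_instantiator")
    if isinstance(instantiator, str) and instantiator.startswith("lightning.pytorch."):
        # Replace only the package prefix, keeping the module/function suffix.
        hparams["_instantiator"] = instantiator.replace("lightning.pytorch.", target_pkg + ".", 1)
    return checkpoint


ckpt = {"hyper_parameters": {"_instantiator": "lightning.pytorch.cli.instantiate_module"}}
migrated = migrate_instantiator_path(ckpt)
```

Since migrations run automatically on load, a transformation like this would make the rewrite transparent to users sharing checkpoints across the two packages.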

Contributor Author


Do migrations only go one way, pytorch-lightning -> lightning, or are they bidirectional?

Member


Migrations are just one way for us to define transformations to "upgrade" checkpoints from one version of lightning to another, for example when breaking changes were made so people can still load their old checkpoints.

It could also be used to define a transformation that replaces the instantiator string if the user is running from within the other package.

Contributor Author

@mauvilsa mauvilsa Feb 27, 2024


Is it known how common it is for people to share checkpoints and use them interchangeably between pytorch-lightning and lightning? If it is not common, or it is not known how common, we could leave it like this and improve it when someone raises the issue. Or should this be considered now, before the feature is included in a release?

Member


I'm ok leaving it like this for now.

speediedan added a commit to speediedan/finetuning-scheduler that referenced this pull request May 6, 2024