Modular loading from pretrained #3305
Conversation
Some requests for more documentation, since the save/restore algorithm for components has some detail to it.
```diff
-        self.init_emb()
+        if init_emb:
+            self.init_emb()
```
This change is made so that `.restore()` functions as expected; otherwise the bias weights will be filled with 0.
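For context, here is a minimal sketch of the pattern under discussion (module and argument names are illustrative, not the exact DeepChem code): the custom initialization is gated behind an `init_emb` flag so that it is skipped when the weights are about to come from a checkpoint.

```python
import torch.nn as nn


class ExampleComponent(nn.Module):
    """Illustrative component whose custom init zeroes the projection bias."""

    def __init__(self, vocab_size: int = 100, dim: int = 16,
                 init_emb: bool = True):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, dim)
        self.proj = nn.Linear(dim, dim)
        if init_emb:
            # Only run the custom initialization for a fresh model; skipping it
            # keeps restored/pretrained weights from being overwritten with
            # zeroed biases.
            self.init_emb()

    def init_emb(self):
        nn.init.xavier_uniform_(self.emb.weight)
        nn.init.zeros_(self.proj.bias)
```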
This looks good to me barring a minor comment issue.
@gusty1g Can you do a quick review pass as well since this is based on your earlier prototype?
```python
def load_from_pretrained(  # type: ignore
        self,
        source_model: Optional["ModularTorchModel"] = None,
```
Are all three required: source_model, checkpoint, and model_dir? I think model_dir alone will be sufficient. Given a model_dir, the method loads the state_dict, and if any of the current model's layers or components match keys in the state_dict, the method can update those components' weights.
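A minimal sketch of the model_dir-only approach described here; the checkpoint file name and the `model_state_dict` key are assumptions about the checkpoint layout, not the PR's actual implementation:

```python
import os
import torch


def load_matching_weights(model: torch.nn.Module, model_dir: str) -> None:
    """Update only the parameters whose keys (and shapes) match the checkpoint."""
    # Assumed layout: one checkpoint file containing a 'model_state_dict' entry.
    data = torch.load(os.path.join(model_dir, "checkpoint1.pt"),
                      map_location="cpu")
    pretrained = data["model_state_dict"]
    current = model.state_dict()
    # Keep only keys that exist in the current model with matching shapes.
    matched = {
        key: value for key, value in pretrained.items()
        if key in current and value.shape == current[key].shape
    }
    current.update(matched)
    model.load_state_dict(current)
```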
I think it's relatively harmless to support multiple loading options here; it gives users a bit more flexibility.
I am not sure why we need both `restore` and `load_from_pretrained`.
`restore` is an optional argument in `TorchModel.fit`. We could move that functionality into `load_from_pretrained` and then modify `ModularTorchModel.fit`, but at this point I think it's better to maintain the convention we have in `TorchModel`.
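For reference, the two entry points being contrasted look roughly like this from the caller's side (`model` and `dataset` are placeholders; only the `restore` keyword of `fit` is taken from the existing TorchModel API, the rest is illustrative):

```python
# Existing convention, kept by this PR: resuming from the model's own
# checkpoints goes through fit() via its optional restore argument.
model.fit(dataset, nb_epoch=10, restore=True)

# Loading weights produced by a different (pretraining) model is handled by
# the separate load_from_pretrained method rather than by fit().
model.load_from_pretrained(model_dir="pretrain_dir")
```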
Looking at the discussion so far, I think we will be good to merge once CI is fixed. Looks like we have some flake8 and yapf errors.
LGTM, feel free to merge in once CI is clear
Description
This PR allows ModularTorchModel to load components from disk. This is important for any pretraining regime. The changes are:
Suggestion: we should remove the example usage for ModularTorchModel. It is an abstract class; users are not expected to instantiate ModularTorchModel directly, so having an example is more confusing than helpful. Subclasses will serve as the usage examples.
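To make the intended workflow concrete, here is a hedged end-to-end sketch; `SomePretrainModel` and `SomeFinetuneModel` are hypothetical ModularTorchModel subclasses, the datasets are placeholders, and the keyword arguments follow the signature shown in the diff above:

```python
# Pretraining phase: train a source model and write checkpoints to its
# model_dir. Constructors and datasets here are placeholders.
pretrain_model = SomePretrainModel(model_dir="pretrain_dir")
pretrain_model.fit(unlabeled_dataset, nb_epoch=100)

# Fine-tuning phase: build a new model that shares some components with the
# pretraining model, then pull those components' weights from disk (the
# source_model or checkpoint options discussed above would work as well).
finetune_model = SomeFinetuneModel(model_dir="finetune_dir")
finetune_model.load_from_pretrained(model_dir="pretrain_dir")
finetune_model.fit(labeled_dataset, nb_epoch=20)
```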
Type of change
Please check the option that is related to your PR.
Checklist
- Run `yapf -i <modified file>` and check no errors (yapf version must be 0.32.0)
- Run `mypy -p deepchem` and check no errors
- Run `flake8 <modified file> --count` and check no errors
- Run `python -m doctest <modified file>` and check no errors