Added Weave class and WeaveModel class #3529

NimishaDey · 2023-08-17T08:43:27Z

Description

Added the Weave class and the WeaveModel class in deepchem/models/torch_models/weavemodel_pytorch.py.

Type of change

Please check the option that is related to your PR.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
- In this case, we recommend discussing your modification in a GitHub issue before creating the PR
Documentation (modification of documents)

Checklist

rbharath

@ARY2260 Can you do a preliminary round of review?

ARY2260 · 2023-09-27T14:32:10Z

deepchem/models/torch_models/weavemodel_pytorch.py

+                                               pad_batches=pad_batches):
+                if y_b is not None:
+                    if self.model.mode == 'classification':
+                        y_b = to_one_hot(y_b.flatten(),


Separate handling of labels within the default generator may not be required.

Please check.

default_generator is called internally for a NumpyDataset when model.fit() runs. To stay close to the TensorFlow implementation, I thought it best to keep the default_generator function.
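For reference, the label handling under discussion can be sketched in plain NumPy. This is only an illustration: the diff above is truncated after `to_one_hot(y_b.flatten(),`, so the reshape to (batch, n_tasks, n_classes) is an assumption, and `one_hot_labels` is a hypothetical stand-in rather than the function used in the PR.

```python
import numpy as np


def one_hot_labels(y, n_tasks, n_classes):
    """Hypothetical stand-in for the one-hot step inside default_generator."""
    # Flatten the (batch, n_tasks) integer labels into one long vector.
    flat = np.asarray(y).astype(int).flatten()
    # Row i of the identity matrix is the one-hot vector for class i.
    one_hot = np.eye(n_classes)[flat]
    # Assumed target layout: one class vector per sample and task.
    return one_hot.reshape(-1, n_tasks, n_classes)


y_b = np.array([[0], [1], [1]])  # 3 samples, 1 task, binary labels
encoded = one_hot_labels(y_b, n_tasks=1, n_classes=2)
print(encoded.shape)
```

If the base class already performs this conversion for classification losses, the override would be redundant, which seems to be the reviewer's concern.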

ARY2260 · 2023-09-27T14:37:04Z

deepchem/models/torch_models/weavemodel_pytorch.py

+        n_tasks = self.n_tasks
+        if self.mode == 'classification':
+            n_classes = self.n_classes
+            self.layer_2 = nn.LazyLinear(n_tasks * n_classes)


@rbharath Is it fine to use nn.LazyLinear instead of the default nn.Linear here? It is used because the input size is not known up front, but it may be possible to compute it.
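A minimal sketch of the trade-off, assuming an input width of 10 purely for illustration (the real width in the model is not shown in this thread): nn.LazyLinear infers in_features on the first forward pass, while nn.Linear needs it at construction time.

```python
import torch
import torch.nn as nn

n_tasks, n_classes = 3, 2  # illustrative values, not taken from the PR

# LazyLinear only needs the output width; its weight is uninitialized
# until the first batch passes through.
lazy = nn.LazyLinear(n_tasks * n_classes)
x = torch.randn(5, 10)  # batch of 5, assumed input width of 10
out = lazy(x)  # in_features is inferred as 10 here

# The explicit equivalent once the input width is known.
explicit = nn.Linear(10, n_tasks * n_classes)

print(tuple(lazy.weight.shape), tuple(explicit.weight.shape))
```

After the first call both layers hold weights of the same shape, so switching to nn.Linear is mostly a matter of computing the input width in __init__.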

ARY2260 · 2023-09-27T14:38:23Z

deepchem/models/torch_models/weavemodel_pytorch.py

+        if weight_decay_penalty != 0.0:
+            weights = [layer.weight for layer in self.model.layers2]
+            if weight_decay_penalty_type == 'l1':
+                regularization_loss = lambda: weight_decay_penalty * torch.sum(  # noqa: E731


Please check again; we generally don't use # noqa to fix lint issues.

# noqa is needed for the lambda assignment (E731), and since regularization_loss has to be a callable, I think it is required.
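One possible way to keep regularization_loss callable without the lambda assignment is a nested def, which flake8 does not flag with E731. The sketch below uses plain lists of floats in place of layer weights so that it runs without PyTorch; `make_l1_penalty` is a hypothetical helper, not code from the PR.

```python
from typing import Callable, List


def make_l1_penalty(weights: List[List[float]],
                    weight_decay_penalty: float) -> Callable[[], float]:
    """Return a zero-argument callable that computes an L1 penalty."""

    def regularization_loss() -> float:
        # Sum of absolute values over every weight in every layer.
        total = sum(abs(w) for layer in weights for w in layer)
        return weight_decay_penalty * total

    return regularization_loss


# Two toy "layers" standing in for [layer.weight for layer in layers2].
regularization_loss = make_l1_penalty([[1.0, -2.0], [0.5]], 0.5)
print(regularization_loss())
```

With tensors, the inner sum would become a torch.sum over torch.abs of each weight, and the closure would still be evaluated lazily at loss time.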

ARY2260 · 2023-10-08T12:52:06Z

deepchem/models/torch_models/weavemodel_pytorch.py

@@ -172,7 +171,8 @@ def __init__(
        ]
        self.batch_normalize: bool = batch_normalize
        self.n_weave: int = n_weave
-        torch.manual_seed(21)
+
+        # torch.manual_seed(21)


I think you can remove this comment.

I have re-added the seed statement because the reload test fails without it.

Is it fine to add the seed statement here?
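A small sketch of what the seed changes: reseeding the global generator right before construction makes two freshly built layers start from identical weights. How the reload test compares the two models is not shown in this thread, so treat the link to that test as an assumption; `build_layer` is a hypothetical helper.

```python
import torch
import torch.nn as nn


def build_layer(seed=None):
    """Build a small layer, optionally reseeding the global RNG first."""
    if seed is not None:
        torch.manual_seed(seed)  # resets PyTorch's global generator
    return nn.Linear(4, 2)


a = build_layer(seed=21)
b = build_layer(seed=21)

# Same seed before construction gives identical initial parameters.
same_init = torch.equal(a.weight, b.weight) and torch.equal(a.bias, b.bias)
print(same_init)
```

Note that torch.manual_seed affects the whole process, so a call inside __init__ also resets the random state for every caller that constructs the model; seeding in the test itself is an alternative worth weighing.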

ARY2260 · 2023-10-08T12:52:58Z

deepchem/models/torch_models/weavemodel_pytorch.py

@@ -227,10 +227,15 @@ def __init__(

        if n_layers > 0:
            self.layers2: nn.ModuleList = nn.ModuleList()
+            in_size = 1408


Is the input size always fixed?

No, it depends on n_graph_feat, so I have rewritten this statement in terms of n_graph_feat.
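The hard-coded 1408 equals 128 × 11. A minimal sketch of deriving it, assuming (not confirmed in this thread) that each of the n_graph_feat graph features is expanded into 11 Gaussian bins before the fully connected layers; the constant and the helper name are hypothetical.

```python
N_GAUSSIAN_BINS = 11  # assumption: number of Gaussian membership functions


def gather_output_size(n_graph_feat: int, gaussian_expand: bool = True) -> int:
    """Width of the tensor fed into the first fully connected layer."""
    if gaussian_expand:
        # Each graph feature becomes one value per Gaussian bin.
        return n_graph_feat * N_GAUSSIAN_BINS
    return n_graph_feat


in_size = gather_output_size(128)
print(in_size)
```

Computing the width this way keeps the layer sizes consistent when n_graph_feat is changed from its default.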

ARY2260 · 2023-10-08T12:56:41Z

deepchem/models/tests/test_weavemodel_pytorch.py

+def test_weave_singletask_classification_overfit():
+    """Test weave model overfits tiny data."""
+    # np.random.seed(123)
+    # torch.manual_seed(123)


I think the seeds should be enabled for this test.

ARY2260 · 2023-10-08T12:57:58Z

deepchem/models/tests/test_weavemodel_pytorch.py

+
+    # Eval model on train
+    scores = model.evaluate(dataset, [classification_metric])
+


Add a comment here noting that the model should be inspected in the future to understand the low score.

Okay, I will do that. The unit test for the TensorFlow code uses the same value, though.

ARY2260 · 2023-10-08T14:33:20Z

deepchem/models/torch_models/weavemodel_pytorch.py

+        else:
+            self.layer_2 = nn.Linear(fully_connected_layer_sizes[1], n_tasks)
+
+    def forward(self, inputs: OneOrMany[torch.Tensor]) -> List[torch.Tensor]:


Please add docstrings.

rbharath · 2023-10-11T16:15:28Z

deepchem/models/torch_models/weavemodel_pytorch.py

+        Parameters
+        ----------
+        inputs: OneOrMany[torch.Tensor]
+        Should contain 5 tensors [atom_features, pair_features, pair_split, atom_split, atom_to_pair]


Formatting here is a little off
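One possible layout, following the numpydoc convention of indenting the description under the parameter name. This is a stub for illustration only: the summary line and the Returns text are assumptions, and the body is omitted.

```python
def forward(inputs):
    """Run a forward pass of the Weave model (illustrative stub).

    Parameters
    ----------
    inputs: OneOrMany[torch.Tensor]
        Should contain 5 tensors: [atom_features, pair_features,
        pair_split, atom_split, atom_to_pair].

    Returns
    -------
    List[torch.Tensor]
        Output tensors of the model (contents assumed, not shown here).
    """
    raise NotImplementedError("stub used only to show docstring layout")


print(forward.__doc__)
```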

rbharath

LGTM

NimishaDey marked this pull request as ready for review September 18, 2023 09:22

NimishaDey force-pushed the weavemodel-torch branch from 3ab66ac to 3c9a397 Compare September 18, 2023 15:04

rbharath reviewed Sep 19, 2023

ARY2260 reviewed Sep 27, 2023

ARY2260 reviewed Oct 8, 2023

ARY2260 reviewed Oct 8, 2023

NimishaDey added 6 commits October 9, 2023 17:17

Added Weave class and WeaveModel class

b1c1f79

Adding unit tests

17e6228

Correcting mypy errors

2199720

Adding regression and reload unit tests

747b751

Adding overfit tests and necessary comments

6436561

Corrections

8fee49e

NimishaDey force-pushed the weavemodel-torch branch from c151463 to 8fee49e Compare October 9, 2023 11:50

Correction to unit test

9049040

rbharath reviewed Oct 11, 2023

rbharath approved these changes Oct 11, 2023

rbharath merged commit 96e8b4e into deepchem:master Oct 11, 2023
23 of 33 checks passed
