S-Prompts for ViT and Text Transformers #388

prabhuteja12 · 2023-08-22T18:16:17Z

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

wistuba · 2023-12-04T09:21:21Z

doc/getting_started/supported_algorithms.rst

@@ -41,7 +41,10 @@ using Renate (e.g., using :py:func:`~renate.training.training.run_training_job`;
     - A class that implements a Learning to Prompt method for ViTs. The methods trains only the input prompts that are sampled from a prompt pool in an input dependent fashion.
   * - ``"LearningToPromptReplay"`` 
     - :py:class:`LearningToPromptLearner <renate.updaters.experimental.l2p.LearningToPromptReplayLearner>`
-     - A class that extends the Learning to Prompt method to use a memory replay method like "Offline-ER"
+     - A class that extends the Learning to Prompt method to use a memory replay method like "Offline-ER".


since this is about supported algorithms, it should list S-Prompts and give guidance how to use it with SPeft.

wistuba · 2023-12-04T09:24:18Z

src/renate/benchmark/datasets/vision_datasets.py

@@ -367,6 +367,8 @@ def __init__(
    def prepare_data(self) -> None:
        """Download DomainNet dataset for given domain."""
        file_name = f"{self.data_id}.zip"
+        # update dataset name:
+        self._dataset_name = self.data_id


why is this here? prepare_data is called only once. therefore, one could set this already as part of the constructor. should this replace L365?

wistuba · 2023-12-04T10:03:40Z

src/renate/benchmark/models/spromptmodel.py

+        self._M = prompt_size
+        self._task_id = task_id
+        self._per_task_classifier = per_task_classifier
+        logger.warning(f"Task id is {self._task_id}")


remove this line?

wistuba · 2023-12-04T10:04:12Z

src/renate/benchmark/models/spromptmodel.py

+        self._backbone["transformer"].requires_grad_(False)
+        self._backbone["prompt_pool"].requires_grad_(True)
+
+        # self._backbone["transformer"].transformer._backbone.enable_gradient_checkpointing()


wistuba · 2023-12-04T10:05:00Z

src/renate/benchmark/models/spromptmodel.py

+        # self.s_prompts
+        self._backbone["prompt_pool"].increment_task()
+
+    def forward_for_monkey_patching(


prepend _

wistuba · 2023-12-04T10:40:02Z

src/renate/benchmark/models/spromptmodel.py

+                ]
+            )
+        else:
+            logits = self._backbone["classifier"]["0"](features)


can we remove the hard-coding of "0" somehow? what is defining that name?

It is the update/task_id variable converted into a string. 0 just implies that we are in the first
update. Removing it will just be a cosmetic first_task_name = "0", which doesn't seem to serve any
purpose.

wistuba · 2023-12-04T10:41:50Z

src/renate/models/layers/shared_linear.py

+    Args:
+        in_features: size of each input sample
+        out_features: size of each output sample
+        bias: If set to ``False``, the layer will not learn an additive bias.


missing documentation of args

wistuba · 2023-12-04T10:44:29Z

src/renate/models/layers/shared_linear.py

+            }
+        super().__init__(all_layers)
+
+    def increment_task(self) -> None:


is there a nice way to reuse this function in the constructor to populate all_layers?

src/renate/updaters/experimental/speft.py

wistuba · 2023-12-04T10:53:13Z

src/renate/updaters/experimental/speft.py

+        self, epoch: int, batch_idx: int, optimizer: Optimizer, optimizer_idx: int
+    ) -> None:
+        """Explicitly setting grads to None instead of zero."""
+        optimizer.zero_grad(set_to_none=True)


prabhuteja12 and others added 30 commits June 22, 2023 19:22

running l2p

7bdcab0

Merge remote-tracking branch 'origin/dev' into pt_l2p

962fe19

reorganized prompt pool code

4be7d52

minor changes to vit

1082d74

Merge remote-tracking branch 'origin/dev' into pt_l2p

c77bfe7

l2p working version

bd6194f

handling extra state

d3d17fd

changing constructors args

6222576

fixes to extra state

5c1b225

prompt reimplementation

482a337

dev merge

5b4564b

first commit allowing for a class mask

500f093

avalanche masking unsued classes

b97510c

Merge remote-tracking branch 'origin/pt_class_mask_for_CIL' into pt_l2p

4949f68

Removing debug statements

1dd0784

Merge remote-tracking branch 'origin/pt_class_mask_for_CIL' into pt_l2p

8b40951

device placement fix

f6d0042

addressing comments

ef04cc8

Merge remote-tracking branch 'origin/pt_class_mask_for_CIL' into pt_l2p

7689574

removing legacy tensor constructors

2dc0745

offline ER class mask

d5da9a3

Merge remote-tracking branch 'origin/pt_class_mask_for_CIL' into pt_l2p

8911f2f

working l2p code

595f5d9

flake8

9b019bf

small changes to tests

055b626

code comments in l2p

f295c2d

ViT outputs pooled or full feats flag

e1dd41d

unified mask, abstractions

0a2239d

typing fixes

3142c50

Merge remote-tracking branch 'origin/pt_class_mask_for_CIL' into pt_l2p

e0d8a12

prabhuteja12 and others added 18 commits September 24, 2023 23:01

multiple classifiers

03fc742

bug fix in shared linear

bb3df13

removing nested tensors

aca2873

flake8

47345f0

fixing missing prompt

93bd6b0

changing init method for prompts to kaiming uniform

e123788

Merge remote-tracking branch 'origin/dev' into pt_s_prompts

34e9f49

renaming modules

39dbab7

renaming in parsing_functions

cc3f622

bug fix

088945b

minor simplifications to model

b336e41

Merge remote-tracking branch 'origin/dev' into pt_s_prompts

7da478c

bug fix in parsing function and optimizer zero grad

c64fe4a

doc strings

4ccde7b

doc strings

0f963e2

Documentation update

57f0465

modifed docs and docstrings

0495640

changing default value of per task classifier

161da77

prabhuteja12 requested a review from wistuba December 1, 2023 10:33

prabhuteja12 marked this pull request as ready for review December 1, 2023 10:33

prabhuteja12 assigned wistuba Dec 1, 2023

prabhuteja12 added 3 commits December 4, 2023 08:52

fixing updater name

41cbe1c

model cleanup

16c185d

flake8

d462447

wistuba reviewed Dec 4, 2023

View reviewed changes

Addressing comments

446b8b2

prabhuteja12 requested a review from wistuba December 4, 2023 14:02

wistuba approved these changes Dec 4, 2023

View reviewed changes

prabhuteja12 merged commit c54887c into dev Dec 4, 2023
18 checks passed

prabhuteja12 deleted the pt_s_prompts branch December 4, 2023 17:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

S-Prompts for ViT and Text Transformers #388

S-Prompts for ViT and Text Transformers #388

prabhuteja12 commented Aug 22, 2023

wistuba Dec 4, 2023

prabhuteja12 Dec 4, 2023

wistuba Dec 4, 2023

wistuba Dec 4, 2023

prabhuteja12 Dec 4, 2023

wistuba Dec 4, 2023

prabhuteja12 Dec 4, 2023

wistuba Dec 4, 2023

prabhuteja12 Dec 4, 2023

wistuba Dec 4, 2023

prabhuteja12 Dec 4, 2023

wistuba Dec 4, 2023

prabhuteja12 Dec 4, 2023

wistuba Dec 4, 2023

prabhuteja12 Dec 4, 2023

wistuba Dec 4, 2023

S-Prompts for ViT and Text Transformers #388

S-Prompts for ViT and Text Transformers #388

Conversation

prabhuteja12 commented Aug 22, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment