
[WIP] Code implementation of Conv-LoRA #3933

Merged: 5 commits into autogluon:master on Mar 18, 2024

Conversation

@Harry-zzh (Collaborator)

Issue #, if available:

Description of changes:

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@hohoCode

Great work! One real quick question: have you tried Conv-LoRA on standard text tasks instead of image/SAM tasks? If so, how was it? If you haven't tried, do you think it is a more general-purpose PEFT method, or is it more of a SAM/CV-specific approach? Thanks a lot!

@Harry-zzh (Collaborator, Author)

> Great work! One real quick question: have you tried Conv-LoRA on standard text tasks instead of image/SAM tasks? If so, how was it? If you haven't tried, do you think it is a more general-purpose PEFT method, or is it more of a SAM/CV-specific approach? Thanks a lot!

Thank you for your question. While I haven't tried text tasks yet, my understanding is that Conv-LoRA is primarily designed for image tasks.

Conv-LoRA incorporates local priors into image features at appropriate scales, accounting for potential variations in object scale. It interpolates image features to scales larger than the default and then applies convolution operations to inject local priors. In our paper, we find that interpolating features to larger scales for local-prior injection is more beneficial, given that features in the Vision Transformer (ViT) are downscaled by a factor (e.g., 16) from the original continuous image.

However, text is a 1-D discrete sequence and lacks a concept akin to "object scale". Consequently, given Conv-LoRA's feature processing and its motivation, it is unsuitable for text tasks.
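
A minimal, self-contained sketch of this feature flow, assuming a square ViT patch grid; the class and argument names below are illustrative, not the actual ConvLoRALinear implementation in this PR:

import torch
import torch.nn as nn
import torch.nn.functional as F

class ConvLoRASketch(nn.Module):
    """Illustrative only; not the implementation added in this PR."""

    def __init__(self, in_dim: int, out_dim: int, r: int = 8, scale_factor: int = 2):
        super().__init__()
        self.lora_A = nn.Linear(in_dim, r, bias=False)          # LoRA down-projection
        self.lora_B = nn.Linear(r, out_dim, bias=False)         # LoRA up-projection
        self.conv = nn.Conv2d(r, r, kernel_size=3, padding=1)   # injects local priors
        self.scale_factor = scale_factor

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, height, width, in_dim) ViT patch features arranged on a 2-D grid
        b, h, w, _ = x.shape
        res = self.lora_A(x)                                     # (b, h, w, r)
        res = res.permute(0, 3, 1, 2).contiguous()               # (b, r, h, w)
        res = F.interpolate(res, scale_factor=self.scale_factor,
                            mode="bilinear", align_corners=False)  # upsample beyond the default scale
        res = self.conv(res)                                     # convolution on the enlarged map
        res = F.interpolate(res, size=(h, w),
                            mode="bilinear", align_corners=False)  # back to the original grid
        res = res.permute(0, 2, 3, 1).contiguous()               # (b, h, w, r)
        return self.lora_B(res)                                  # added to the frozen layer's output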

@hohoCode

> we find that interpolating features to larger scales for local-prior injection is more beneficial, given that features in the Vision Transformer (ViT) are downscaled by a factor (e.g., 16) from the original continuous image

Excellent! Thanks for the explanations.

@zhiqiangdon added the model list checked and run-multi-gpu labels on Feb 20, 2024
@zhiqiangdon self-requested a review on February 27, 2024

Job PR-3933-17d9af4 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3933/17d9af4/index.html

Comment on lines +372 to +380
def train(self, mode: bool = True):
    super().train(mode)
    for module in self.modules():
        if isinstance(module, ConvLoRALinear):
            self.output_moe_loss = True
            return self

    return self

Contributor

This function sets output_moe_loss to True for training. During inference it should be False, but it seems to always be True?

Collaborator Author

We need the MoE loss when calculating the validation loss. During validation, the module mode is set to "eval", so we cannot distinguish between the validation and inference processes here.
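
One possible workaround, sketched below purely as an assumption and not part of this PR, is an explicit toggle that the prediction path flips, since module.training is False for both validation and inference:

import torch.nn as nn

def set_moe_loss_enabled(model: nn.Module, enabled: bool) -> None:
    """Hypothetical helper: flip the output_moe_loss flag off before pure inference."""
    for module in model.modules():
        if hasattr(module, "output_moe_loss"):
            module.output_moe_loss = enabled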


# Calculate the gating values.
lora_res = lora_res.permute(0, 3, 1, 2).contiguous()
gates, moe_loss = self.lora_moe_gating(lora_res)
Contributor

Avoid computing the moe loss during inference for better efficiency?
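
As one illustration of this suggestion, a hypothetical gating module (not the MoEConv / lora_moe_gating code in this PR) could compute the auxiliary loss only when requested:

import torch
import torch.nn as nn

class GateSketch(nn.Module):
    """Hypothetical gating module that can skip the auxiliary MoE loss."""

    def __init__(self, in_channels: int, num_experts: int):
        super().__init__()
        self.proj = nn.Conv2d(in_channels, num_experts, kernel_size=1)

    def forward(self, x: torch.Tensor, compute_loss: bool = True):
        # x: (batch, channels, height, width); gates: (batch, num_experts)
        gates = torch.softmax(self.proj(x).mean(dim=(2, 3)), dim=-1)
        if not compute_loss:
            return gates, None  # inference path: skip the auxiliary loss
        # Simple load-balancing penalty (squared coefficient of variation of
        # per-expert importance), a common choice for MoE auxiliary losses.
        importance = gates.sum(dim=0)
        moe_loss = importance.var() / (importance.mean() ** 2 + 1e-10)
        return gates, moe_loss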

from torch.distributions.normal import Normal


class MoEConv(nn.Module):
Contributor

Need a more accurate name, e.g., MoEGate? This class doesn't contain convolutions and is used to determine the gates.

multimodal/src/autogluon/multimodal/constants.py (outdated review thread, resolved)
lora_alpha: int = 1,
lora_dropout: float = 0.0,
fan_in_fan_out: bool = False, # Set this to True if the layer to replace stores weight like (fan_in, fan_out)
merge_weights: bool = False,

Is Conv-LoRA reparameterizable? It is more complicated than LoRA: LoRA just merges the weights by multiplying the matrices, but here we have convolutions.

Collaborator Author

Conv-LoRA is not reparameterizable, mainly due to its interpolation operation.
Actually, the convolutions are not the main obstacle, because a convolution layer can be re-parameterized into an FC layer in some cases. You can refer to papers on structural re-parameterization for more details.
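
For completeness, a small numeric check of why plain LoRA is reparameterizable; the values and shapes below are illustrative only:

import torch

# Plain LoRA merges because its update is itself a linear map:
#   y = W x + (alpha / r) * B (A x) == (W + (alpha / r) * B @ A) x
torch.manual_seed(0)
W = torch.randn(64, 64, dtype=torch.float64)
A = torch.randn(8, 64, dtype=torch.float64)   # down-projection (r x in_dim)
B = torch.randn(64, 8, dtype=torch.float64)   # up-projection (out_dim x r)
alpha, r = 16, 8
x = torch.randn(64, dtype=torch.float64)

y_unmerged = W @ x + (alpha / r) * (B @ (A @ x))
W_merged = W + (alpha / r) * (B @ A)
assert torch.allclose(y_unmerged, W_merged @ x)

# Conv-LoRA inserts interpolate -> conv -> interpolate between A and B. That branch
# mixes information across the spatial grid of tokens, so it is no longer a fixed
# per-token matrix that can be folded into W; the interpolation in particular depends
# on the feature-map size at run time.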

@zhiqiangdon (Contributor) left a comment

We also need to add examples of using Conv-LoRA under this path: https://github.com/autogluon/autogluon/tree/master/examples/automm/Conv-LoRA


Job PR-3933-24ee8b2 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3933/24ee8b2/index.html

@zhiqiangdon (Contributor) left a comment

LGTM!

@zhiqiangdon merged commit 7d8cef5 into autogluon:master on Mar 18, 2024
36 checks passed
ddelange added a commit to ddelange/autogluon that referenced this pull request Mar 21, 2024
…tch-4

* 'master' of https://github.com/awslabs/autogluon: (46 commits)
  [core] move transformers to setup_utils, bump dependency version (autogluon#3984)
  [AutoMM] Fix one lightning upgrade issue (autogluon#3991)
  [CI][Feature] Create a package version table (autogluon#3972)
  [v.1.1][Upgrade] PyTorch 2.1 and CUDA 12.1 upgrade (autogluon#3982)
  [WIP] Code implementation of Conv-LoRA (autogluon#3933)
  [timeseries] Ensure that all metrics handle missing values in the target (autogluon#3966)
  [timeseries] Fix path and device bugs (autogluon#3979)
  [AutoMM]Remove grounding-dino (autogluon#3974)
  [Docs] Update install modules content (autogluon#3976)
  Add note on pd.to_datetime (autogluon#3975)
  [AutoMM] Improve DINO performance (autogluon#3970)
  Minor correction in differ to pick correct environment (autogluon#3968)
  Fix windows python 3.11 issue by removing ray (autogluon#3956)
  [CI][Feature] Package Version Comparator (autogluon#3962)
  [timeseries] Add support for categorical covariates (autogluon#3874)
  [timeseries] Add method for plotting forecasts (autogluon#3889)
  Update conf.py copyright to reflect current year (autogluon#3932)
  [Timeseries][CI]Refactor CI to skip AutoMM and Tabular tests w.r.t timeseries changes (autogluon#3942)
  Fix HPO crash in memory check (autogluon#3931)
  [AutoMM][CI] Capping scikit-learn to avoid HPO test failure (autogluon#3947)
  ...
prateekdesai04 pushed a commit to prateekdesai04/autogluon that referenced this pull request Apr 3, 2024
Co-authored-by: Ubuntu <ubuntu@ip-172-31-3-160.us-west-2.compute.internal>
Co-authored-by: Zhiqiang Tang <zhiqiang.tang@rutgers.edu>
LennartPurucker pushed a commit to LennartPurucker/autogluon that referenced this pull request Jun 1, 2024
Co-authored-by: Ubuntu <ubuntu@ip-172-31-3-160.us-west-2.compute.internal>
Co-authored-by: Zhiqiang Tang <zhiqiang.tang@rutgers.edu>