
Allow other types of visual SSL when initiating CLIP #5

Closed
Froskekongen opened this issue Feb 20, 2022 · 4 comments

Froskekongen commented Feb 20, 2022

In the following code from `CLIP.__init__`:

        if use_visual_ssl:
            if visual_ssl_type == 'simsiam':
                ssl_type = SimSiam
            elif visual_ssl_type == 'simclr':
                ssl_type = partial(SimCLR, temperature = simclr_temperature)
            else:
                raise ValueError(f'unknown visual_ssl_type {visual_ssl_type}')

            self.visual_ssl = ssl_type(
                self.visual_transformer,
                image_size = visual_image_size,
                hidden_layer = visual_ssl_hidden_layer
            )

the visual self-supervised learning is hardcoded. I would suggest changing this to accept the visual SSL module as an argument when instantiating CLIP, giving the same flexibility it already offers for the image encoder and text encoder.

Example:

barlow = BarlowTwins(augmentation_fns)
clip = CLIP(..., visual_ssl=barlow)
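One possible shape for this change (a minimal sketch only; `CLIP`, `SimSiam`, and `SimCLR` here are bare stand-ins for illustration, not the library's actual classes):

```python
from functools import partial

# Stand-in SSL modules, for illustration only.
class SimSiam:
    def __init__(self, encoder, image_size=None, hidden_layer=None):
        self.encoder = encoder

class SimCLR:
    def __init__(self, encoder, image_size=None, hidden_layer=None, temperature=0.1):
        self.encoder = encoder
        self.temperature = temperature

class CLIP:
    def __init__(
        self,
        visual_transformer=None,
        visual_ssl=None,               # optionally pass a pre-built SSL module
        use_visual_ssl=False,
        visual_ssl_type='simsiam',
        simclr_temperature=0.1,
        visual_image_size=256,
        visual_ssl_hidden_layer=-1,
    ):
        self.visual_transformer = visual_transformer

        if visual_ssl is not None:
            # Caller supplied the SSL module; use it as-is.
            self.visual_ssl = visual_ssl
        elif use_visual_ssl:
            # Otherwise fall back to the existing string-based selection.
            if visual_ssl_type == 'simsiam':
                ssl_type = SimSiam
            elif visual_ssl_type == 'simclr':
                ssl_type = partial(SimCLR, temperature=simclr_temperature)
            else:
                raise ValueError(f'unknown visual_ssl_type {visual_ssl_type}')

            self.visual_ssl = ssl_type(
                self.visual_transformer,
                image_size=visual_image_size,
                hidden_layer=visual_ssl_hidden_layer,
            )
        else:
            self.visual_ssl = None
```

This keeps the current string-based path working while letting a caller inject any SSL module, e.g. `CLIP(visual_ssl=BarlowTwins(augmentation_fns))`.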
lucidrains (Owner) commented

@Froskekongen Hi Erlend! Took up your suggestion here: https://github.com/lucidrains/x-clip/tree/0.2.4#custom-vision-self-supervised-learning-module — let me know if that works for you.

lucidrains (Owner) commented

How is your experience with Barlow? Does it work?

Froskekongen (Author) commented

Thanks a lot!

BarlowTwins was just an example. Personally, I work with frameworks that are more akin to VICReg (https://arxiv.org/abs/2105.04906) and VIbCreg (https://arxiv.org/abs/2109.00783).

And I am investigating CLIP with other modalities than images and words, with less data.

lucidrains (Owner) commented

@Froskekongen awesome! hope this feature is fruitful for you then! :)
