Could you provide the checkpoint of the CLIPPO model? #29

zzhanghub · 2023-01-05T03:55:54Z

I noticed that you have provided the CLIPPO training code. I hope to explore some downstream task based on the pre-trained CLIPPO model. Could you please release the checkpoint?

Thank you!

Adonis-galaxy · 2023-01-08T07:20:49Z

Looking forward to the release of CLIPPO checkpoints too~

andsteing · 2023-01-09T06:19:54Z

@mitscha

mitscha · 2023-01-09T08:35:10Z

We're looking into it, but I can't promise a strict timeline. Near term we will likely only be able to release checkpoints trained on the same data sets as the released LiT models (CC12M and/or YFCC100M).

jianghaojun · 2023-01-11T13:17:33Z

+1

nahidalam · 2023-01-16T23:32:03Z

Hi @mitscha I am working on a distillation problem and CLIPPO model checkpoint will be really useful. Looking forward to it.

mitscha · 2023-03-20T14:26:13Z

We just released a set of CLIPPO checkpoints. Please refer to the readme for details and check out the colab to use the checkpoints.

mitscha · 2023-03-27T16:56:42Z

Tagging @zzhanghub @Adonis-galaxy @jianghaojun @nahidalam for visibility.
Could someone with permission please close this issue (it seems I can't close it myself).

zzhanghub · 2023-03-28T12:04:03Z

Tagging @zzhanghub @Adonis-galaxy @jianghaojun @nahidalam for visibility. Could someone with permission please close this issue (it seems I can't close it myself).

Thank you very much!

yukang123 · 2023-04-17T09:20:49Z

Hi all,

I saw multiple checkpoints of ViT-B/16 models have been released. I am wondering if you plan to release the checkpoints of ViT models of other scales, such as ViT-H-14, ViT-L. The pretrained ViT-H model seems to be more suitable for our research on the downstream image generation task. I would appreciate it if you could share these pretrained checkpoints. That would help a lot!
@mitscha

Thanks!

mitscha · 2023-04-18T16:38:47Z

Hi @yukang123, we did not plan to release additional checkpoints.

I could look into training one L/16 model for release, for example one with ImageNet21k init, trained on YFCC-100M + 25%C4 data. This one might improve a bit over the released corresponding B/16 model, but generally the models trained on YFCC-100M do not perform as well as the main models in the paper trained on WebLI. Let me know if such an L/16 model could be interesting for your use case.

yukang123 · 2023-04-29T07:05:41Z

@mitscha Thanks for your reply!

I am currently using the released checkpoints of stable diffusion v2, which use CLIP text encoder (the corresponding image encoder is ViT-H-14) to generate the text embedding of length 1024, for AIGC tasks.

I would like to combine the image embedding generated by CLIP image encoder with the text embedding. It would bring less uncertainty on the training if the dimension of image embedding matches the text embedding (i.e., 1024) because I do not need to train another full-connected layer to transform the features before concatenation.

Besides, the current task I've been working could be inspired by the idea of CLIPPO about using the images with text rendered on them. Thus, it would be very helpful for my research if I could have opportunities to transfer the released CLIPPO checkpoints onto my task. A ViT-H-14 pretrained CLIPPO model would be more suitable for my use case. If such checkpoints would be not available, could you please give me some suggestions on how to transform the dimension of image embedding without dampening the strengths of pretrained CLIPPO model?

Thanks for your understanding! Appreciate it!

mitscha mentioned this issue Mar 15, 2023

Add CLIPPO checkpoints and colab. #33

Merged

zzhanghub closed this as completed Mar 28, 2023

google-research locked and limited conversation to collaborators Nov 7, 2023

lucasb-eyer converted this issue into discussion #70 Nov 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

Could you provide the checkpoint of the CLIPPO model? #29

Could you provide the checkpoint of the CLIPPO model? #29

zzhanghub commented Jan 5, 2023

Adonis-galaxy commented Jan 8, 2023

andsteing commented Jan 9, 2023

mitscha commented Jan 9, 2023

jianghaojun commented Jan 11, 2023

nahidalam commented Jan 16, 2023

mitscha commented Mar 20, 2023

mitscha commented Mar 27, 2023

zzhanghub commented Mar 28, 2023

yukang123 commented Apr 17, 2023 •

edited

Loading

mitscha commented Apr 18, 2023

yukang123 commented Apr 29, 2023 •

edited

Loading

This issue was moved to a discussion.

This issue was moved to a discussion.

Could you provide the checkpoint of the CLIPPO model? #29

Could you provide the checkpoint of the CLIPPO model? #29

Comments

zzhanghub commented Jan 5, 2023

Adonis-galaxy commented Jan 8, 2023

andsteing commented Jan 9, 2023

mitscha commented Jan 9, 2023

jianghaojun commented Jan 11, 2023

nahidalam commented Jan 16, 2023

mitscha commented Mar 20, 2023

mitscha commented Mar 27, 2023

zzhanghub commented Mar 28, 2023

yukang123 commented Apr 17, 2023 • edited Loading

mitscha commented Apr 18, 2023

yukang123 commented Apr 29, 2023 • edited Loading

This issue was moved to a discussion.

yukang123 commented Apr 17, 2023 •

edited

Loading

yukang123 commented Apr 29, 2023 •

edited

Loading