SigLIP impl #634
Conversation
Can't comment on the distributed part of the code as I don't know that part of PyTorch, but the rest (loss details, bias/temp/inits) LGTM.
@lucasb-eyer thanks for taking a look. Yeah, the dist part is where a lot of the risk is, but it seems to be behaving on local cc12m runs comparing single-GPU to 4x GPU.
FYI: in our code, Basil implemented a small unit test checking the two formulations for "almost-equalness" of chunked vs non-chunked, which gave us good reassurance in the implementation (plus looking at the profiler for memory use).
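For illustration, a minimal NumPy sketch of what such an almost-equalness check could look like: a full-matrix sigmoid loss vs. a version that processes text features in chunks (a stand-in for the distributed/chunked formulation). Function names here are hypothetical, not the actual test from either codebase; the temperature/bias init values follow the SigLIP paper (t' = log 10, b = -10).

```python
import numpy as np

def log_sigmoid(x):
    # numerically stable log(sigmoid(x))
    return np.where(x >= 0, -np.log1p(np.exp(-x)), x - np.log1p(np.exp(x)))

def siglip_loss_full(img, txt, t, b):
    # full n x n pairwise logits; labels are +1 on the diagonal, -1 elsewhere;
    # loss is normalized by batch size n, as in the SigLIP paper
    n = img.shape[0]
    logits = t * img @ txt.T + b
    labels = 2.0 * np.eye(n) - 1.0
    return -np.sum(log_sigmoid(labels * logits)) / n

def siglip_loss_chunked(img, txt, t, b, n_chunks):
    # same loss, but accumulated over chunks of the text batch
    n = img.shape[0]
    total = 0.0
    for txt_chunk, cols in zip(np.array_split(txt, n_chunks),
                               np.array_split(np.arange(n), n_chunks)):
        logits = t * img @ txt_chunk.T + b          # (n, chunk_size)
        labels = -np.ones_like(logits)
        labels[cols, np.arange(len(cols))] = 1.0    # mark matching pairs
        total += -np.sum(log_sigmoid(labels * logits))
    return total / n

# random unit-norm features, paper-style inits (hypothetical test fixture)
rng = np.random.default_rng(0)
img = rng.normal(size=(16, 32)); img /= np.linalg.norm(img, axis=1, keepdims=True)
txt = rng.normal(size=(16, 32)); txt /= np.linalg.norm(txt, axis=1, keepdims=True)
t, b = np.exp(np.log(10.0)), -10.0

full = siglip_loss_full(img, txt, t, b)
chunked = siglip_loss_chunked(img, txt, t, b, n_chunks=4)
assert np.allclose(full, chunked)
```

The real distributed implementation additionally has to shuttle chunks between ranks, which a single-process test like this deliberately sidesteps.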
I've tested
Will merge shortly to prevent this getting stale.
Re #618