Update table 1 in README to contain SigLIP, DFN #729
Conversation
@mitchellnw wouldn't it be better to highlight the best of each and use the 336, 384, and 378 res results for OpenAI, SigLIP, and DFN?
So I think the current table nicely highlights the models trained on OpenCLIP. I worry about it becoming too polluted if we keep adding new rows when models come out. That said, I think it's awesome that OpenCLIP is making it so easy to access tons of models trained by others. One suggestion is to keep the table as is, and add the IN/avg acc scatter plot (and maybe the retrieval one also) to the README. That way we can highlight that we support so many models, how good the best ones are, and point to the big table if readers need more details.
@gabrielilharco I think that some people come to OpenCLIP not to train models, but to easily use current SotA models. For them, it's less relevant what models were trained with OpenCLIP, and more important to quickly see the best models available in the repository. While the scatter plot is a really great overview of the models we have in OpenCLIP, it doesn't quickly tell people what the best models are (the plot is a bit complex). I think having a few rows in the table with the best models available (maybe even with the specific pretrained model names to load them via OpenCLIP) is a good addition. We've received this feedback in the past, so let's take it into account (e.g., https://twitter.com/giffmana/status/1707664238557221285?s=20).
Having said that, I'm also a big supporter of adding the comprehensive scatter plot, as long as we update the table with the best models :-) (probably with the higher resolutions, as @rwightman suggested).
I agree that it's good to make clear what the best models supported by this codebase are. Knowing which models are best is easy from looking at the big tables (which are sorted by performance). I think we can highlight that better in the README with some writing changes and the plots.
In the abstract, I agree with the point about too many models causing confusion. Concretely here, the updated table would have 12 models, which seems manageable and is in line with Luca's suggestion of around 10 key models. Note that the proposal here is not to keep expanding the list but to keep it to around 10 key models (and the best models available in the repo should count as key models). So I don't think we need to worry right now about potential future expansions making the table bigger.
I think if a reader needs to go from the README to another table or parse a complex plot, finding the best model available is more complicated than it needs to be. But I may also be misunderstanding your proposal! If you make another PR we can compare :-)
As a compromise, how about we add a single new model (the best one, DFN H/14@378) and remove an old one (maybe L/14 trained on LAION)?
Also good with me! I'm also still OK with 12 rows if others are OK with that.