Improved linear evaluation that achieves better results #107

teasgen · 2023-08-29T10:13:29Z

In the updated linear evaluation, the calculation process involves dividing the dataset into three parts: train, validation, and test. However, if the dataset does not already have a validation split, I will divide the train part into two sections based on the specified proportion.
It means we will get more fair results.
Also I've added regularization with openAI hyperparameter sweep (https://arxiv.org/pdf/2103.00020.pdf A.3).
Now the results are more similar to openAI metrics for CLIP models (same paper, table 10)

e.g. ViT-L-14 openai model

Metric	Current	New	openAI	diff decrease
DTD dataset	80.1	82.1	82.1	-2.0
Country211	38.7	42.1	42.9	-1.4
Food101	93.4	95.3	95.2	-1.9
Aircraft	62.4	67.5	69.4	-4.1
Cifar100	84.8	87.3	87.5	-2.5
Cifar10	97.7	98.0	98.0	-0.3

Hyperparameters	Value
Batch size	512
Epochs	20
LR	0.1

Danil328 · 2023-10-10T11:46:15Z

Approve, please

mehdidc · 2023-10-14T16:23:49Z

Sorry for the delay an thank you very much for the PR @teasgen . I will have a look right after fixing #109

ankitkv · 2023-11-07T16:38:51Z

Hi @teasgen ! Do you happen to know the best setting to use your PR for linear probe on ImageNet?

teasgen · 2023-11-08T10:27:54Z

Hi @teasgen ! Do you happen to know the best setting to use your PR for linear probe on ImageNet?

Hi, unfortunately I haven't tested my PR on ImageNet. But you can efficiently find best hyperparameters using cli arguments. However, I used same setting for all datasets, so you can firstly try it

mehdidc · 2023-12-01T17:27:52Z

Hi @teasgen, working fine for me! the only thing that would be nice to keep is the default behavior, i.e. not specifying a validation dataset. Currently, it fails with an error message if validation set or validation proportion are not given. With this commit: 396f807, I could make it working fine again, but I might have missed something. Could you please have a look/confirm ?

Thanks!

teasgen · 2023-12-01T18:54:46Z

Hi @teasgen, working fine for me! the only thing that would be nice to keep is the default behavior, i.e. not specifying a validation dataset. Currently, it fails with an error message if validation set or validation proportion are not given. With this commit: 396f807, I could make it working fine again, but I might have missed something. Could you please have a look/confirm ?

Thanks!

Hi! Your commit looks good, I suppose now its alright. Could you please release new version to pypi as soon as pr will be merged?

mehdidc · 2023-12-01T18:57:47Z

Great, thanks @teasgen!
yes, sure, will release a new version on pypi!

mehdidc · 2023-12-01T23:54:03Z

Merging, will add the other commit right after.

mehdidc · 2023-12-02T01:08:03Z

@teasgen @Danil328 available now in 1.6.0, pip install clip_benchmark==1.6.0

Improved linear evaluation that achieves better results

4ed690b

mehdidc merged commit 0652bec into LAION-AI:main Dec 1, 2023
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved linear evaluation that achieves better results #107

Improved linear evaluation that achieves better results #107

teasgen commented Aug 29, 2023

Danil328 commented Oct 10, 2023

mehdidc commented Oct 14, 2023

ankitkv commented Nov 7, 2023

teasgen commented Nov 8, 2023

mehdidc commented Dec 1, 2023

teasgen commented Dec 1, 2023

mehdidc commented Dec 1, 2023

mehdidc commented Dec 1, 2023 •

edited

mehdidc commented Dec 2, 2023

Improved linear evaluation that achieves better results #107

Improved linear evaluation that achieves better results #107

Conversation

teasgen commented Aug 29, 2023

Danil328 commented Oct 10, 2023

mehdidc commented Oct 14, 2023

ankitkv commented Nov 7, 2023

teasgen commented Nov 8, 2023

mehdidc commented Dec 1, 2023

teasgen commented Dec 1, 2023

mehdidc commented Dec 1, 2023

mehdidc commented Dec 1, 2023 • edited

mehdidc commented Dec 2, 2023

mehdidc commented Dec 1, 2023 •

edited