Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GTSRB dataset issue #156

Closed
pj-ms opened this issue Sep 23, 2021 · 4 comments
Closed

GTSRB dataset issue #156

pj-ms opened this issue Sep 23, 2021 · 4 comments

Comments

@pj-ms
Copy link

pj-ms commented Sep 23, 2021

According to the official website, there are two versions of GTSRB:

Version num train images num test images
IJCNN 2011 competition 26640 12569
Official 39209 12630

The dataset stats (Table 9, Page 39) seem to suggest it is using train set from IJCNN 2011 version but test set from official version.

Given that Official-Train = IJCNN-Train + IJCNN-Test (Source), is CLIP using IJCNN Train as train set, IJCNN Test set as val set to tune hyper-param, Official Test set as test set? Thanks!

image

@jongwook
Copy link
Collaborator

Thanks for catching this. We were not aware of the distinction and seems like we happened to use the training images from the competition and the testing images from the "official" version.

We'd expect the linear probes (for all models compared) would perform slightly better when it's trained on more images in the official version, so I assume the reported accuracies can still serve as a "baseline" in future studies. We also note that the same set of training images are used across all models, so the comparisons in the paper can still be considered "fair". The zero-shot evaluations are not affected by this, since it only uses the test split.

@pj-ms
Copy link
Author

pj-ms commented Sep 24, 2021

Thanks for the prompt response! Yeah, it is good enough to know the numbers from the competition train + official test.

Btw, as zero-shot is mentioned here, any chance I could know what Alec eventually used for those traffic signs :-D

image

@jongwook
Copy link
Collaborator

We recently uploaded the labels at https://github.com/openai/CLIP/blob/main/data/prompts.md#gtsrb

@pj-ms
Copy link
Author

pj-ms commented Sep 24, 2021

Thanks! Just found it there as well :-D

@pj-ms pj-ms closed this as completed Sep 24, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants