Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter YFCC data #13

Open
Hxyou opened this issue May 25, 2022 · 3 comments
Open

Filter YFCC data #13

Hxyou opened this issue May 25, 2022 · 3 comments

Comments

@Hxyou
Copy link

Hxyou commented May 25, 2022

Hi, thanks for the great work. After downloading the provided YFCC15M label file, I can see there are three keys caption filename url in each one of the labels. how should we find the corresponding YFCC image according to your label? i.e., which key should we use to align with YFCC data?

@SlotherCui
Copy link
Collaborator

You can use the url as key , and filename for check

@raytrun
Copy link

raytrun commented Jun 23, 2022

The image name of YFCC data seems to be a md5 encoding. I'm also a little confused about how to make a connection.

@DonkeyShot21
Copy link

I am also trying to filter YFCC and I have the same issue. The dataset I have downloaded has a very different structure, and I don't know how to find the images based on the filename that you provide. Also I am not sure about what you mean by "Prepare the YFCC15M subset metadata pickle by the label".

My version of YFCC100M looks exactly the same as the one they have in the SLIP repo. Do you organise the data in a different way?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants