Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[data request] OpenImages v7 #906

Open
rodrigob opened this issue Aug 14, 2019 · 15 comments
Open

[data request] OpenImages v7 #906

rodrigob opened this issue Aug 14, 2019 · 15 comments
Labels
contributions welcome dataset request Request for a new dataset to be added

Comments

@rodrigob
Copy link
Contributor

rodrigob commented Aug 14, 2019

Name of dataset: OpenImages v7
URL of dataset: https://g.co/dataset/open-images
License of dataset: licensed by Google Inc. under CC BY 4.0 license. The images are listed as having a CC BY 2.0 license.

Short description of dataset and use case(s): bigger than ImageNet with 61M image level labels, 16M bounding boxes, 3M visual relationships, 2.7M instance segmentation masks, 600k localized narratives (synchronized audio and text caption, with mouse trace), and 66M point labels.

Folks who would also like to see this dataset in tensorflow/datasets, please thumbs-up so the developers can know which requests to prioritize.

And if you'd like to contribute the dataset (thank you!), see our guide to adding a dataset.

@rodrigob rodrigob added the dataset request Request for a new dataset to be added label Aug 14, 2019
@pierrot0
Copy link
Collaborator

aman2930 is looking into this.

@aman2930
Copy link
Member

Could you please assign it to me?

@rodrigob
Copy link
Contributor Author

any update on this ?

@cyfra cyfra mentioned this issue Oct 18, 2019
@rodrigob
Copy link
Contributor Author

For info we are now at open_images_v6 (same image labels, boxes, masks, and images as v5, but new types of annotations added, and larger number of relation annotations).

@Conchylicultor Conchylicultor changed the title [data request] OpenImages v5 [data request] OpenImages v6 Apr 17, 2020
@Conchylicultor
Copy link
Member

Nice, we would love have this!

For info, we (TFDS team) ensure the core API support and help with issues, but we let the community (both internal and external) implement the datasets they want (we have 130+ dataset requests).

Don't hesitate to help us with this. Or if anyone else is interested to work on this, don't hesitate to send a PR.
By starting from open_images_v4, it should be relatively straightforward to add a OpenImagesV6: https://github.com/tensorflow/datasets/blob/master/tensorflow_datasets/object_detection/open_images.py
We're here to help if anyone encounter issues for this.

@rodrigob
Copy link
Contributor Author

rodrigob commented Apr 17, 2020

relatively straightforward

Not so much, since new data types / data conventions are needed.
(instance segmentation, localized captions, audio)

@jponttuset FYI.

@Eshan-Agarwal
Copy link
Contributor

@Conchylicultor I want to work on it , should we keep both v4 and v6 ?

@rodrigob
Copy link
Contributor Author

rodrigob commented Apr 17, 2020

Note also that there was a potential bug in v4 tfds import (in the quantization of the image level machine scores), so v5/v6 should be implemented with care (and probably consider removing the quantization). Please add me in the reviewers pool.

@Conchylicultor
Copy link
Member

@Eshan-Agarwal, yes we should keep both v4 and v6. However I feel this one may be a little too ambitious for you, especially if you don't have enough compute power.

@Eshan-Agarwal
Copy link
Contributor

Yes as open_images dataset have huge size but I will try.

@rodrigob
Copy link
Contributor Author

For info, I am currently working on this issue.

@BlackHC
Copy link

BlackHC commented Jan 20, 2022

Any updates on this? 🤗 This would be super useful to have

@rodrigob rodrigob changed the title [data request] OpenImages v6 [data request] OpenImages v7 Oct 25, 2022
@rodrigob
Copy link
Contributor Author

Any updates on this? 🤗 This would be super useful to have

For context, a not-yet released implementation exists. It was used to generate the new Open Image visualizers.
I will be spending the next couple of weeks cleaning the code and pushing the public release.

@joaoguilhermeS
Copy link

joaoguilhermeS commented May 3, 2023

any updates on this :? I guess it would optimize a lot the work for a beginner.

@whoschek
Copy link

Would invite so much more use and experimentation!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
contributions welcome dataset request Request for a new dataset to be added
Projects
None yet
Development

No branches or pull requests

8 participants