Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add BLIP2 generated captions of selected objects #5

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

pilot7747
Copy link
Contributor

@pilot7747 pilot7747 commented Aug 17, 2023

Pull Request Description

Added a new column named caption to all dataset splits. This column contains captions describing the regions of images enclosed within bounding boxes. These captions were generated using the BLIP2 model.

Please review and let me know if there are any changes needed.

@dustalov
Copy link
Collaborator

Thanks! I would put the files to the repository root as they have a similar format to other bits of the dataset, and I feel that the cognitive load of having one more subdirectory for three smallish files is too much.

@dustalov
Copy link
Collaborator

Also, since in our dataset there is only one bounding box per image, I propose to keeping only the primary keys (URLs) and the generated captions in these new files. That would prevent everyone from messing up with coordinates or questions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants