Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does Flamingo support NLVR^2 and its paired-image input setting? #257

Open
chenxshuo opened this issue Sep 5, 2023 · 3 comments
Open

Does Flamingo support NLVR^2 and its paired-image input setting? #257

chenxshuo opened this issue Sep 5, 2023 · 3 comments

Comments

@chenxshuo
Copy link

Hi there,

thank you for your awesome work!

I wonder whether you have tried OpenFlamingo on NLVR^2. The input in NLVR^2 is always a pair of images along with a question and the output is either true or false.

Screenshot 2023-09-05 at 21 37 32
An example

I am not sure whether the in-context setting in OpenFlamingo supports such paired-image input as demonstration. Shoud I use a pair of <image> to indicate these two images in one example?

Do you have any comments?

Best.

@anas-awadalla
Copy link
Collaborator

Hello!

The way I would format the input would be to do <image><|endofchunk|><image>The left image contains twice the number of dogs as the right image, and at least two dogs in total are standing.. One limitation of doing so is that you would get less 'signal' from the first image and the text can only attend to the immediate image. Maybe one thing you can explore is combining the images and passing them in as a single image?

@chenxshuo
Copy link
Author

Hi @anas-awadalla many thanks to your reply! Let me try both methods.

@meharbhatia
Copy link

Hey @chenxshuo. Do you receive fair results when combining the images as a single image using OpenFlamingo?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants