Does Flamingo support NLVR^2 and its paired-image input setting? #257

chenxshuo · 2023-09-05T19:39:03Z

Hi there,

thank you for your awesome work!

I wonder whether you have tried OpenFlamingo on NLVR^2. The input in NLVR^2 is always a pair of images along with a question and the output is either true or false.

An example

I am not sure whether the in-context setting in OpenFlamingo supports such paired-image input as demonstration. Shoud I use a pair of <image> to indicate these two images in one example?

Do you have any comments?

Best.

The text was updated successfully, but these errors were encountered:

anas-awadalla · 2023-09-06T16:45:44Z

Hello!

The way I would format the input would be to do <image><|endofchunk|><image>The left image contains twice the number of dogs as the right image, and at least two dogs in total are standing.. One limitation of doing so is that you would get less 'signal' from the first image and the text can only attend to the immediate image. Maybe one thing you can explore is combining the images and passing them in as a single image?

chenxshuo · 2023-09-10T20:55:35Z

Hi @anas-awadalla many thanks to your reply! Let me try both methods.

meharbhatia · 2023-10-02T05:35:27Z

Hey @chenxshuo. Do you receive fair results when combining the images as a single image using OpenFlamingo?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does Flamingo support NLVR^2 and its paired-image input setting? #257

Does Flamingo support NLVR^2 and its paired-image input setting? #257

chenxshuo commented Sep 5, 2023

anas-awadalla commented Sep 6, 2023

chenxshuo commented Sep 10, 2023

meharbhatia commented Oct 2, 2023

Does Flamingo support NLVR^2 and its paired-image input setting? #257

Does Flamingo support NLVR^2 and its paired-image input setting? #257

Comments

chenxshuo commented Sep 5, 2023

anas-awadalla commented Sep 6, 2023

chenxshuo commented Sep 10, 2023

meharbhatia commented Oct 2, 2023