Regarding "<ImageHere>" #225

thiner · 2024-03-20T05:40:59Z

Is <ImageHere> a fixed placeholder in the text prompt?
What kind of value does the VL model expect? A path, a URL, or a base64-encoded image?

yuhangzang · 2024-03-22T00:58:13Z

Hi thiner, you may refer to this line and this line for your questions.

thiner · 2024-03-26T05:49:53Z

@yhcao6 Thanks for your answer.
I'd like to summarize what I learned from the code; please correct me if I have misunderstood the logic.

<ImageHere> is a fixed placeholder that separates the image from the text prompt.
XComposer-VL expects the image input to be either a path that PIL.Image.open can open or a torch.Tensor instance.
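
To make the first point concrete, here is a minimal sketch of how a fixed placeholder can split a prompt into text segments with image slots between them. It is not the repository's actual code: the helper name `split_on_placeholder` is hypothetical, and only the `<ImageHere>` string itself comes from this thread.

```python
PLACEHOLDER = "<ImageHere>"  # the fixed marker discussed in this issue


def split_on_placeholder(prompt: str) -> list[str]:
    """Hypothetical helper: return the text segments around each placeholder."""
    return prompt.split(PLACEHOLDER)


segments = split_on_placeholder("<ImageHere>Describe this picture.")
# Each gap between two consecutive segments is one image slot.
num_image_slots = len(segments) - 1
```

With one placeholder at the start, the text before it is empty and the remaining text follows the single image slot.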

Based on the summary above, I have a further question: does XComposer-VL support multiple images as input? I think it is not supported currently, is it?

yuhangzang · 2024-03-28T03:07:28Z

XComposer-VL supports multiple images as input, e.g., query = '<ImageHere> <ImageHere> balabala', img_path = ['a.jpg', 'b.jpg']
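
The pairing in that example can be sketched as follows. This is an illustration only, not the model's own preprocessing: the helper `pair_images_with_prompt` is hypothetical, and it assumes that the number of `<ImageHere>` placeholders should equal the number of image paths, which the example suggests but the thread does not state outright.

```python
PLACEHOLDER = "<ImageHere>"


def pair_images_with_prompt(query: str, img_paths: list[str]) -> list[tuple[str, str]]:
    """Hypothetical helper: interleave text segments with image paths in prompt order.

    Raises ValueError when placeholder and image counts differ (an assumption).
    """
    segments = query.split(PLACEHOLDER)
    if len(segments) - 1 != len(img_paths):
        raise ValueError(
            f"{len(segments) - 1} placeholder(s) but {len(img_paths)} image(s)"
        )
    parts = []
    # zip stops after the last image, so the trailing segment is handled below.
    for text, path in zip(segments, img_paths):
        if text.strip():
            parts.append(("text", text.strip()))
        parts.append(("image", path))
    tail = segments[-1].strip()
    if tail:
        parts.append(("text", tail))
    return parts


parts = pair_images_with_prompt(
    "<ImageHere> <ImageHere> balabala", ["a.jpg", "b.jpg"]
)
```

Here the first placeholder is paired with 'a.jpg', the second with 'b.jpg', and the remaining text follows both images.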

yuhangzang · 2024-04-11T13:16:32Z

Kindly reopen this issue if you have any further questions.

mm-assistant bot assigned yhcao6 Mar 20, 2024

yuhangzang closed this as completed Apr 11, 2024

yuhangzang mentioned this issue Apr 19, 2024

Does it support multi-image interleaved conversations? #277

Open
