Generate "OOOOOOOOOOOOOOOOOOOOO" instead of words #621

SamuraiBUPT · 2023-12-23T14:53:55Z

Hello, I followed the InstructBLIP tutorial and ran it on a test image, but the output looks wrong.

Code:

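import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

# Use the GPU when available (device was not defined in the original snippet)
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Open the test image and convert it to RGB
test_img2 = Image.open('../dataset/fashion-new/images-raw/test_img.jpg').convert('RGB')
display(test_img2.resize((256, 378)))  # notebook preview

# Load InstructBLIP (Vicuna-7B) together with its image preprocessors
model_instructBlip, vis_processors, _ = load_model_and_preprocess(name="blip2_vicuna_instruct",
                                                                    model_type="vicuna7b",
                                                                    is_eval=True,
                                                                    device=device)

# Preprocess the image, add a batch dimension, and generate a response
image_ = vis_processors["eval"](test_img2).unsqueeze(0).to(device)
caption_test = model_instructBlip.generate({"image": image_, "prompt": "Describe the man's clothes."})
print(caption_test)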

output:

['OOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO']

I wonder what step I missed or did wrong to get this result.

Any help will be appreciated.


SamuraiBUPT · 2023-12-23T15:19:41Z

Problem solved: I was using vicuna-7b-v1.5 instead of vicuna-7b-v1.1. Sorry to bother.

purshow · 2024-01-23T19:25:12Z

Hi bro! I ran into the same problem. Could you show me how to solve it?

SamuraiBUPT · 2024-01-24T00:45:41Z

> Hi bro! I ran into the same problem. Could you show me how to solve it?

My model generated "OOOOOOOO" because I was using vicuna-7b-v1.5. According to the LAVIS documentation, you should use vicuna-7b-v1.1 instead. The InstructBLIP model does not seem to be compatible with Vicuna v1.5.

Please check your Vicuna version to make sure you are not loading the wrong one.
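If it helps, here is a rough way to catch this before loading the model. It is only a sketch: the helper names `vicuna_version_from_path` and `is_expected_vicuna` are made up for illustration and are not part of LAVIS, and it assumes the version appears in the checkpoint directory name (for example `vicuna-7b-v1.1`). It only reads the path string, so a renamed folder will fool it.

```python
import re

# Version this thread reports as working with InstructBLIP in LAVIS.
EXPECTED_VERSION = "1.1"


def vicuna_version_from_path(path):
    """Return the version embedded in a path like 'vicuna-7b-v1.5', or None.

    Hypothetical helper: it only inspects the directory name, not the weights.
    """
    match = re.search(r"vicuna-\d+b-v?(\d+(?:\.\d+)+)", path.lower())
    return match.group(1) if match else None


def is_expected_vicuna(path, expected=EXPECTED_VERSION):
    """True only when the path names the expected Vicuna version."""
    return vicuna_version_from_path(path) == expected


if __name__ == "__main__":
    for candidate in ["./llm/vicuna-7b-v1.5", "./llm/vicuna-7b-v1.1"]:
        status = "ok" if is_expected_vicuna(candidate) else "wrong or unknown version"
        print(candidate, "->", status)
```

When the path carries no version at all, the check returns `False`, so in that case you would need to confirm by hand which weights you downloaded.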

purshow · 2024-03-06T14:13:07Z

Thanks!

SamuraiBUPT closed this as completed Dec 23, 2023

Morningstaripfy mentioned this issue Mar 14, 2024

instruct-blip output long meanless string #666

Open
