[Usage] Some Weights not used, when loaded in eval mmbench. #672

shipengai · 2023-10-26T04:42:10Z

Describe the issue

Issue:
When I use my second stage trained model , there are some logs

Command:

python -m llava.eval.model_vqa_mmbench \
    --model-path ./checkpoints/llava-v1.5-7b \
    --question-file ./playground/data/eval/mmbench/$SPLIT.tsv \
    --answers-file ./playground/data/eval/mmbench/answers/$SPLIT/llava-v1.5-7b.jsonl \
    --single-pred-prompt \
    --temperature 0 \
    --conv-mode vicuna_v1

Log:

Some weights of the model checkpoint at./checkpoints/llava-v1.5-7b were not used when initializing LlavaLlamaForCausalLM: ['model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.q_proj.bias',

Screenshots:
You may attach screenshots if it better explains the issue.

The text was updated successfully, but these errors were encountered:

haotian-liu · 2023-10-26T04:58:44Z

Is the checkpoint trained by yourself? If so, this is expected, as DeepSpeed saves the frozen vision encoder weights as well. If your results are normal, than you can safely ignore this warning.

shipengai · 2023-10-26T06:22:52Z

@haotian-liu ,Yes, the checkpoint is trained myself. Thanks your reply. But I found that when use your released checkpoint , there are not such logs.

haotian-liu · 2023-10-28T05:06:27Z

If you want to remove the vision tower as the checkpoint we released, you can do this:

python -m llava.model.consolidate --src model --dst model_consolidate

The model prediction would be the same regardless of you do anything like that.

annopackage · 2024-02-23T08:58:35Z

Is the checkpoint trained by yourself? If so, this is expected, as DeepSpeed saves the frozen vision encoder weights as well. If your results are normal, than you can safely ignore this warning.

Hi, why is vision_tower not initialized through model.from_pretrain(model_name_or_path) since getattr('vision_tower') is true and there is state_dict in checkpoint?

CrossLee1 · 2024-03-16T03:05:36Z

@haotian-liu as for the mmbench dataset, gt answers are provided in mmbench_dev_20230712.tsv
why do you upload the results to the evaluation server, rather than calculating offline?

TianyunYoung · 2024-06-18T12:38:46Z

@haotian-liu as for the mmbench dataset, gt answers are provided in mmbench_dev_20230712.tsv why do you upload the results to the evaluation server, rather than calculating offline?

@haotian-liu I have the same question, hhh

ppalantir · 2024-08-20T19:34:42Z

@haotian-liu as for the mmbench dataset, gt answers are provided in mmbench_dev_20230712.tsv why do you upload the results to the evaluation server, rather than calculating offline?

Hi @CrossLee1, I also found the gt answers, but the accuracy I calculated is much higher than reported. could you please give me some suggestions? thanks

dacian7 · 2024-08-20T22:31:25Z

Hi, why is vision_tower not initialized through model.from_pretrain(model_name_or_path) since getattr('vision_tower') is true and there is state_dict in checkpoint?

@annopackage Same question here... have you figured it out?

shipengai changed the title ~~[Usage] Error in eval.~~ [Usage] Some Weights not used, when loaded in eval mmbench. Oct 26, 2023

BAJUKA mentioned this issue Oct 27, 2023

[Usage] Some weights of the model checkpoint at ./checkpoints/llava-v1.5-13b were not used when initializing LlavaLlamaForCausalLM: #687

Closed

shipengai closed this as completed Nov 8, 2023

wdrink mentioned this issue Nov 27, 2023

some weights not used when init X2FD/LVIS-INSTRUCT4V#12

Closed

RobitsG mentioned this issue Aug 29, 2024

[Usage] Some weights of LlavaLlamaForCausalLM were not initialized from the model checkpoint #1679

Open

wisdomikezogwo mentioned this issue Sep 13, 2024

Some Weights not used when initializing LlavaLlamaForCausalLM aldraus/quilt-llava#23

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Usage] Some Weights not used, when loaded in eval mmbench. #672

[Usage] Some Weights not used, when loaded in eval mmbench. #672

shipengai commented Oct 26, 2023

haotian-liu commented Oct 26, 2023

shipengai commented Oct 26, 2023 •

edited

Loading

haotian-liu commented Oct 28, 2023

annopackage commented Feb 23, 2024

CrossLee1 commented Mar 16, 2024

TianyunYoung commented Jun 18, 2024

ppalantir commented Aug 20, 2024

dacian7 commented Aug 20, 2024

[Usage] Some Weights not used, when loaded in eval mmbench. #672

[Usage] Some Weights not used, when loaded in eval mmbench. #672

Comments

shipengai commented Oct 26, 2023

Describe the issue

haotian-liu commented Oct 26, 2023

shipengai commented Oct 26, 2023 • edited Loading

haotian-liu commented Oct 28, 2023

annopackage commented Feb 23, 2024

CrossLee1 commented Mar 16, 2024

TianyunYoung commented Jun 18, 2024

ppalantir commented Aug 20, 2024

dacian7 commented Aug 20, 2024

shipengai commented Oct 26, 2023 •

edited

Loading