Qwen2: can not run with the latest Qwen2 models #2256

zensh · 2024-06-07T09:14:17Z

I am trying to run Qwen2 example with the latest Qwen2 0.5B model, but get a Error: cannot find tensor lm_head.weight.

https://huggingface.co/Qwen/Qwen2-0.5B

The text was updated successfully, but these errors were encountered:

LaurentMazare · 2024-06-07T09:37:03Z

Ah thanks for reporting this, indeed there is a small difference in the new model architecture, I'm minting #2257 to support this and you should be able to use the new model with --model 2-0.5b.

zensh · 2024-06-08T00:17:15Z

@LaurentMazare Thank you very much, it’s now running. However, the results from the 0.5B model are a bit strange, which might be due to other issues.

deadash · 2024-06-08T02:22:31Z

7b is incorrect，but cpu is ok.

zensh closed this as completed Jun 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen2: can not run with the latest Qwen2 models #2256

Qwen2: can not run with the latest Qwen2 models #2256

zensh commented Jun 7, 2024

LaurentMazare commented Jun 7, 2024

zensh commented Jun 8, 2024

deadash commented Jun 8, 2024 •

edited

Loading

Qwen2: can not run with the latest Qwen2 models #2256

Qwen2: can not run with the latest Qwen2 models #2256

Comments

zensh commented Jun 7, 2024

LaurentMazare commented Jun 7, 2024

zensh commented Jun 8, 2024

deadash commented Jun 8, 2024 • edited Loading

deadash commented Jun 8, 2024 •

edited

Loading