Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Qwen2: can not run with the latest Qwen2 models #2256

Closed
zensh opened this issue Jun 7, 2024 · 3 comments
Closed

Qwen2: can not run with the latest Qwen2 models #2256

zensh opened this issue Jun 7, 2024 · 3 comments

Comments

@zensh
Copy link

zensh commented Jun 7, 2024

I am trying to run Qwen2 example with the latest Qwen2 0.5B model, but get a Error: cannot find tensor lm_head.weight.

https://huggingface.co/Qwen/Qwen2-0.5B

@LaurentMazare
Copy link
Collaborator

Ah thanks for reporting this, indeed there is a small difference in the new model architecture, I'm minting #2257 to support this and you should be able to use the new model with --model 2-0.5b.

@zensh
Copy link
Author

zensh commented Jun 8, 2024

@LaurentMazare Thank you very much, it’s now running. However, the results from the 0.5B model are a bit strange, which might be due to other issues.

@zensh zensh closed this as completed Jun 8, 2024
@deadash
Copy link

deadash commented Jun 8, 2024

7b is incorrect,but cpu is ok.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants