-
Notifications
You must be signed in to change notification settings - Fork 957
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ValueError: The checkpoint you are trying to load has model type starcoder2
but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
#1620
Comments
I saw that pr already supports starcoder2, and the docker image I downloaded is also the latest, but I don’t know why the deployment of starcoder2 still failed. |
@coder-xieshijie |
I think the problem is your cuda driver version:
|
This is my docker image running log.
I completely inherited it from the latest tgi image. The theoretical cuda version is also completely inherited from this image. I saw that tgi’s dockerfile file has a cuda version description from https://github.com/huggingface/text-generation-inference/blob/main/Dockerfile
Is this version outdated? |
The driver version is related to your host running the container, not the docker image. |
I'm facing the same issue but I do not have any error/warning related to GPU(except FlashAttantion). Docker Image: ghcr.io/huggingface/text-generation-inference
|
Thank you for your opinion, I will give it a try and will get back to you with the results. |
It worked. "Successfully installed transformers-4.39.0.dev0", the transformers version matters |
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days. |
System Info
FROM ghcr.io/huggingface/text-generation-inference:sha-7dbaf9e
Information
Tasks
Reproduction
I built a tgi image myself. The specific dockerfile is as follows:
When I use this built image to load starcoder2-15b, the error is:
My command to start the model in the image is as follows:
It is worth noting that for the same image, when I start starcoder1-15b, it is normal, but when I start starcoder2-15b, it fails. The full error message is as follows:
Expected behavior
I hope to start starcoder2-15b normally through tgi, I ask for help, thank you very much
The text was updated successfully, but these errors were encountered: