Cannot load "starcoder-15.5B" with weight_sharding=True #21

Closed
acforvs opened this issue Jun 6, 2023 · 2 comments

acforvs commented Jun 6, 2023

To reproduce:

model = load_model_pipeline(model_name="causallm", task="pretrained", model_type="starcoder-15.5B", is_eval=True, weight_sharding=True)

The error I get:
Entry Not Found for url: https://huggingface.co/bigcode/starcoder/resolve/main/pytorch_model.bin.

I believe the line that causes the problem is

weights_location = hf_hub_download(checkpoint, "pytorch_model.bin")

@acforvs changed the title from Cannot load "starcoder-15.5B" to Cannot load "starcoder-15.5B" with weight_sharding=True on Jun 7, 2023
@generative-ai758

Hello,

I am facing a similar issue. I think it's because the model has been sharded into multiple files and there's no single file named pytorch_model.bin.
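For anyone who wants to confirm this, here is a minimal sketch of how the shard files can be listed. It assumes the usual Hugging Face layout for sharded PyTorch checkpoints, where a `pytorch_model.bin.index.json` file holds a `weight_map` from parameter names to shard file names. The helper name `shard_files_from_index` and the sample index contents are made up for illustration and are not taken from the starcoder repository.

```python
import json
from typing import List


def shard_files_from_index(index_json_text: str) -> List[str]:
    """Return the distinct shard file names referenced by a checkpoint index.

    Assumes the index has a top-level "weight_map" dict whose values are
    shard file names (hypothetical helper, not part of any library).
    """
    weight_map = json.loads(index_json_text)["weight_map"]
    # Many parameters live in the same shard, so de-duplicate and sort.
    return sorted(set(weight_map.values()))


# Made-up index contents, only to show the expected shape of the file.
SAMPLE_INDEX = json.dumps(
    {
        "metadata": {"total_size": 0},
        "weight_map": {
            "transformer.wte.weight": "pytorch_model-00001-of-00002.bin",
            "transformer.h.0.attn.c_attn.weight": "pytorch_model-00001-of-00002.bin",
            "lm_head.weight": "pytorch_model-00002-of-00002.bin",
        },
    }
)

if __name__ == "__main__":
    for name in shard_files_from_index(SAMPLE_INDEX):
        print(name)
```

With a real checkpoint, the index text would come from the downloaded `pytorch_model.bin.index.json` rather than from the sample above.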

Contributor

bdqnghi commented Jul 4, 2023

Yes, the starcoder model is already sharded into multiple files, so I recommend not using weight_sharding for starcoder.
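As a rough sketch of that recommendation, the flag could be chosen from the repository's file listing instead of being hard-coded. This assumes the weight_sharding path only looks for a single `pytorch_model.bin`, as the `hf_hub_download` line quoted above suggests. The helper name `can_use_weight_sharding` and the example file lists are hypothetical.

```python
from typing import Iterable

# File name that the quoted hf_hub_download call tries to fetch.
SINGLE_FILE = "pytorch_model.bin"


def can_use_weight_sharding(repo_files: Iterable[str]) -> bool:
    """True only if the repo ships one unsharded pytorch_model.bin.

    Hypothetical helper: it inspects file names and downloads nothing.
    """
    return SINGLE_FILE in set(repo_files)


# Made-up listings for illustration; a real one could come from
# huggingface_hub.list_repo_files("bigcode/starcoder").
already_sharded = [
    "config.json",
    "pytorch_model.bin.index.json",
    "pytorch_model-00001-of-00002.bin",
    "pytorch_model-00002-of-00002.bin",
]
single_file = ["config.json", "pytorch_model.bin"]

if __name__ == "__main__":
    print(can_use_weight_sharding(already_sharded))
    print(can_use_weight_sharding(single_file))
```

The result would then be passed as `weight_sharding=...` in the `load_model_pipeline` call from the original report, which for starcoder amounts to `weight_sharding=False`.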

@bdqnghi bdqnghi closed this as completed Jul 4, 2023