
[Bugfix] fix load local safetensors model #2512

Merged · 2 commits merged into vllm-project:main on Jan 20, 2024

Conversation

esmeetu (Collaborator) commented Jan 19, 2024

No description provided.

simon-mo (Collaborator) commented:
I think I roughly understand the fix, but can you explain? I'm a bit confused about the case in which this would happen.

esmeetu (Collaborator, Author) commented Jan 20, 2024

> I think I roughly understand the fix, but can you explain? I'm a bit confused about the case in which this would happen.

When loading a local safetensors model, the use_safetensors flag is currently not set, so the loader falls back to the default .bin format even when only safetensors weights are present.
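For context, here is a minimal sketch of the weight-format resolution this PR fixes (the function name and structure are illustrative, not the actual vLLM code):

```python
import glob
import os

def resolve_weight_files(model_name_or_path: str, load_format: str = "auto"):
    """Illustrative sketch of the weight-format resolution this PR fixes.

    Before the fix, the local-directory path never set use_safetensors,
    so a folder containing only *.safetensors checkpoints fell through
    to the default *.bin loader.
    """
    use_safetensors = load_format == "safetensors"
    is_local = os.path.isdir(model_name_or_path)

    if is_local and load_format == "auto":
        # The fix: probe the local folder for safetensors files instead
        # of assuming the default .bin format.
        if glob.glob(os.path.join(model_name_or_path, "*.safetensors")):
            use_safetensors = True

    pattern = "*.safetensors" if use_safetensors else "*.bin"
    return glob.glob(os.path.join(model_name_or_path, pattern))
```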

To reproduce: download TinyLlama/TinyLlama-1.1B-Chat-v1.0 into a local folder such as /models/TinyLlama-1.1B-Chat-v1.0, then run any server with --model /models/TinyLlama-1.1B-Chat-v1.0, e.g. as sketched below.
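Something like the following (the download command and server entrypoint are one possible setup, not taken from this PR):

```shell
# Fetch the safetensors checkpoint into a local folder
# (requires huggingface_hub's huggingface-cli download command).
huggingface-cli download TinyLlama/TinyLlama-1.1B-Chat-v1.0 \
    --local-dir /models/TinyLlama-1.1B-Chat-v1.0

# Point the server at the local folder; before this fix, the
# *.safetensors files here were ignored in favor of *.bin weights.
python -m vllm.entrypoints.openai.api_server \
    --model /models/TinyLlama-1.1B-Chat-v1.0
```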

simon-mo merged commit 91a61da into vllm-project:main on Jan 20, 2024
16 checks passed
esmeetu deleted the fix-sf branch on January 20, 2024 at 00:26
NikolaBorisov (Contributor) commented:
Sorry, I think I broke this. Fix makes sense.

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024