Error when preparing Vicuna v0 weights? Can I use Vicuna v1.1? #25

JosephPai · 2023-04-18T07:41:26Z

Hi,

Thanks for releasing the interesting work.
I'm trying to deploy it on my server.
However, I encountered some difficulties when preparing Vicuna weights.

When apply the delta weights of Vicuna to the original LLaMa weights, I always got vocab mismatch error like this:
RuntimeError: The size of tensor a (32000) must match the size of tensor b (32001) at non-singleton dimension 0
I've searched issues in the FastChat repo but didn't find effective solution.
The author of FastChat suggests to directly move to Vicuna v1.1 since they have fixed a lot of issues in the new version.

I'd like to ask

Do you have any experiences/suggestions to solve the issues I encountered?
Do you think it's feasible to directly move to Vicuna v1.1? I noticed that in the new version, they have some changes, like the separator has been changed from ### to </s>. I'm not sure if it is compatible with MiniGPT-4.

Thanks!

The text was updated successfully, but these errors were encountered:

linkct · 2023-04-18T08:14:47Z

I encountered the same issue. Note that per the Vicuna official repo, Vicuna-v0 is only compatible with FastChat version <= v0.1.10, so pip install fschat==0.1.10 solved it for me.

MrToy · 2023-04-18T11:06:45Z

same issue，looking forward to providing vicuna v1.1

JosephPai · 2023-04-18T12:43:21Z

I encountered the same issue. Note that per the Vicuna official repo, Vicuna-v0 is only compatible with FastChat version <= v0.1.10, so pip install fschat==0.1.10 solved it for me.

@linkct This solution works for me. Thanks!

LARRYMIN · 2023-04-23T14:09:41Z

can change the source code of fastchat.model.apply_delta

if delta_state_dict[name].size(0)==32001:
    state_dict[name] += delta_state_dict[name][:32000, :]
else:
    state_dict[name] += delta_state_dict[name]

feymanwang · 2023-04-25T03:00:28Z

can you show the line number to add?

WynMew · 2023-04-25T11:32:58Z

can you show the line number to add?

line 107 I guess.

TsuTikgiau closed this as completed Apr 18, 2023

gch8295322 mentioned this issue Apr 19, 2023

Mistake in the preparation of vicuna weights (error when loading delta weights) #52

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error when preparing Vicuna v0 weights? Can I use Vicuna v1.1? #25

Error when preparing Vicuna v0 weights? Can I use Vicuna v1.1? #25

JosephPai commented Apr 18, 2023

linkct commented Apr 18, 2023

MrToy commented Apr 18, 2023

JosephPai commented Apr 18, 2023 •

edited

Loading

LARRYMIN commented Apr 23, 2023

feymanwang commented Apr 25, 2023

WynMew commented Apr 25, 2023

Error when preparing Vicuna v0 weights? Can I use Vicuna v1.1? #25

Error when preparing Vicuna v0 weights? Can I use Vicuna v1.1? #25

Comments

JosephPai commented Apr 18, 2023

linkct commented Apr 18, 2023

MrToy commented Apr 18, 2023

JosephPai commented Apr 18, 2023 • edited Loading

LARRYMIN commented Apr 23, 2023

feymanwang commented Apr 25, 2023

WynMew commented Apr 25, 2023

JosephPai commented Apr 18, 2023 •

edited

Loading