Error when preparing Vicuna v0 weights? Can I use Vicuna v1.1? #25
Comments
I encountered the same issue. Note that per the official Vicuna repo, Vicuna v0 is only compatible with FastChat versions <= v0.1.10, so |
Same issue, looking forward to support for Vicuna v1.1. |
@linkct This solution works for me. Thanks! |
You can change the source code of fastchat.model.apply_delta
|
Can you show which line number to add it at? |
Line 107, I guess. |
Hi,
Thanks for releasing the interesting work.
I'm trying to deploy it on my server.
However, I encountered some difficulties when preparing Vicuna weights.
When applying the Vicuna delta weights to the original LLaMA weights, I always get a vocab mismatch error like this:
RuntimeError: The size of tensor a (32000) must match the size of tensor b (32001) at non-singleton dimension 0
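To illustrate what this error is saying, here is a minimal sketch in plain Python, with nested lists standing in for the embedding tensors so it runs without PyTorch. It assumes the extra row in the delta (32001 vs. 32000) corresponds to one added special token; that is an assumption, not something confirmed in this thread. The helpers add_rows and pad_rows are hypothetical names for illustration and are not part of FastChat.

```python
# Toy stand-ins for embedding matrices: each matrix is a list of rows.
# Real shapes would be roughly (32000, hidden) for the base weights and
# (32001, hidden) for the delta; tiny sizes are used here for readability.


def add_rows(base, delta):
    """Element-wise add; refuses mismatched row counts, like the tensor add."""
    if len(base) != len(delta):
        raise ValueError(f"row count mismatch: {len(base)} vs {len(delta)}")
    return [[b + d for b, d in zip(brow, drow)] for brow, drow in zip(base, delta)]


def pad_rows(base, target_rows):
    """Return a copy of base with zero rows appended up to target_rows.

    Hypothetical helper. Assumes the missing rows belong to newly added
    tokens, so starting them from zero is acceptable for this sketch.
    """
    extra = target_rows - len(base)
    if extra < 0:
        raise ValueError("base already has more rows than the target")
    width = len(base[0])
    return [row[:] for row in base] + [[0.0] * width for _ in range(extra)]


base = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]                 # 3 "tokens"
delta = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5], [9.0, 9.0]]    # 4 "tokens"

try:
    add_rows(base, delta)  # fails: 3 rows vs 4 rows
except ValueError as err:
    print("mismatch:", err)

# Growing the base to the delta's row count first makes the add well-defined.
merged = add_rows(pad_rows(base, len(delta)), delta)
print(len(merged), merged[-1])
```

In a real setup the equivalent step would be resizing the base model's token embeddings before the delta is added, which is why the FastChat version used to apply the delta matters.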
I've searched the issues in the FastChat repo but didn't find an effective solution.
The author of FastChat suggests moving directly to Vicuna v1.1, since they have fixed a lot of issues in the new version.
I'd like to ask whether I can use Vicuna v1.1, which changes the separator from ### to </s>. I'm not sure if it is compatible with MiniGPT-4. Thanks!
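Since the compatibility question hinges on the ### and </s> separators, here is a minimal sketch of why the choice matters for downstream code: anything that trims generated text at ### alone would leave </s> untouched. The function name truncate_at_separator and the sample strings are hypothetical, and this is not MiniGPT-4's actual stopping logic.

```python
def truncate_at_separator(text, separators=("###", "</s>")):
    """Cut generated text at the earliest separator found, if any."""
    cut = len(text)
    for sep in separators:
        idx = text.find(sep)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut].strip()


# Hypothetical model outputs in the two separator styles.
v0_style = "The image shows a cat.###Human: thanks"
v11_style = "The image shows a cat.</s>"

print(truncate_at_separator(v0_style))
print(truncate_at_separator(v11_style))

# A check that only knows about "###" does not trim the </s> style output.
print(truncate_at_separator(v11_style, separators=("###",)))
```

Handling both separators in one place, as above, is one way to keep such post-processing independent of the Vicuna version; whether the rest of the pipeline tolerates the change is a separate question.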