add vocab padding for LLama(Support WizardLM) #411

esmeetu · 2023-07-09T11:03:05Z

This PR add support for WizardLM-13B. When running with args: tensor-parallel-size 2, it will throw AssertionError: 32001 is not divisible by 2.
The reason is WizardLM's vocab size is 32001, and is not divisible by 2.
So i reference gpt2.py to add vocab padding for llama.py and it works.

zhuohan123

LGTM! Thanks for your contribution! Could you fix the style so we can merge the code?

add padding for LLama

70d3cc3

zhuohan123 approved these changes Jul 12, 2023

View reviewed changes

chore: fix style

c17446c

zhuohan123 merged commit 7b6ae94 into vllm-project:main Jul 14, 2023
2 checks passed

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

add vocab padding for LLama(Support WizardLM) (vllm-project#411)

f67b057

sjchoi1 pushed a commit to casys-kaist-internal/vllm that referenced this pull request May 7, 2024

add vocab padding for LLama(Support WizardLM) (vllm-project#411)

5f0f78a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add vocab padding for LLama(Support WizardLM) #411

add vocab padding for LLama(Support WizardLM) #411

esmeetu commented Jul 9, 2023

zhuohan123 left a comment

add vocab padding for LLama(Support WizardLM) #411

add vocab padding for LLama(Support WizardLM) #411

Conversation

esmeetu commented Jul 9, 2023

zhuohan123 left a comment

Choose a reason for hiding this comment