
Fix internlm after https://github.com/vllm-project/vllm/pull/2860 #2861

Merged 2 commits into vllm-project:main on Feb 14, 2024

Conversation

pcmoritz (Collaborator) commented:

Sorry, I was shooting a little too fast with #2860 and it got merged before I could try it out end-to-end. This PR fixes the model; to verify the fix, I ran the following:

In [1]: from vllm import LLM, SamplingParams

In [2]: prompts = [
   ...:     "Hello, my name is",
   ...:     "The president of the United States is",
   ...:     "The capital of France is",
   ...:     "The future of AI is",
   ...: ]
   ...: sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

In [3]: llm = LLM(model="internlm/internlm-7b", trust_remote_code=True)

In [5]: outputs = llm.generate(prompts, sampling_params)

In [6]: # Print the outputs.
   ...: for output in outputs:
   ...:     prompt = output.prompt
   ...:     generated_text = output.outputs[0].text
   ...:     print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")
   ...: 
Prompt: 'Hello, my name is', Generated text: ' Lorena. I am a native Spanish speaker who loves to travel, read,'
Prompt: 'The president of the United States is', Generated text: " in Europe this week, but he's only making one stop in the former Soviet"
Prompt: 'The capital of France is', Generated text: ' Paris. The city is also called “City of Light” and “City of'
Prompt: 'The future of AI is', Generated text: ' bright, but there are a few things to be aware of\n\nThe future of'

The error was that https://huggingface.co/internlm/internlm-7b/blob/main/config.json doesn't have num_key_value_heads.
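For context, the usual way to handle a config like this is to fall back to num_attention_heads when num_key_value_heads is absent, since plain multi-head attention has one KV head per query head. A minimal sketch of that pattern (not necessarily the exact diff in this PR):

from transformers import AutoConfig

# Load the HF config that triggered the bug; it defines
# num_attention_heads but not num_key_value_heads.
config = AutoConfig.from_pretrained("internlm/internlm-7b", trust_remote_code=True)

# Fall back to standard MHA when num_key_value_heads is missing.
total_num_kv_heads = getattr(config, "num_key_value_heads", config.num_attention_heads)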

@pcmoritz changed the title from "Fix internlm" to "Fix internlm after https://github.com/vllm-project/vllm/pull/2860" on Feb 14, 2024
@WoosukKwon (Collaborator) left a comment:


LGTM! Thanks for the PR. Sorry, I merged the previous one too quickly 😓 Let me know if the PR looks good to go.

@pcmoritz (Collaborator, Author) replied:

No worries, I should have written that I hadn't tested it yet 😓

I tested it now with internlm/internlm-chat-20b, internlm/internlm-7b, and internlm/internlm-chat-7b, so it should be ready :)

@WoosukKwon merged commit 0c48b37 into vllm-project:main on Feb 14, 2024
16 of 19 checks passed
jvmncs pushed a commit to jvmncs/vllm that referenced this pull request Feb 14, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 20, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024