
Fix stablelm #3038

Merged: 3 commits into vllm-project:main from fix-stablelm on Feb 27, 2024
Conversation

esmeetu (Collaborator) commented Feb 26, 2024

Stability AI has uploaded a Transformers implementation: https://huggingface.co/stabilityai/stablelm-3b-4e1t/discussions/10

from vllm import LLM, SamplingParams

# Sample prompts.
prompts = [
    "Hello, my name is"
]
# Create a sampling params object.
sampling_params = SamplingParams(temperature=0.0, max_tokens=64)

# Create an LLM.
llm = LLM(model="stabilityai/stablelm-3b-4e1t", dtype="half", tensor_parallel_size=1, enforce_eager=True, trust_remote_code=True)
# Generate texts from the prompts. The output is a list of RequestOutput objects
# that contain the prompt, generated text, and other information.
outputs = llm.generate(prompts, sampling_params)
# Print the outputs.
for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")

Output:

Prompt: 'Hello, my name is', Generated text: " and I'm writing you today to learn more about the 2018 Ford F-150 XLT SuperCrew 5.5' Box 4WD listed for $39,995.00. I live in the area and I would like to hear back from you soon and learn more about this vehicle. Please call me at at"

WoosukKwon (Collaborator) commented:

@esmeetu Should we replace the old version with this? Do we have a reason to keep the old one?

simon-mo (Collaborator) commented:

AFAIK the old version is needed for older Hugging Face versions?

WoosukKwon (Collaborator) commented:

@simon-mo We already updated our transformers requirement to >=4.38.0, so it should be fine.
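
For reference, a quick illustrative check (not part of vLLM or this PR) that the installed transformers satisfies the >=4.38.0 pin:

# Illustrative check that the installed transformers meets the >=4.38.0 requirement.
from packaging.version import Version

import transformers

required = Version("4.38.0")
installed = Version(transformers.__version__)
assert installed >= required, f"transformers {installed} is older than {required}"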

esmeetu (Collaborator, Author) commented Feb 27, 2024

@WoosukKwon There are only small configuration changes to the model, and I think we can keep backwards compatibility for a while.
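
A minimal sketch of what that backwards compatibility could look like, assuming the old and new configs differ only in a few attribute names (the names below are illustrative, not necessarily the ones touched in this PR):

# Hypothetical sketch: read a config value under its new attribute name,
# falling back to the legacy name so older local repo snapshots still load.
def get_partial_rotary_factor(config) -> float:
    # "partial_rotary_factor" stands in for a new-style attribute name and
    # "rope_pct" for the legacy one; both are assumptions for illustration.
    if hasattr(config, "partial_rotary_factor"):
        return config.partial_rotary_factor
    return getattr(config, "rope_pct", 1.0)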

WoosukKwon (Collaborator) commented:

@esmeetu Could you explain more about the backward compatibility? Do you mean the situation where the user uses an older version of transformers?

esmeetu (Collaborator, Author) commented Feb 27, 2024

> @esmeetu Could you explain more about the backward compatibility? Do you mean the situation where the user uses an older version of transformers?

No, this gives a friendlier user experience to those (including me) who still use a locally cached copy of the old repo from before the update. Otherwise, upgrading vLLM would throw an error and leave them confused, since they don't know about the repo update and would assume the error was caused by the vLLM upgrade. 🤯

WoosukKwon (Collaborator) commented:

@esmeetu Got it. Thanks for the explanation!

WoosukKwon merged commit 4dd6416 into vllm-project:main on Feb 27, 2024
20 of 22 checks passed
esmeetu deleted the fix-stablelm branch on March 1, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024