
Same generate params but got totally different result #706

Closed
luohao123 opened this issue Aug 8, 2023 · 4 comments

Comments

@luohao123

Hi, I have a set of generation params with do_sample=False, and the results on huggingface are pretty good.

But with the same settings in vLLM, the model acts like a stupid bird.

sampling_params = SamplingParams(
    temperature=generating_args.temperature,
    top_p=generating_args.top_p,
    max_tokens=800,
    frequency_penalty=generating_args.repetition_penalty,
    use_beam_search=False,
    ignore_eos=False,
)

Please let me know how to resolve this discrepancy.
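
One relevant detail: SamplingParams has no do_sample flag, so the call above still samples with temperature=0.84 and top_p=0.6, whereas in vLLM greedy decoding (the equivalent of HF's do_sample=False) is selected by setting temperature=0. Below is a minimal sketch of greedy-equivalent params, assuming greedy decoding is the goal; note also that vLLM's frequency_penalty is an additive, OpenAI-style penalty, not the same thing as HF's multiplicative repetition_penalty.

from vllm import SamplingParams

# Sketch: temperature=0 selects greedy sampling in vLLM, which is what
# Hugging Face's do_sample=False does. top_p/top_k are irrelevant when greedy.
greedy_params = SamplingParams(
    temperature=0.0,
    max_tokens=800,
    frequency_penalty=0.0,  # additive OpenAI-style penalty, not HF repetition_penalty
    ignore_eos=False,
)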

Same args (the generating_args defaults):

do_sample: Optional[bool] = field(default=True)
temperature: Optional[float] = field(
    default=0.84,
    metadata={"help": "The value used to modulate the next token probabilities."},
)
top_p: Optional[float] = field(
    default=0.6,
    metadata={
        "help": "The smallest set of most probable tokens with probabilities that add up to top_p or higher are kept."
    },
)
top_k: Optional[int] = field(default=40)
num_beams: Optional[int] = field(default=1)
max_new_tokens: Optional[int] = field(default=800)
repetition_penalty: Optional[float] = field(default=1.09)

Both runs set do_sample=False, but the results are beyond what I expected; they are totally different.
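
For comparison, on the transformers side do_sample=False makes generate() run greedy search (with num_beams=1), so the temperature/top_p/top_k fields above are not actually applied; that difference alone can explain diverging outputs. A minimal sketch of the HF call, with a placeholder model name:

from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder model name; substitute the actual checkpoint.
tokenizer = AutoTokenizer.from_pretrained("your-model")
model = AutoModelForCausalLM.from_pretrained("your-model")

inputs = tokenizer("your prompt here", return_tensors="pt")
# do_sample=False => greedy search; temperature/top_p/top_k are ignored here.
output = model.generate(
    **inputs,
    do_sample=False,
    max_new_tokens=800,
    repetition_penalty=1.09,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))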

@Zhuqln

Zhuqln commented Aug 9, 2023

Can you show the outputs so we can see the difference?

@BaiMoHan

BaiMoHan commented Aug 9, 2023

This problem may be related to #712 and #450.

@luohao123
Author

@Zhuqln Hard to tell, but for the same question, vLLM cannot answer correctly.
This causes a massive performance drop compared with HF transformers....

@hmellor
Collaborator

hmellor commented Mar 8, 2024

Closing this issue as stale as there has been no discussion in the past 3 months.

If you are still experiencing the issue you describe, feel free to re-open this issue.

@hmellor hmellor closed this as completed Mar 8, 2024