ollama run qwen:0.5B, Reply exception, stuck in a loop. #2405
Comments
I had the same behavior with the phi-2 model. I noticed that the model gives the expected answer before starting a new line (`\n`), so I added `"\n"` to the stop list:

```js
const stream = await generate({
  model: "phi",
  prompt: text,
  stream: true,
  options: {
    num_predict: 70,
    temperature: 0.65,
    penalize_newline: true,
    top_p: 0.9,
    // presence_penalty: 0.6,
    stop: ["\n", "User:", "Assistant:"] // was: ["\n"]
  }
})
```

It still cuts off in the wrong place sometimes, but I can manage by removing the words after the last punctuation mark (`.` or `,`).
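That trimming step could be sketched like this. This is a minimal illustration, not part of the Ollama API; `trimToLastPunctuation` is a hypothetical helper name:

```javascript
// Hypothetical helper: trim a streamed completion back to the last
// punctuation mark, dropping any dangling partial words after it.
function trimToLastPunctuation(text) {
  // Find the last occurrence of any sentence- or clause-ending mark.
  const idx = Math.max(
    text.lastIndexOf("."),
    text.lastIndexOf(","),
    text.lastIndexOf("!"),
    text.lastIndexOf("?")
  );
  // If no punctuation is found, return the text unchanged.
  return idx === -1 ? text : text.slice(0, idx + 1);
}

console.log(trimToLastPunctuation("The answer is 42. And then some dangl"));
// → "The answer is 42."
```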
The infinite generation should be fixed now. As for the poor responses from smaller models, this may stem from the prompt template, the prompt itself, or other factors, all of which we are trying to improve.
@jmorganca So we should update the Ollama binary version then, right?
```
$ uname -m -s -r
Darwin 23.3.0 arm64
```
[screen recording attachment: 20240208104056.mp4]
/label bug