Description
Name and Version
./llama-cli --version
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 2 ROCm devices:
Device 0: AMD Radeon PRO W7900, gfx1100 (0x1100), VMM: no, Wave Size: 32
Device 1: AMD Radeon PRO W7900, gfx1100 (0x1100), VMM: no, Wave Size: 32
version: 6250 (e92734d)
built with cc (Ubuntu 13.3.0-6ubuntu2~24.04) 13.3.0 for x86_64-linux-gnu
Operating systems
Linux
GGML backends
HIP
Hardware
2x Radeon Pro W7900
Models
ggml-org/gpt-oss-120b-GGUF
Problem description & steps to reproduce
On longer prompts (10k+ tokens), GPT-OSS 120B repeatedly prints "Dissolution" or "oooooooooooooo..." instead of a coherent response. Example outputs from several long test chats are in the gist linked under "Relevant log output" below.
Launch command is:
./llama-server -m /home/ultimis/LLM/Models/ggml-org/gpt-oss-120b-GGUF/gpt-oss-120b-mxfp4-00001-of-00003.gguf -c 131072 -ngl 999 -b 2048 -ub 2048 -fa --reasoning-format none --jinja --chat-template-kwargs '{"reasoning_effort":"high"}' --host 0.0.0.0 --port 8081
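(Note: the original command had -host; llama-server expects --host.)

To reproduce against the running server, a request along these lines should trigger the behavior once the prompt exceeds roughly 10k tokens. This is a minimal sketch using the OpenAI-compatible /v1/chat/completions endpoint that llama-server exposes; the prompt placeholder and max_tokens value are illustrative, not taken from the original report:

curl http://localhost:8081/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "messages": [{"role": "user", "content": "<paste a 10k+ token prompt here>"}],
        "max_tokens": 2048
      }'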
First Bad Commit
No response
Relevant log output
Raw outputs:
https://gist.github.com/AbdullahMPrograms/a7a3b96dc1713387fc93911704b2d483