Skip to content

Certain ONNX models ignore the system prompt #1172

@RonanKMcGovern

Description

@RonanKMcGovern

System Info

Here's a model that follows the system prompt:

  • HuggingFaceTB/SmolLM2-1.7B-Instruct

Here are two that do not:

  • onnx-community/Llama-3.2-3B-Instruct-onnx-web-gqa
  • onnx-community/Qwen2.5-Coder-1.5B-Instruct

Is this intentional or accidental?

Environment/Platform

  • Website/web-app
  • Browser extension
  • Server-side (e.g., Node.js, Deno, Bun)
  • Desktop app (e.g., Electron)
  • Other (e.g., VSCode extension)

Description

I'm running these models in q4f16 with webgpu

Reproduction

I'm following the examples provided for smollm in the examples, but swapping the model.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions