Skip to content

Error with unsloth/Qwen3.5-27B-GGUF:Q3_K_S #5

@julitroalves

Description

@julitroalves

When running unsloth/Qwen3.5-27B-GGUF:Q3_K_S model I'm getting this error:

srv    operator(): got exception: {"error":{"code":500,"message":"\n------------\nWhile executing CallExpression at line 85, column 32 in source:\n...first %}↵            {{- raise_exception('System message must be at the beginnin...\n                                           ^\nError: Jinja Exception: System message must be at the beginning.","type":"server_error"}}
srv  log_server_r: done request: POST /v1/chat/completions 127.0.0.1 500
srv    operator(): got exception: {"error":{"code":500,"message":"\n------------\nWhile executing CallExpression at line 85, column 32 in source:\n...first %}↵            {{- raise_exception('System message must be at the beginnin...\n                                           ^\nError: Jinja Exception: System message must be at the beginning.","type":"server_error"}}
srv  log_server_r: done request: POST /v1/chat/completions 127.0.0.1 500

My params:

llama-server \
    -hf unsloth/Qwen3.5-27B-GGUF:Q3_K_M \
    -ngl 90 \
    -fit on \
    --cache-ram 2056 \
    --parallel 1 \
    --ctx-size 62536 \
    --temp 0.6 \
    --top-p 0.95 \
    --top-k 20 \
    --min-p 0.00 \
    --cache-type-k q4_0 --cache-type-v q4_0 \
    --flash-attn on \
    --kv-unified \
    --fit-target 100 \
    --reasoning off --threads 16 --jinja -ctxcp 5

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions