I'm trying to get this working with my local vLLM server on Fedora 41. Here's the server command I'm currently running:
vllm serve Qwen/Qwen2.5-Coder-32B-Instruct --tensor-parallel-size 4 --enable-auto-tool-choice --tool-call-parser hermes --gpu-memory-utilization 0.8 --max-model-len 30000 --trust-remote-code
The model seems to generate the tool calls fine, but opencode just shows them as raw text instead of executing them. Is support for this planned, or am I missing an easy workaround?
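In case it's relevant, here's a sketch of the kind of opencode provider config I'd expect to point at the vLLM OpenAI-compatible endpoint. Note this is my best guess at the shape: the exact schema, the `vllm` provider key, and the `baseURL` port are assumptions from my setup, not verified against the docs:

```json
{
  "provider": {
    "vllm": {
      "npm": "@ai-sdk/openai-compatible",
      "options": {
        "baseURL": "http://localhost:8000/v1"
      },
      "models": {
        "Qwen/Qwen2.5-Coder-32B-Instruct": {}
      }
    }
  }
}
```

If the issue is on the parsing side rather than the config, I'm happy to share full request/response logs from the vLLM server.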
