Skip to content

Conversation

laundrevity
Copy link

This has particularly been an issue when trying to use the qwen-code CLI in conjunction with llama.cpp - it essentially only works with vLLM for Qwen3-Coder models.

changes:

  • strip <tool_call>/<tool> wrappers when parsing Hermes/Qwen function-tag tool calls so they no longer leak into assistant content
  • add parser helper to drop wrapper-only preludes before consuming <function=...> blocks
  • cover nested <tool_call><function=...> cases with a regression in test-chat.cpp

@laundrevity laundrevity requested a review from ggerganov as a code owner October 1, 2025 23:47
@github-actions github-actions bot added the testing Everything test related label Oct 1, 2025
@laundrevity laundrevity closed this Oct 2, 2025
@laundrevity laundrevity deleted the remove-tool-xml-from-content branch October 2, 2025 00:05
@laundrevity laundrevity restored the remove-tool-xml-from-content branch October 2, 2025 00:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

testing Everything test related

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant