This repository provides a Hermes-style tool-call parser for Llama models served with vLLM. It adds a buffering function that prevents fragmentation of the "<tool_call>" tag, which Llama's tokenizer splits into the tokens "<", "tool", "_call", ">" when the model is trained or prompted in the format defined by NousResearch/Hermes-Function-Calling without adding special tokens. You can check the original Hermes parser at the link below.
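To illustrate the buffering idea, here is a minimal sketch (not the actual plugin code; `TOOL_CALL_TAG` and `flush_safe` are hypothetical names introduced for this example). During streaming, any trailing text that could still turn out to be the start of "<tool_call>" is held back instead of being emitted, so the tag is only ever handled as a whole once it is fully assembled:

```python
# Hypothetical sketch of streaming-safe buffering for a tag that the
# tokenizer may split across several tokens ("<", "tool", "_call", ">").

TOOL_CALL_TAG = "<tool_call>"

def flush_safe(buffer: str) -> tuple[str, str]:
    """Split `buffer` into (text safe to emit now, text to keep buffering).

    Text is withheld only while it is still a possible prefix of the
    "<tool_call>" tag; everything else is released immediately.
    """
    # Find the longest suffix of `buffer` that is a prefix of the tag.
    for i in range(min(len(buffer), len(TOOL_CALL_TAG)), 0, -1):
        if TOOL_CALL_TAG.startswith(buffer[-i:]):
            return buffer[:-i], buffer[-i:]
    return buffer, ""

# Simulate a fragmented token stream from the model:
tokens = ["Sure, ", "<", "tool", "_call", ">", '{"name": "f"}']
buf, emitted = "", []
for tok in tokens:
    buf += tok
    if TOOL_CALL_TAG in buf:
        # Full tag assembled: the real parser would now switch into
        # tool-call extraction mode; here we just drop the tag.
        before, _, after = buf.partition(TOOL_CALL_TAG)
        emitted.append(before)
        buf = after
        continue
    safe, buf = flush_safe(safe_buffer := buf) if False else flush_safe(buf)
    emitted.append(safe)
```

Without this buffering, a streaming client would receive the literal fragments "<", "tool", "_call", ">" as ordinary content before the parser could recognize the tag.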
Example launch command:

```shell
vllm serve meta-llama/Llama-3.2-3B-Instruct \
  --enable-auto-tool-choice --tool-call-parser llama_hermes \
  --tool-parser-plugin <<this_cloned_repo_path>>/lh_tool_parser.py \
  --port 4000 --enable-lora \
  --lora-modules tool=morsmordre/m-3b-v1-iteration-00-sf-xlam-10
```