
Conversation

@vansangpfiev
Contributor

Describe Your Changes

This pull request updates the InferenceService::HandleChatCompletion method in engine/services/inference_service.cc so that the end-of-sequence (EOS) token is appended to the stop field of the JSON request body when it is not already present.

Key change: append the model's EOS token to the stop list in HandleChatCompletion if it is missing.
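
For context, a minimal sketch of the described logic, assuming the request body is a jsoncpp Json::Value and the model's EOS token is available as a string. The helper name and structure are hypothetical and are not taken from the actual diff:

```cpp
#include <json/json.h>

#include <string>

// Hypothetical helper sketching the described behavior: make sure the model's
// EOS token appears in the "stop" field of an OpenAI-style request body.
void EnsureEosInStopWords(Json::Value& body, const std::string& eos_token) {
  if (eos_token.empty()) {
    return;
  }

  // "stop" may be absent, a single string, or an array of strings;
  // normalize it to an array before checking.
  Json::Value stops(Json::arrayValue);
  if (body.isMember("stop")) {
    if (body["stop"].isString()) {
      stops.append(body["stop"]);
    } else if (body["stop"].isArray()) {
      stops = body["stop"];
    }
  }

  // Append the EOS token only if it is not already listed.
  bool has_eos = false;
  for (const auto& stop_word : stops) {
    if (stop_word.isString() && stop_word.asString() == eos_token) {
      has_eos = true;
      break;
    }
  }
  if (!has_eos) {
    stops.append(eos_token);
  }

  body["stop"] = stops;
}
```

Normalizing this way would keep any user-supplied stop words while still guaranteeing that generation halts at the model's EOS token.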

Fixes Issues

Self Checklist

  • Added relevant comments, esp in complex areas
  • Updated docs (for bug fixes / features)
  • Created issues for follow-up changes or refactoring needed

Contributor

@qnixsynapse qnixsynapse left a comment


LGTM, but I would prefer all of this to be handled by llamacpp itself at some point.

@vansangpfiev vansangpfiev merged commit b4164c6 into dev Feb 24, 2025
8 checks passed
@vansangpfiev vansangpfiev deleted the s/fix/stop-words branch February 24, 2025 07:33