
Conversation

pnb (Contributor) commented on Jan 29, 2025

This PR adds an endpoint to examples/server, /apply-template, which applies the model's chat template to the given messages, as for chat completion, but then simply returns the formatted prompt rather than running inference.

Sometimes I modify prompts after formatting them for chat interactions, especially when the goal is to insert something at the beginning of the model's response: the classic "Sure!" insertion to reduce refusals, or more often a partially-constructed solution that I would like the model to finish.

Previously I have implemented chat templates for each model myself, but with the addition of some Jinja support via Minja it is very tempting to let the server do this for users instead, as it is more likely to be correct and offer good coverage of prompt templates.

This PR also adds a CI test, which uses the Command-R template to provide some variety versus the closely related test_chat_template unit test. (Side note: I did not find __verbose very useful for prompt-formatting use cases because it includes BOS, hence the new endpoint.)
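As a sketch of the intended prefill workflow: a client can ask the server to format the messages, append the start of the desired response, and then send the combined text for completion. The host/port and the seeded "Sure!" text below are assumptions for illustration; the /apply-template route and the "prompt" response field are what this PR adds.

```python
# Minimal client sketch for /apply-template (host/port are assumptions).
import json
import urllib.request

def build_payload(messages):
    """Encode a chat-completion-style message list as the request body."""
    return json.dumps({"messages": messages}).encode("utf-8")

def apply_template(messages, base_url="http://localhost:8080"):
    """POST to /apply-template and return the formatted prompt string."""
    req = urllib.request.Request(
        f"{base_url}/apply-template",
        data=build_payload(messages),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["prompt"]

if __name__ == "__main__":
    prompt = apply_template([{"role": "user", "content": "Solve this puzzle."}])
    # Seed the start of the model's answer, then send the combined text
    # to the completion endpoint so the model continues from there.
    print(prompt + "Sure!")
```

The point of returning the raw formatted prompt is exactly this kind of post-processing, which a chat-completion endpoint cannot do on its own.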

@pnb pnb requested a review from ngxson as a code owner January 29, 2025 14:33
@github-actions github-actions bot added examples python python script changes server labels Jan 29, 2025
@ngxson ngxson merged commit eb7cf15 into ggml-org:master Jan 29, 2025
47 checks passed
tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025
…ja functionality (ggml-org#11489)

* add /apply-template endpoint to server

* remove unnecessary line

* add /apply-template documentation

* return only "prompt" field in /apply-template

* use suggested idea instead of my overly verbose way
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025
