Add {% generation %} support to training chat templates #5470
qgallouedec merged 25 commits into main from …sages + async grpo
Conversation
class TestGetTrainingChatTemplate:
    def test_new_chat_template_is_prefix_preserving(self, tokenizer_name):
        tokenizer = AutoTokenizer.from_pretrained(tokenizer_name)
        assert is_chat_template_prefix_preserving(tokenizer) is False
For future models, `get_training_chat_template` will be called with any chat template, not only non-prefix-preserving ones.
Cursor Bugbot has reviewed your changes and found 2 potential issues.
Reviewed by Cursor Bugbot for commit 9d4d57f.
# When assistant_only_loss is enabled, swap in a training chat template with {% generation %} markers
# if the current template doesn't already have them.
if args.assistant_only_loss and "{% generation %}" not in processing_class.chat_template:
Potential crash when chat_template is None (Low Severity)

The check `"{% generation %}" not in processing_class.chat_template` will raise a TypeError if `processing_class.chat_template` is None, which can happen with tokenizers that have no chat template set. A guard like `processing_class.chat_template and "{% generation %}" not in processing_class.chat_template` would prevent the crash and provide a clearer path to the downstream ValueError from `get_training_chat_template`.
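A minimal sketch of the suggested guard (the helper name `needs_training_template` is hypothetical, introduced only to illustrate the condition; the snippet above inlines it in the trainer):

```python
def needs_training_template(chat_template, assistant_only_loss):
    """Return True when a {% generation %}-tagged template should be swapped in.

    Checking that `chat_template` is truthy avoids the TypeError that
    `"..." not in None` would raise for tokenizers with no template set.
    """
    return bool(
        assistant_only_loss
        and chat_template
        and "{% generation %}" not in chat_template
    )

# Tokenizer with no template set: no crash, no swap here; downstream code
# can then raise a clearer ValueError instead.
print(needs_training_template(None, True))               # False
# Template without generation markers: swap in the training template.
print(needs_training_template("{{ messages }}", True))   # True
```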


SFT with `assistant_only_loss=True` requires the chat template to include `{% generation %}`/`{% endgeneration %}` markers so that `return_assistant_tokens_mask=True` produces correct masks. Very few models ship these markers natively. Users currently hit a cryptic error when trying `assistant_only_loss=True` with e.g. Qwen3.

Changes

This PR aims to provide a base structure for future patched chat templates. The first one to be added is Qwen3. Here are the changes:
- `qwen3_training.jinja`: added `{% generation %}`/`{% endgeneration %}` around the assistant message output block.
- `get_training_chat_template` now returns `None` only when the template is both prefix-preserving and already contains `{% generation %}` markers. (Previously it returned `None` for any prefix-preserving template.)
- With `assistant_only_loss=True`, the trainer automatically swaps in the training chat template if the current one lacks `{% generation %}` markers, and passes `self.chat_template` through `_tokenize` → `apply_chat_template`.

This PR requires #5459
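To illustrate why the markers matter: `return_assistant_tokens_mask=True` masks exactly the spans the template wraps in `{% generation %}`/`{% endgeneration %}`. The following stand-alone sketch (a hand-rolled character-level illustration, not the Transformers implementation) builds a mask from marker positions in an already-rendered string:

```python
import re

def generation_mask(rendered_with_markers):
    """Strip {% generation %} markers from a rendered chat string and return
    (text, mask) where mask[i] == 1 iff text[i] came from a generation span."""
    text_parts, mask_parts = [], []
    inside = False
    # Split on the markers while keeping them, so we know span boundaries.
    for piece in re.split(r"(\{% generation %\}|\{% endgeneration %\})",
                          rendered_with_markers):
        if piece == "{% generation %}":
            inside = True
        elif piece == "{% endgeneration %}":
            inside = False
        else:
            text_parts.append(piece)
            mask_parts.extend([1 if inside else 0] * len(piece))
    return "".join(text_parts), mask_parts

text, mask = generation_mask(
    "<|user|>Hi<|assistant|>{% generation %}Hello!{% endgeneration %}"
)
print(text)       # <|user|>Hi<|assistant|>Hello!
print(mask[-6:])  # [1, 1, 1, 1, 1, 1]  (only the "Hello!" span is masked in)
```

With `assistant_only_loss=True`, the loss is then computed only over positions where the mask is 1, which is why a template without these markers cannot produce a usable mask.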
Note

Medium Risk: Changes how chat templates are selected/patched for both SFT (`assistant_only_loss`) and GRPO/tooling flows; incorrect detection or patching could affect tokenization, masking, and therefore training loss/behavior for supported models.

Overview

Enables SFT `assistant_only_loss=True` to work out of the box for supported models by ensuring the training chat template includes `{% generation %}`/`{% endgeneration %}` markers and passing the patched template through tokenization.

Updates `get_training_chat_template` to only return `None` when the current template is both prefix-preserving and generation-tagged; otherwise it returns the patched Qwen3 training template (or errors for unsupported templates). The Qwen3 training template is updated to wrap assistant output in generation tags, and tests/docs are extended to validate `return_assistant_tokens_mask=True` behavior and clarify the requirements.

Separately tightens GRPO/async GRPO initialization so the training template is only swapped in when tools are enabled and the original template is not prefix-preserving, avoiding unnecessary patching.
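The selection rule described above can be sketched as follows (the function signature and helper names are assumptions for illustration, not TRL's exact API):

```python
def select_training_template(chat_template, prefix_preserving, patched_template):
    """Return None to keep the current template, or a patched template to swap in.

    Rule from the PR: keep the current template only when it is BOTH
    prefix-preserving AND already generation-tagged; otherwise return the
    patched training template, or raise for unsupported models.
    """
    if prefix_preserving and "{% generation %}" in (chat_template or ""):
        return None  # current template is already safe for assistant-only loss
    if patched_template is None:
        raise ValueError("No training chat template available for this model")
    return patched_template
```

Under this rule a prefix-preserving but untagged template (the previous `None` case) now gets the patched template, which is the behavioral change the PR makes.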
Reviewed by Cursor Bugbot for commit 31e640f.