Fix structured_outputs handling and tool normalization in vLLM backend by ehofm · Pull Request #5155 · huggingface/trl

ehofm · 2026-02-23T21:27:04Z

Summary

This PR adds support for JSON Schema-based structured generation in the vLLM backend and improves the handling of empty tool lists in the GRPOTrainer.

Key Features:

JSON Schema Support: Enables passing full JSON schemas (e.g., from Pydantic models) through generation_kwargs when using vLLM-serve.
Robust Parameter Handling: trl/scripts/vllm_serve.py now automatically converts dictionary-based structured_outputs into the required StructuredOutputsParams objects.
Empty Tool Normalization: Updated VLLMClient and GRPOTrainer to treat empty tool lists as None, preventing NotImplementedError when tools are initialized but not used.

Usage Instructions

Using JSON Schemas (New)

You can now use Pydantic models to define structured outputs for vLLM-backed generation:

from pydantic import BaseModel
from trl import GRPOConfig, GRPOTrainer

# 1. Define your schema as a Pydantic model
class ReasoningResponse(BaseModel):
    reasoning: str
    answer: float

# 2. Convert to JSON schema dictionary
my_schema = ReasoningResponse.model_json_schema()

# 3. Pass the schema to GRPOConfig via generation_kwargs
training_args = GRPOConfig(
    output_dir="./output",
    use_vllm=True,
    generation_kwargs={
        "structured_outputs": {"json": my_schema}
    }
)

trainer = GRPOTrainer(
    model="...",
    args=training_args,
    train_dataset=dataset,
    reward_funcs=[...]
)

Fixes # 5154

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

chatgpt-codex-connector · 2026-02-24T02:43:48Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

qgallouedec

thanks!

qgallouedec · 2026-02-24T19:21:38Z

@codex review

HuggingFaceDocBuilderDev · 2026-02-24T19:24:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 62e03f23a3

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

trl/generation/vllm_generation.py

trl/scripts/vllm_serve.py

qgallouedec · 2026-02-24T19:40:23Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 9efdfce628

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

trl/generation/vllm_generation.py

qgallouedec · 2026-02-24T19:50:42Z

@codex review

chatgpt-codex-connector · 2026-02-24T19:54:25Z

Codex Review: Didn't find any major issues. Breezy!

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Fix structured_outputs handling and tool normalization in vLLM backend

26fb793

ehofm marked this pull request as ready for review February 24, 2026 02:43

qgallouedec added 3 commits February 24, 2026 18:49

nits

8d810a2

for colocate as well

5273018

warning

62e03f2

qgallouedec approved these changes Feb 24, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Feb 24, 2026

View reviewed changes

trl/generation/vllm_generation.py Show resolved Hide resolved

trl/scripts/vllm_serve.py Show resolved Hide resolved

fix overiding logic

9efdfce

chatgpt-codex-connector bot reviewed Feb 24, 2026

View reviewed changes

trl/generation/vllm_generation.py Outdated Show resolved Hide resolved

don't use pop

561c9a1

Merge branch 'main' into structured-outputs-grpo-vllm-server

2c3a889

Merge branch 'main' into structured-outputs-grpo-vllm-server

3580253

qgallouedec merged commit 1149b74 into huggingface:main Feb 25, 2026
12 of 13 checks passed

qgallouedec added a commit to albertvillanova/trl that referenced this pull request Feb 25, 2026

apply huggingface#5155 by hand

bfdc9ea

qgallouedec mentioned this pull request Feb 25, 2026

Decouple rollout dispatch from vLLM backend in GRPO _generate_single_turn #5122

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix structured_outputs handling and tool normalization in vLLM backend#5155

Fix structured_outputs handling and tool normalization in vLLM backend#5155
qgallouedec merged 8 commits intohuggingface:mainfrom
ehofm:structured-outputs-grpo-vllm-server

ehofm commented Feb 23, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector bot commented Feb 24, 2026

Uh oh!

qgallouedec left a comment

Uh oh!

qgallouedec commented Feb 24, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Feb 24, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

qgallouedec commented Feb 24, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

qgallouedec commented Feb 24, 2026

Uh oh!

chatgpt-codex-connector bot commented Feb 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ehofm commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Features:

Usage Instructions

Using JSON Schemas (New)

Before submitting

Who can review?

Uh oh!

chatgpt-codex-connector bot commented Feb 24, 2026

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

qgallouedec commented Feb 24, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Feb 24, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

qgallouedec commented Feb 24, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

qgallouedec commented Feb 24, 2026

Uh oh!

chatgpt-codex-connector bot commented Feb 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ehofm commented Feb 23, 2026 •

edited

Loading