Add pydantic support in response_format #2647

lhoestq · 2024-10-31T11:27:36Z

support passing a Pydantic schema in response_format in InferenceClient.chat_completion

class MyExampleResponseFormat(BaseModel):
    foo: str
    bar: str

response = client.chat_completion(messages=messages, response_format=MyExampleResponseFormat)

added response.choices[0].message.parsed (and response.choices[0].message.refusal if the generation failed as in the openai client)

message = response.choices[0].message
if message.parsed:
    print(message.parsed)
else:
    print(message.refusal)

close #2646

HuggingFaceDocBuilderDev · 2024-10-31T11:37:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Wauplin · 2024-10-31T15:42:17Z

As discussed, we'll have to see what we can do for this. Especially because ChatCompletionOutputMessage is an auto-generated class (also the stream version ChatCompletionStreamOutputMessage would have to be updated). And do the same for InferenceClient.text_generation(..., grammar=...).

lhoestq · 2024-10-31T16:24:06Z

Ok, note that this is mostly useful for the non-streaming case imo

patrickvonplaten · 2025-02-27T06:50:36Z

Big +1 on adding pydantic to both requests and responses. If HF becomes the gateway for many inference providers, it'd be super important to make the user experience with it as flawless as possible. To do so, I think both requests and responses objects should have very strict typing and validation that throw directly once the user makes a mistake => Pydantic is a must for this.

E.g.:

huggingface_hub/src/huggingface_hub/inference/_generated/types/base.py

Line 59 in 290aa26

raise ValueError(f"Invalid input data for {cls}. Expected a list, but got {type(output)}.")

=> these manual checks are not needed when making this a Pydantic object
There is a lot of validation missing at the moment (besides simple type checking):
- Tools are very weakly typed:
  
  huggingface_hub/src/huggingface_hub/inference/_generated/types/chat_completion.py
  
  Line 70 in 290aa26
  
  class ChatCompletionInputFunctionDefinition(BaseInferenceType):
  
  => there should be more validations (e.g. arguments should be a dict)
- Generally the whole message structure needs to be validated (e.g. there cannot be more tool answers than tool calls in history, there cannot be two assistant messages following each other, ....) => is this done?

Strict validation would also be very important here - otherwise people might waste quite some money on ill-formatted requests

Wauplin · 2025-02-27T09:30:49Z

@patrickvonplaten could you open a separate issue for that topic please? @lhoestq's initial request is simply that the response_format parameter from chat_completion accepts Pydantic model as well, not only a jsonschema.
Adding validation to all inputs and outputs is an orthogonal (and big!) topic so I'd prefer to have the discussion in a dedicated issue.

lhoestq added 4 commits October 31, 2024 12:26

add pydantic support in response_format

297d3bf

style

893e7d4

minor

1a5f134

update async client

9488831

lhoestq added 3 commits October 31, 2024 12:48

add refusal

90b1c5d

mypy

3014429

comment

f7b813a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add pydantic support in response_format #2647

Add pydantic support in response_format #2647

Uh oh!

lhoestq commented Oct 31, 2024 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Oct 31, 2024

Uh oh!

Wauplin commented Oct 31, 2024

Uh oh!

lhoestq commented Oct 31, 2024

Uh oh!

patrickvonplaten commented Feb 27, 2025 •

edited

Loading

Uh oh!

Wauplin commented Feb 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

Add pydantic support in response_format #2647

Are you sure you want to change the base?

Add pydantic support in response_format #2647

Uh oh!

Conversation

lhoestq commented Oct 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Oct 31, 2024

Uh oh!

Wauplin commented Oct 31, 2024

Uh oh!

lhoestq commented Oct 31, 2024

Uh oh!

patrickvonplaten commented Feb 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Wauplin commented Feb 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

lhoestq commented Oct 31, 2024 •

edited

Loading

patrickvonplaten commented Feb 27, 2025 •

edited

Loading

Wauplin commented Feb 27, 2025 •

edited

Loading