
Generate inference types + start using output types #2036

Merged
merged 29 commits into main from generate-inference-type
Feb 29, 2024

Conversation

Wauplin
Contributor

@Wauplin Wauplin commented Feb 20, 2024

cc @SBrandeis @julien-c for visibility.

This PR adds scripts to generate inference types and starts using the output types in the InferenceClient.

The broader goal is to use all the spec-ed types not only in the InferenceClient but also in transformers and diffusers pipelines.
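
To make the idea concrete, here is a toy sketch of spec-to-dataclass generation (this is an illustration of the approach, not the actual logic in utils/generate_inference_types.py; the SPEC dict and render_dataclass helper are made up for this example):

```python
# Toy illustration: turn a JSON-schema-like task spec into dataclass source code.
SPEC = {
    "title": "TextClassificationOutputElement",
    "description": "Outputs of inference for the Text Classification task",
    "properties": {
        "label": {"type": "string", "description": "The predicted class label."},
        "score": {"type": "number", "description": "The corresponding probability."},
    },
}

# Map JSON schema primitive types to Python annotations.
PY_TYPES = {"string": "str", "number": "float", "integer": "int", "boolean": "bool"}

def render_dataclass(spec: dict) -> str:
    """Render Python dataclass source from a JSON-schema-like spec."""
    lines = ["@dataclass", f"class {spec['title']}(BaseInferenceType):"]
    lines.append(f'    """{spec["description"]}"""\n')
    for name, prop in spec["properties"].items():
        lines.append(f"    {name}: {PY_TYPES[prop['type']]}")
        lines.append(f'    """{prop["description"]}"""')
    return "\n".join(lines)

print(render_dataclass(SPEC))
```

The real script works from the shared JSON specs so that Python, JS, and other clients all derive the same types from one source of truth.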

How it works:

Docs:

TODO:

  • add types to inference package reference in docs
  • how to handle parsing audio or images in output? currently done "manually", i.e. not from the JSON specs
  • fix tests
  • fix type annotations in _client.py...
  • make dataclasses first class citizen in huggingface_hub?
  • better auto-generated docstrings => following HF docs format => will be done later

Breaking changes in outputs 💔:

  • automatic_speech_recognition: str => AutomaticSpeechRecognitionOutput
  • summarization: str => SummarizationOutput
  • translation: str => TranslationOutput
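
For illustration, here is what the summarization change means for callers (this uses a local stand-in dataclass so the snippet runs offline; the real generated type lives in huggingface_hub and inherits from BaseInferenceType):

```python
from dataclasses import dataclass

# Stand-in mirroring the generated SummarizationOutput type.
@dataclass
class SummarizationOutput:
    summary_text: str

# Before this PR, client.summarization(...) returned a plain str.
# After, it returns a SummarizationOutput, so callers unpack a field:
output = SummarizationOutput(summary_text="A short summary.")
text = output.summary_text
```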

Left out of scope for this PR:

  • text_generation => requires more typing for stream=True
  • deal with conversational => will be deprecated/removed soon
  • use input parameters generated from specs

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.


codecov bot commented Feb 21, 2024

Codecov Report

Attention: Patch coverage is 97.85047%, with 23 lines in your changes missing coverage. Please review.

Project coverage is 82.71%. Comparing base (e7f243c) to head (e25bcaf).

❗ Current head e25bcaf differs from the pull request's most recent head 56955a8. Consider uploading reports for the commit 56955a8 to get more accurate results.

Files Patch % Lines
...gingface_hub/inference/_generated/_async_client.py 21.42% 22 Missing ⚠️
src/huggingface_hub/inference/_client.py 96.42% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2036      +/-   ##
==========================================
+ Coverage   80.70%   82.71%   +2.01%     
==========================================
  Files          71      102      +31     
  Lines        8519     9490     +971     
==========================================
+ Hits         6875     7850     +975     
+ Misses       1644     1640       -4     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Member

@julien-c julien-c left a comment


as discussed irl, looks quite cool already!

@Wauplin Wauplin marked this pull request as ready for review February 23, 2024 16:02
@Wauplin
Contributor Author

Wauplin commented Feb 23, 2024

This PR is now ready for review :) 🎉
I've updated the PR description accordingly. This PR introduces breaking changes for some tasks, but hopefully nothing too problematic. I'm planning to release huggingface_hub early next week, but there's no rush to merge this PR. It can wait for the next release, which would allow us to implement a few more TODOs before shipping.

Member

@LysandreJik LysandreJik left a comment


Looks good! Clear implementation, awesome to have the types so neatly defined within the runtime.

It would be awesome to gradually move towards the same input/output definitions within transformers & diffusers pipelines.

(No need to review but FYI @sanchit-gandhi @Rocketknight1 @amyeroberts @molbap @yiyixuxu as we'll want to adopt this soon)

src/huggingface_hub/inference/_client.py (review comment, outdated, resolved)
utils/generate_inference_types.py (review comment, resolved)
utils/helpers.py (review comment, resolved)
Comment on lines 36 to 43
@dataclass
class TextClassificationOutput(BaseInferenceType):
    """Outputs of inference for the Text Classification task"""

    label: str
    """The predicted class label."""
    score: float
    """The corresponding probability."""
Contributor


Output is actually a List of those if I'm not mistaken :(

I had to manually parse the TS code to add that array definition in huggingface.js (bug also reported in quicktype)

Contributor Author

@Wauplin Wauplin Feb 28, 2024


Yeah, I already had a script to handle the shared classes (typically ClassificationOutput), but I've now pushed 56955a8 so that classes are named like AudioClassificationOutputElement, to emphasize that the expected output is a list of those.

Does it look better for you like this?
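
To illustrate what the "Element" suffix conveys (stand-in dataclass and a hypothetical parse_response helper, not the generated code):

```python
from dataclasses import dataclass
from typing import List

# Stand-in mirroring the generated class; the "Element" suffix signals that
# the task returns a list of these, not a single object.
@dataclass
class AudioClassificationOutputElement:
    label: str
    score: float

def parse_response(raw: list) -> List[AudioClassificationOutputElement]:
    # The server returns a JSON array; each item maps onto one element.
    return [AudioClassificationOutputElement(**item) for item in raw]

preds = parse_response([
    {"label": "dog_bark", "score": 0.92},
    {"label": "siren", "score": 0.05},
])
```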

@SBrandeis
Contributor

Very cool that the inference types are used in the Inference client right away!

@Wauplin
Contributor Author

Wauplin commented Feb 29, 2024

Let's merge this PR now that everything's addressed! I have listed remaining todos in a follow-up issue: #2063.

@Wauplin Wauplin merged commit 3575a72 into main Feb 29, 2024
16 checks passed
@Wauplin Wauplin deleted the generate-inference-type branch February 29, 2024 16:03