Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Add HuggingFace runtime out-of-the-box support #3395

Merged
merged 8 commits into from Feb 10, 2024

Conversation

terrytangyuan
Copy link
Member

@terrytangyuan terrytangyuan commented Jan 30, 2024

Part of #3393.

  • Tests
  • Update Python spec
  • Regenerate manifests

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
@terrytangyuan terrytangyuan changed the title [WIP] feat: Add HuggingFace runtime out-of-the-box support feat: Add HuggingFace runtime out-of-the-box support Feb 6, 2024
@terrytangyuan terrytangyuan marked this pull request as ready for review February 6, 2024 17:41
@@ -44,6 +44,8 @@ type PredictorSpec struct {
Triton *TritonSpec `json:"triton,omitempty"`
// Spec for ONNX runtime (https://github.com/microsoft/onnxruntime)
ONNX *ONNXRuntimeSpec `json:"onnx,omitempty"`
// Spec for HuggingFace runtime (https://github.com/huggingface)
HuggingFace *HuggingFaceRuntimeSpec `json:"huggingface,omitempty"`
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually we are trying to deprecate these individually defined runtime types as we are moving users to the model spec instead. @sivanantha321 has a PR for this #3144

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah I am aware of that PR and thought about it as well. IMO we should include this for consistency (better UX) and then deprecate all the individually defined runtimes together later as part of #3144.

@yuzisun
Copy link
Member

yuzisun commented Feb 9, 2024

@terrytangyuan Looks great! can you help add an e2e test?

@yuzisun
Copy link
Member

yuzisun commented Feb 10, 2024

/lgtm
/approve

Copy link

oss-prow-bot bot commented Feb 10, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: terrytangyuan, yuzisun

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@yuzisun yuzisun merged commit 5c452e7 into kserve:master Feb 10, 2024
60 checks passed
@terrytangyuan terrytangyuan deleted the hf-runtime branch February 10, 2024 15:01
TimKleinloog pushed a commit to TimKleinloog/kserve that referenced this pull request Feb 13, 2024
* feat: Add HuggingFace runtime out-of-the-box support

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Update violation list

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Add to charts

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Regenerate everything

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Tim Kleinloog <tkleinloog@deeploy.ml>
TimKleinloog pushed a commit to TimKleinloog/kserve that referenced this pull request Feb 19, 2024
* feat: Add HuggingFace runtime out-of-the-box support

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Update violation list

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Add to charts

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Regenerate everything

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Tim Kleinloog <tkleinloog@deeploy.ml>
TimKleinloog pushed a commit to TimKleinloog/kserve that referenced this pull request Feb 20, 2024
* feat: Add HuggingFace runtime out-of-the-box support

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Update violation list

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Add to charts

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Regenerate everything

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Tim Kleinloog <tkleinloog@deeploy.ml>
TimKleinloog pushed a commit to TimKleinloog/kserve that referenced this pull request Feb 20, 2024
* feat: Add HuggingFace runtime out-of-the-box support

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Update violation list

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Add to charts

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Regenerate everything

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
timothyjlaurent pushed a commit to timothyjlaurent/kserve that referenced this pull request Feb 21, 2024
* feat: Add HuggingFace runtime out-of-the-box support

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Update violation list

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Add to charts

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Regenerate everything

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix lint

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Empty-Commit

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants