[Tracker]: Model Serving v2 - KServe Support #1810

Closed
andrewballantyne opened this issue Sep 18, 2023 · 7 comments · Fixed by #2123
Labels: feature/model-serving (Model Serving Feature), rhods-2.5, tracker (Non-completable ticket; used for tracking work - akin to a Jira Epic)

Comments

@andrewballantyne
Member

andrewballantyne commented Sep 18, 2023

Description

We'd like to support hosting KServe model serving servers. This differs from ModelMesh (our current structure).

Target Branch

f/model-serving

Requirements

  1. Model serving runtimes

    • P0: Out-of-the-box (OOTB) support for Caikit/TGIS
    • P0 (depending on feasibility): NVIDIA Triton Inference Server
    • P0: Restrict some model serving runtimes so that they can only be enabled in KServe
  2. Support options for KServe and/or ModelMesh (KServe: 1 model per pod; ModelMesh: multiple models per pod)

    • P0: RHODS admins should be able to configure whether they want to use KServe (single model serving + additional functionality), ModelMesh, or both
    • P0: Users should be able to select KServe or ModelMesh on project creation
      • Only one can be enabled per project
    • P0: Users should be able to deploy from the global view to both KServe and ModelMesh
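
To make the requirements above concrete, here is a rough sketch of how the platform rules could be modeled. This is only an illustration under assumptions: every type, field, and function name below (`ServingPlatform`, `AdminServingSettings`, `ServingRuntime`, `selectablePlatforms`, `choosePlatformForProject`, `runtimesForPlatform`) is hypothetical and not taken from the dashboard codebase or from KServe/ModelMesh APIs.

```typescript
// Hypothetical model of the requirements; none of these names exist in the real codebase.
type ServingPlatform = 'kserve' | 'modelmesh';

interface AdminServingSettings {
  // Platforms the admin has enabled: KServe, ModelMesh, or both.
  enabledPlatforms: ServingPlatform[];
}

interface ServingRuntime {
  name: string;
  // Platforms this runtime may be enabled on; a KServe-only runtime lists just 'kserve'.
  supportedPlatforms: ServingPlatform[];
}

// Platforms a user may pick from on project creation (deduplicated, order preserved).
function selectablePlatforms(settings: AdminServingSettings): ServingPlatform[] {
  return settings.enabledPlatforms.filter(
    (platform, index, all) => all.indexOf(platform) === index,
  );
}

// A project gets exactly one platform; reject a choice the admin has not enabled.
function choosePlatformForProject(
  settings: AdminServingSettings,
  choice: ServingPlatform,
): ServingPlatform {
  if (settings.enabledPlatforms.indexOf(choice) === -1) {
    throw new Error(`Platform "${choice}" is not enabled by the admin`);
  }
  return choice;
}

// Runtimes that can be offered for a given platform.
function runtimesForPlatform(
  runtimes: ServingRuntime[],
  platform: ServingPlatform,
): ServingRuntime[] {
  return runtimes.filter((runtime) => runtime.supportedPlatforms.indexOf(platform) !== -1);
}
```

Because a project stores a single `ServingPlatform` value rather than a list, the "only one can be enabled per project" rule falls out of the type itself, while the global view could call `runtimesForPlatform` once per enabled platform.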

Itemized UX Issues

Itemized Dev Issues

General configuration

Custom Serving Runtimes

Admin Settings

Projects

Global View

Enhancements


Incubation work

Incubation Enhancements (can be done post-MVP)

Related artifacts

andrewballantyne added the tracker and feature/model-serving labels on Sep 18, 2023
@andrewballantyne
Member Author

cc @lucferbux please adjust this tracker to fit what we want for v2 -- I imagine some of the issues might come from: #1312

@lucferbux
Contributor

Yes, that's right. I would say all the issues from #1312 can go to v2; that old tracker was just a filler while we waited for v2. Let me rework this.

@lucferbux
Contributor

lucferbux commented Sep 28, 2023

[UPDATE] We are gonna keep #1312, since it has some issues that can be fixed after KServe is implemented. I'm moving only the tasks that are related to KServe development.

@gmarkley-VI

Is this still on UX? Can you please update the status on this? @lucferbux @vconzola

@vconzola

vconzola commented Oct 9, 2023

@gmarkley-VI This is a Tracker. It looks like it has a Tracker State of "UX investigation", whatever that means. I have first pass designs that were reviewed at last Thursday's UX meeting, but the UX will likely be iterated on for quite a while. This should not prevent dev work from getting started. I'm not sure when we declare Tracker issues to be "off" UX?

@andrewballantyne
Member Author

I moved this into progress -- Lucas and Juntao are working on it.

@lucferbux
Contributor

Moving the enhancements to #1313

Projects
Status: Done
Status: No status
Archived in project
6 participants