Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow users to force a target_device #15

Conversation

Xaenalt
Copy link

@Xaenalt Xaenalt commented May 10, 2023

This is a workaround to the adapter that allows a user to set OVMS_FORCE_TARGET_DEVICE in the builtInAdapter.env section. This is then written into the config for each model that the model server manages

This is needed right now because when on a node with an Nvidia GPU, OVMS does not always ensure that the model goes onto the right device.

The changes needed to the runtime are:

spec:
  builtInAdapter:
    env:
      - name: OVMS_FORCE_TARGET_DEVICE
        value: NVIDIA

Note, this actually accepts any value from https://github.com/openvinotoolkit/model_server/blob/main/docs/accelerators.md so it enables a lot of the interesting behavior OVMS is capable of

Same changes as red-hat-data-services#3

device in OVMS

This is a workaround to the adapter that allows a user to set
OVMS_FORCE_TARGET_DEVICE in the builtInAdapter.env section. This is then
written into the config for each model that the model server manages

Signed-off-by: Sean Pryor <spryor@redhat.com>
@Xaenalt Xaenalt force-pushed the ovms-force-device-opendatahub branch from f52aade to 453f769 Compare May 10, 2023 22:35
@Jooho
Copy link

Jooho commented May 11, 2023

@Xaenalt I have couple of questions.

  • is it for opendatahub only?
    • Do we need to put this into the release branch too?
  • sound like it is a temporary change so will you update this later?

@Xaenalt
Copy link
Author

Xaenalt commented May 11, 2023

@Jooho Needs to go in 1.27 release by code freeze tomorrow.

This is temporary until the AUTO plugin supports Nvidia, but may be desired longer term to enable more of the functions of the target_device field. It may get restructured later on if that's the case

@Xaenalt Xaenalt changed the title Temporary change to allow users to force a target Temporary change to allow users to force a target_device May 11, 2023
@Xaenalt Xaenalt changed the title Temporary change to allow users to force a target_device Allow users to force a target_device May 11, 2023
Copy link

@Jooho Jooho left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@openshift-ci
Copy link

openshift-ci bot commented May 11, 2023

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: Jooho, Xaenalt

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-merge-robot openshift-merge-robot merged commit 1d95acc into opendatahub-io:main May 11, 2023
6 checks passed
@Xaenalt
Copy link
Author

Xaenalt commented May 11, 2023

/cherry-pick release-v0.11.0-alpha

@openshift-cherrypick-robot

@Xaenalt: only opendatahub-io org members may request cherry picks. You can still do the cherry-pick manually.

In response to this:

/cherry-pick release-v0.11.0-alpha

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@Jooho
Copy link

Jooho commented May 11, 2023

/cherry-pick release-v0.11.0-alpha

@openshift-cherrypick-robot

@Jooho: only opendatahub-io org members may request cherry picks. You can still do the cherry-pick manually.

In response to this:

/cherry-pick release-v0.11.0-alpha

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants