Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add OVMS OOTB with GPU support #1262

Merged
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 5 additions & 4 deletions manifests/modelserving/kustomization.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,9 @@ commonLabels:
app: odh-dashboard
app.kubernetes.io/part-of: odh-dashboard
resources:
- servingruntimes-template.yaml
- ovms-ootb.yaml
- ovms-gpu-ootb.yaml
images:
- name: ovms-1
newName: quay.io/opendatahub/openvino_model_server
digest: sha256:20dbfbaf53d1afbd47c612d953984238cb0e207972ed544a5ea662c2404f276d
- name: ovms-1
newName: quay.io/opendatahub/openvino_model_server
digest: sha256:20dbfbaf53d1afbd47c612d953984238cb0e207972ed544a5ea662c2404f276d
60 changes: 60 additions & 0 deletions manifests/modelserving/ovms-gpu-ootb.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,60 @@
kind: Template
apiVersion: template.openshift.io/v1
metadata:
name: ovms-gpu
labels:
opendatahub.io/dashboard: 'true'
opendatahub.io/ootb: 'true'
opendatahub.io/configurable: 'true'
annotations:
tags: 'ovms,servingruntime'
description: 'OpenVino with GPU Support Model Serving Definition'
objects:
- apiVersion: serving.kserve.io/v1alpha1
kind: ServingRuntime
metadata:
name: ovms-gpu
annotations:
openshift.io/display-name: 'OpenVINO Model Server (Supports GPUs)'
labels:
opendatahub.io/dashboard: 'true'
spec:
builtInAdapter:
env:
- name: OVMS_FORCE_TARGET_DEVICE
value: NVIDIA
memBufferBytes: 134217728
modelLoadingTimeoutMillis: 90000
runtimeManagementPort: 8888
serverType: ovms
containers:
- args:
- '--port=8001'
- '--rest_port=8888'
- '--config_path=/models/model_config_list.json'
- '--file_system_poll_wait_seconds=0'
- '--grpc_bind_address=127.0.0.1'
- '--rest_bind_address=127.0.0.1'
image: ovms-1
name: ovms
resources:
limits:
cpu: '0'
memory: 0Gi
requests:
cpu: '0'
memory: 0Gi
grpcDataEndpoint: 'port:8001'
grpcEndpoint: 'port:8085'
multiModel: true
protocolVersions:
- grpc-v1
replicas: 1
supportedModelFormats:
- autoSelect: true
name: openvino_ir
version: opset1
- autoSelect: true
name: onnx
version: '1'
parameters: []
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ objects:
name: ovms
annotations:
openshift.io/display-name: 'OpenVINO Model Server'
opendatahub.io/disable-gpu: 'true'
labels:
opendatahub.io/dashboard: 'true'
spec:
Expand Down
Loading