Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]: ZenML GPU scheduling #1204

Closed
swicaksono opened this issue Dec 26, 2022 · 4 comments · Fixed by #1223
Closed

[BUG]: ZenML GPU scheduling #1204

swicaksono opened this issue Dec 26, 2022 · 4 comments · Fixed by #1223
Labels
bug Something isn't working

Comments

@swicaksono
Copy link

swicaksono commented Dec 26, 2022

Contact Details

sonywi.cs@gmail.com

System Information

ZenML version: 0.30.0
Install path: /Users/user19/Library/Caches/pypoetry/virtualenvs/ktp-object-detection-DhOqw35J-py3.8/lib/python3.8/site-packages/zenml
Python version: 3.8.11
Platform information: {'os': 'mac', 'mac_version': '12.6'}
Environment: native
Integrations: ['kubernetes', 'mlflow', 'pillow', 'scipy', 'sklearn', 'tensorboard', 'tensorflow']

What happened?

How to schedule a specific resource such as GPU when run a ZenML pipeline on Kubernetes orchestrator?

How to parse this k8s manifest manifest for a Pod that requests a GPU in the Kubernetes settings?

resources:
    limits:
     gpu-vendor.example/example-gpu: 1 # requesting 1 GPU

I tried to add those manifest but apparently it does not schedule the process using GPU.

@swicaksono swicaksono added the bug Something isn't working label Dec 26, 2022
@htahir1
Copy link
Contributor

htahir1 commented Dec 26, 2022

Thank you for the report! The ZenML team is away until 3rd January to respond to issues. We'll get to this report as soon as we're back! Sorry for the trouble!

@swicaksono
Copy link
Author

Hi @htahir1 is there any update on the issue? Thanks!

@schustmi
Copy link
Collaborator

schustmi commented Jan 9, 2023

Hi @swicaksono, custom resource limits aren't configurable at the moment. I'll take a look if I can add it today

@schustmi schustmi linked a pull request Jan 9, 2023 that will close this issue
8 tasks
@schustmi
Copy link
Collaborator

Our latest release 0.31.1 comes with updated KubernetesPodSettings which you can use to specify resource requests and limits: https://github.com/zenml-io/zenml/blob/main/src/zenml/integrations/kubernetes/pod_settings.py

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants