Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Custom criteria for select_devices #60

Closed
2 tasks done
ioslide opened this issue Mar 17, 2023 · 2 comments
Closed
2 tasks done

[Question] Custom criteria for select_devices #60

ioslide opened this issue Mar 17, 2023 · 2 comments
Assignees
Labels
api Something related to the core APIs question Further information is requested

Comments

@ioslide
Copy link

ioslide commented Mar 17, 2023

Required prerequisites

  • I have searched the Issue Tracker that this hasn't already been reported. (comment there if it has.)
  • I have tried the latest version of nvitop in a new isolated virtual environment.

Motivation

Can "select_devices" be updated to specify a GPU and policy? If the policy is set to "queue", the specified GPU can be held until it meets the requirements.

Solution

No response

Alternatives

No response

Additional context

No response

@ioslide ioslide added the enhancement New feature or request label Mar 17, 2023
@XuehaiPan XuehaiPan changed the title [Feature Request] [Feature Request] Custom criteria for select_devices Mar 17, 2023
@XuehaiPan XuehaiPan changed the title [Feature Request] Custom criteria for select_devices [Question] Custom criteria for select_devices Mar 17, 2023
@XuehaiPan XuehaiPan added question Further information is requested api Something related to the core APIs and removed enhancement New feature or request labels Mar 17, 2023
@XuehaiPan
Copy link
Owner

Can "select_devices" be updated to specify a GPU and policy?

You can pass devices argument to select_devices. Maybe also add sort=False.

devices = select_devices(devices, <criteria args>, sort=False)

For nvisel CLI, use CUDA_VIRIBLE_DEVICES and --no-sort (-S):

$ nvisel -i "GPU-18ef14e9,GPU-849d5a8d" -S [<criteria options>]
5,6

If the policy is set to "queue", the specified GPU can be held until it meets the requirements.

You can write your own logic about that. It may be too complex to add in nvitop.select natively.

@XuehaiPan
Copy link
Owner

Closing for it is a specific usage and really depends on the hardware setup. Workload management is kind of out of topic. You can always write your own logic for device selection or use specialized software like slurm.

@XuehaiPan XuehaiPan closed this as not planned Won't fix, can't repro, duplicate, stale Mar 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api Something related to the core APIs question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants