Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a graph that shows a ratio of cpu utilization / logical cpu per task name? #45910

Open
alanwguo opened this issue Jun 12, 2024 · 0 comments
Labels
core Issues that should be addressed in Ray Core enhancement Request for new feature and/or capability observability Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling p0.5 uueeehhh

Comments

@alanwguo
Copy link
Contributor

alanwguo commented Jun 12, 2024

One issue users run into is that they are under-utilizing their cluster because they request more logical cpus than a task requires.

One idea is to have a metric that shows a ratio of cpu utilization / logical cpu per task name.

One way this manifests is with ray data, each dataset task requests m cpus and n concurrency. m x n cpus are used correctly but overall cluster utilization is at 40% cpu. It can be counter-intuitive to think to improve utilization, concurrency should not be increased, instead num cpus should be decreased.

@alanwguo alanwguo added observability Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling core Issues that should be addressed in Ray Core labels Jun 12, 2024
@anyscalesam anyscalesam added enhancement Request for new feature and/or capability triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Jul 1, 2024
@jjyao jjyao added p0.5 uueeehhh and removed triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Jul 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core Issues that should be addressed in Ray Core enhancement Request for new feature and/or capability observability Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling p0.5 uueeehhh
Projects
None yet
Development

No branches or pull requests

3 participants