Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[core][experimental] Meta-issue: Support transferring GPU tensors in accelerated DAG #43830

Open
4 of 5 tasks
stephanie-wang opened this issue Mar 8, 2024 · 0 comments
Open
4 of 5 tasks
Assignees
Labels
compiled-graph core Issues that should be addressed in Ray Core enhancement Request for new feature and/or capability P1 Issue that should be fixed within a few weeks

Comments

@stephanie-wang
Copy link
Contributor

stephanie-wang commented Mar 8, 2024

Description

Add a channel backend for sending GPU tensors between DAG executors. This should cover:

  • single node vs distributed
  • ability to close the channel without destroying the processes
  • must not hang if there are NCCL ops within the DAG task (relevant if using NCCL to send/recv)

Checklist:

Use case

No response

@stephanie-wang stephanie-wang added enhancement Request for new feature and/or capability triage Needs triage (eg: priority, bug/not-bug, and owning component) P1 Issue that should be fixed within a few weeks core Issues that should be addressed in Ray Core compiled-graph and removed triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Mar 8, 2024
@stephanie-wang stephanie-wang changed the title [core] Support transferring GPU tensors in accelerated DAG [core][experimental] Support transferring GPU tensors in accelerated DAG Mar 8, 2024
@stephanie-wang stephanie-wang changed the title [core][experimental] Support transferring GPU tensors in accelerated DAG [core][experimental] Meta-issue: Support transferring GPU tensors in accelerated DAG May 13, 2024
@anyscalesam anyscalesam added this to the ADAG Developer Preview milestone May 23, 2024
@stephanie-wang stephanie-wang removed this from the ADAG Developer Preview milestone Jun 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
compiled-graph core Issues that should be addressed in Ray Core enhancement Request for new feature and/or capability P1 Issue that should be fixed within a few weeks
Projects
None yet
Development

No branches or pull requests

2 participants