Skip to content

Use ATen CUDA event/stream wrappers in c10d #11912

@pietern

Description

@pietern

In c10d we still use legacy CUDA event and stream wrappers. These are not aware of the CUDA stream pool that was added by @mruberry in #9938.

One issue we need to solve beforehand is that currently Gloo algorithms take a stream at construction time and we cache them in the c10d process group. This needs to be redone to support passing a stream when running the algorithm instead of when it is constructed.

@mruberry I owed you this issue. We can use this issue to discuss how to proceed.

Metadata

Metadata

Assignees

Labels

oncall: distributedAdd this issue/PR to distributed oncall triage queue

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions