-
Notifications
You must be signed in to change notification settings - Fork 25.7k
Closed
Labels
actionablegood first issuemodule: optimizerRelated to torch.optimRelated to torch.optimmodule: pt2 optimizerRelating to torch.compile'd optimRelating to torch.compile'd optimoncall: pt2triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module
Description
🚀 The feature, motivation and pitch
Currently RAdam doesn't have a capturable implementation for cudagraphs. This is inconsistent with the other optimizer implementations. I implemented a foreach capturable implementation in #117912. It would be useful to extend this to single tensor as a starting point.
Alternatives
No response
Additional context
No response
cc @vincentqb @jbschlosser @albanD @janeyx99 @crcrpar @ezyang @chauhang @penguinwu
Metadata
Metadata
Assignees
Labels
actionablegood first issuemodule: optimizerRelated to torch.optimRelated to torch.optimmodule: pt2 optimizerRelating to torch.compile'd optimRelating to torch.compile'd optimoncall: pt2triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module