torch.AcceleratorError: CUDA error: invalid device ordinal #176

Closed

Labels

easyenhancementhigh prioritytriage review

opened

on Oct 17, 2025

when we try reproducing a multigpu run and the original tensor locates at GPU6, our script raises this error. we need a better device mapping

Metadata

Assignees

No one assigned

Labels

easyenhancementhigh prioritytriage review

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests