Training fixes for MPS #5810

brkirch · 2022-12-17T08:51:32Z

Training itself works with the MPS device on PyTorch 1.12.1, but the result fails to save because torch.save() throws an exception: "RuntimeError: Can't call numpy() on Tensor that requires grad. Use tensor.detach().numpy() instead." So for MPS on PyTorch 1.12.1, use a monkey patch to call detach() before numpy() on any torch.Tensor that has requires_grad set to True.
Also add the attributes used by MPS to the safety checker.

With these changes training seems to work correctly on MPS.
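
For readers who want to see the shape of the numpy() workaround, here is a minimal sketch. It is not the code from this PR: the helper name `patch_numpy_to_detach` and the gating check at the bottom are assumptions made for illustration. The idea is only what the description states: wrap `Tensor.numpy` so that a tensor with `requires_grad` set is detached first.

```python
def patch_numpy_to_detach(tensor_cls):
    """Wrap tensor_cls.numpy so tensors that require grad are detached first.

    Hypothetical helper for illustration; returns the original method so the
    patch can be undone by assigning it back.
    """
    original_numpy = tensor_cls.numpy

    def numpy_with_detach(self, *args, **kwargs):
        # detach() returns a tensor sharing the same data but without
        # autograd tracking, which numpy() accepts.
        if self.requires_grad:
            self = self.detach()
        return original_numpy(self, *args, **kwargs)

    tensor_cls.numpy = numpy_with_detach
    return original_numpy


try:
    import torch
except ImportError:  # keep the sketch importable without PyTorch installed
    torch = None

if torch is not None:
    # Assumption: apply the patch only for MPS on PyTorch 1.12.1, as the
    # description says. The exact condition used in the PR may differ.
    mps_backend = getattr(torch.backends, "mps", None)
    has_mps = mps_backend is not None and mps_backend.is_available()
    if has_mps and torch.__version__.startswith("1.12.1"):
        patch_numpy_to_detach(torch.Tensor)
```

Because the wrapper delegates to the original method for tensors that do not require grad, existing callers of numpy() should see no change in behavior.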

brkirch requested a review from AUTOMATIC1111 as a code owner December 17, 2022 08:51

brkirch added 2 commits December 17, 2022 04:22

Add numpy fix for MPS on PyTorch 1.12.1

16b4509

When saving training results with torch.save(), an exception is thrown: "RuntimeError: Can't call numpy() on Tensor that requires grad. Use tensor.detach().numpy() instead." So for MPS, check if Tensor.requires_grad and detach() if necessary.

Add attributes used by MPS

cca1637

brkirch force-pushed the fix-training-mps branch from d61b1ef to cca1637 December 17, 2022 09:24

julianko13 approved these changes Dec 21, 2022

AUTOMATIC1111 approved these changes Dec 24, 2022

AUTOMATIC1111 merged commit 3bfc6c0 into AUTOMATIC1111:master Dec 24, 2022

brkirch deleted the fix-training-mps branch December 27, 2022 12:05
