Add workaround for MPS gather crash #15

Merged: 1 commit merged into dbolya:main on Apr 3, 2023

Conversation

@brkirch (Contributor) commented on Apr 2, 2023

This is a workaround for gather on MPS that uses unsqueeze before gather and squeeze afterwards so that gather doesn't cause a crash. Fixes #4.
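For reference, the pattern is roughly the following. This is only a minimal sketch of the unsqueeze/gather/squeeze idea; the helper name and the shape check here are illustrative and may differ from the actual patched code.

```python
import torch

def mps_gather_workaround(input: torch.Tensor, dim: int, index: torch.Tensor) -> torch.Tensor:
    # torch.gather on MPS can crash when the tensor being gathered from has a
    # trailing dimension of size 1, so temporarily add another singleton
    # dimension, gather, and then remove it again.
    if input.shape[-1] == 1:
        return torch.gather(
            input.unsqueeze(-1),
            # unsqueeze(-1) adds one dimension at the end, so a negative dim
            # now refers to a different axis and has to be shifted by one.
            dim - 1 if dim < 0 else dim,
            index.unsqueeze(-1),
        ).squeeze(-1)
    return torch.gather(input, dim, index)
```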

@dbolya (Owner) commented on Apr 2, 2023

Oh sweet, I didn't realize the fix was this simple. I don't have an MPS device to test this on, so can someone else (e.g., @KohakuBlueleaf or @tvdtran) confirm that it works for them?

Edit: This might also fix the directml issue (#13). If so, the workaround could be applied when the device type is "mps" or "dml".
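If that pans out, the selection could look roughly like this. This is a hypothetical sketch: pick_gather and the "dml" device-type string are assumptions, and the helper body just repeats the unsqueeze/gather/squeeze trick sketched above.

```python
import torch

def mps_gather_workaround(input, dim, index):
    # Same unsqueeze/gather/squeeze trick as sketched above.
    return torch.gather(input.unsqueeze(-1), dim - 1 if dim < 0 else dim,
                        index.unsqueeze(-1)).squeeze(-1)

def pick_gather(device: torch.device):
    # Hypothetical dispatch: only swap in the workaround on device types that
    # are suspected to crash with plain torch.gather.
    return mps_gather_workaround if device.type in ("mps", "dml") else torch.gather

# Usage would then look something like:
#   gather = pick_gather(metric.device)
#   dst_idx = gather(node_idx[..., None], dim=-2, index=src_idx)
```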

@GreenLandisaLie commented:

> Oh sweet, I didn't realize the fix was this simple. I don't have an MPS device to test this on, so can someone else (e.g., @KohakuBlueleaf or @tvdtran) confirm that it works for them?
>
> Edit: This might also fix the directml issue (#13). If so, the workaround could be applied when the device type is "mps" or "dml".

I tried it with directml (https://github.com/lshqqytiger/stable-diffusion-webui-directml) and still got an error in:
return torch.gather(input.unsqueeze(-1), dim - 1 if dim < 0 else dim, index.unsqueeze(-1)).squeeze(-1)
triggered after this line:
dst_idx = gather(node_idx[..., None], dim=-2, index=src_idx)
The error was something akin to "invalid parameter". Also, metric.device.type was not "dml" but something else I can't remember; that might be specific to the UI I mentioned.

@dbolya (Owner) commented on Apr 2, 2023

> I tried it with directml (https://github.com/lshqqytiger/stable-diffusion-webui-directml) and still got an error in: return torch.gather(input.unsqueeze(-1), dim - 1 if dim < 0 else dim, index.unsqueeze(-1)).squeeze(-1) triggered after this line: dst_idx = gather(node_idx[..., None], dim=-2, index=src_idx) The error was something akin to "invalid parameter". Also, metric.device.type was not "dml" but something else I can't remember; that might be specific to the UI I mentioned.

Thanks for testing this. It seems the directml issue is separate, unfortunately. We can keep this as a fix just for MPS and look into a separate fix for directml.

@jrittvo commented on Apr 3, 2023

Using this patched tomesd version in a simple Python diffusers pipeline threw an error saying I needed to set an environment variable: export PYTORCH_ENABLE_MPS_FALLBACK=1

It worked after that. A 768x768 straight generation went from 3.75 s/it without tomesd to 2.13 s/it with tomesd at 0.5, so I assume the "fallback" isn't much of a detriment, if it is one at all. I don't know how to do Hires fix with Python commands yet, so I can't test this all the way.
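For anyone wanting to reproduce this, the setup looks roughly like the following. This is a hedged sketch: the checkpoint id, the prompt, and passing the pipeline directly to tomesd.apply_patch are illustrative choices, not taken from this thread.

```python
import os

# PYTORCH_ENABLE_MPS_FALLBACK must be set before torch initializes the MPS
# backend, so set it before importing torch/diffusers (or export it in the shell).
os.environ["PYTORCH_ENABLE_MPS_FALLBACK"] = "1"

from diffusers import StableDiffusionPipeline
import tomesd

# Illustrative checkpoint; any Stable Diffusion pipeline works the same way.
pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5").to("mps")

# Token merging at ratio=0.5 corresponds to "tomesd at 0.5" above.
tomesd.apply_patch(pipe, ratio=0.5)

# 768x768 straight generation, matching the timing test above.
image = pipe("a photo of an astronaut riding a horse",
             height=768, width=768).images[0]
image.save("out.png")
```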

@dbolya (Owner) commented on Apr 3, 2023

> Using this patched tomesd version in a simple Python diffusers pipeline threw an error saying I needed to set an environment variable: export PYTORCH_ENABLE_MPS_FALLBACK=1
>
> It worked after that. A 768x768 straight generation went from 3.75 s/it without tomesd to 2.13 s/it with tomesd at 0.5, so I assume the "fallback" isn't much of a detriment, if it is one at all. I don't know how to do Hires fix with Python commands yet, so I can't test this all the way.

Sounds good to me. Requiring PYTORCH_ENABLE_MPS_FALLBACK=1 is better than just crashing, and that speed-up is not too far off what's expected (a 1.76x speed-up on MPS vs. a ~1.87x speed-up on CUDA).
Thank you for testing this! Merging now.

dbolya merged commit 6051813 into dbolya:main on Apr 3, 2023
Successfully merging this pull request may close these issues: Failed to run on M1 Mac