[xformers] training is broken with `xformers` and PyTorch 2.1 #5484

sayakpaul · 2023-10-23T04:14:52Z

Related issue: #5368

When using PyTorch 2.1 and the latest stable build of xformers, our DreamBooth LoRA script for SDXL doesn't work. #5368 provides more details.

But when using SDPA in the same environment (i.e., no xformers), the issue seems to go away.

Dev environment for this can be found here:
https://github.com/huggingface/diffusers/blob/main/docker/diffusers-pytorch-compile-cuda/Dockerfile

When using PyTorch 2.0.1 with xformers==0.0.21, there seem to be no issues with the exact same script. PyTorch was installed with pip install torch==2.0.1+cu117 --index-url https://download.pytorch.org/whl/cu117 inside a Docker image mounted from nvidia/cuda:11.7.1-cudnn8-runtime-ubuntu20.04.

Cc: @patrickvonplaten @williamberman

The text was updated successfully, but these errors were encountered:

danthe3rd · 2023-10-24T07:03:57Z

Hi,
Thanks for opening this issue.
Currently xformers does not work well with autocast (if that's what is used for mixed precision?), so we recommend that you cast all inputs to the same dtype (float16 in your case): this is also written in the error log:

ValueError: Query/Key/Value should all have the same dtype
  query.dtype: torch.float32
  key.dtype  : torch.float16
  value.dtype: torch.float16

github-actions · 2023-11-22T15:05:08Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

github-actions bot added the stale Issues that haven't received updates label Nov 22, 2023

jinnsp mentioned this issue Nov 24, 2023

Breaking wth --enable_xformers_memory_efficient_attention \ mkshing/ziplora-pytorch#4

Open

github-actions bot closed this as completed Dec 2, 2023

patrickvonplaten removed the stale Issues that haven't received updates label Dec 4, 2023

patrickvonplaten reopened this Dec 4, 2023

github-actions bot closed this as completed Dec 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[xformers] training is broken with `xformers` and PyTorch 2.1 #5484

[xformers] training is broken with `xformers` and PyTorch 2.1 #5484

sayakpaul commented Oct 23, 2023

danthe3rd commented Oct 24, 2023

github-actions bot commented Nov 22, 2023

[xformers] training is broken with xformers and PyTorch 2.1 #5484

[xformers] training is broken with xformers and PyTorch 2.1 #5484

Comments

sayakpaul commented Oct 23, 2023

danthe3rd commented Oct 24, 2023

github-actions bot commented Nov 22, 2023

[xformers] training is broken with `xformers` and PyTorch 2.1 #5484

[xformers] training is broken with `xformers` and PyTorch 2.1 #5484