
[0.0.18] memory_efficient_attention NaNs when seqlen>32768 #719

Closed
comfyanonymous opened this issue Apr 5, 2023 · 6 comments
Labels
bug Something isn't working

Comments

@comfyanonymous

🐛 Bug

To Reproduce

Steps to reproduce the behavior:

import torch
import xformers
import xformers.ops

# seqlen 33728 > 32768 triggers the bug (batch 1, embedding dim 512)
q = torch.zeros([1, 33728, 512]).cuda()
k = torch.zeros([1, 33728, 512]).cuda()
v = torch.zeros([1, 33728, 512]).cuda()
out = xformers.ops.memory_efficient_attention(q, k, v, attn_bias=None)
print(out)
print(torch.isnan(out).any())  # True on 0.0.18; should be False

Expected behavior

out should not contain NaN.

Environment

This test was done on the free Google Colab tier with a T4 GPU, using the 0.0.18 package from pip.

Additional context

It works fine on 0.0.17 but fails on 0.0.18.

People have been reporting that my ComfyUI returns black images during the VAE decoding phase when the resolution is above a certain threshold, and I have narrowed it down to this issue.
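
A possible interim workaround, not part of the original report and only a sketch: fall back to plain PyTorch attention for long sequences, computed over query chunks to keep memory bounded. The chunked_attention helper below is hypothetical, not xformers code.

import math
import torch

def chunked_attention(q, k, v, chunk=8192):
    # Plain-PyTorch softmax(q @ k^T / sqrt(d)) @ v, computed over query
    # chunks so the [chunk, seqlen] score matrix stays within GPU memory.
    # Hypothetical fallback while the bug is unpatched; slower and less
    # memory-efficient than xformers' kernel.
    scale = 1.0 / math.sqrt(q.shape[-1])
    outs = []
    for i in range(0, q.shape[1], chunk):
        q_blk = q[:, i:i + chunk]                        # [B, chunk, D]
        scores = q_blk @ k.transpose(-2, -1) * scale     # [B, chunk, S]
        outs.append(torch.softmax(scores, dim=-1) @ v)   # [B, chunk, D]
    return torch.cat(outs, dim=1)

With chunk=8192 and seqlen≈33728 in fp32, each score block is roughly 1 GB, so smaller chunks may be needed on a T4.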

@leppie

leppie commented Apr 5, 2023

I hit this issue too yesterday after upgrading to 0.0.18.

@danthe3rd danthe3rd pinned this issue Apr 5, 2023
@danthe3rd danthe3rd added the bug Something isn't working label Apr 5, 2023
@danthe3rd danthe3rd changed the title On 0.0.18 xformers.ops.memory_efficient_attention returns NaN on certain input shapes [0.0.18] xformers.ops.memory_efficient_attention returns NaN on certain input shapes Apr 5, 2023
@danthe3rd
Contributor

danthe3rd commented Apr 5, 2023

Hi,
Thanks for the minimal reproduction example. I can repro on an A100 - I'll have a look.
Also pinning this issue, as it's quite important.

@danthe3rd
Contributor

It seems to happen due to a cast to int16 at some point in the code, so it happens when the sequence length is larger than 32768 (the int16 maximum is 32767). Updating the title accordingly.
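
As an illustration of why 32768 is the boundary (this is not the actual kernel code, just a demonstration of int16 overflow):

import torch

# 32768 does not fit in int16 (max 32767), so the cast wraps around
idx = torch.tensor(32768, dtype=torch.int32)
print(idx.to(torch.int16))  # tensor(-32768, dtype=torch.int16)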

@danthe3rd danthe3rd changed the title [0.0.18] xformers.ops.memory_efficient_attention returns NaN on certain input shapes [0.0.18] xformers.ops.memory_efficient_attention returns NaN when seqlen>32768 Apr 5, 2023
@danthe3rd danthe3rd changed the title [0.0.18] xformers.ops.memory_efficient_attention returns NaN when seqlen>32768 [0.0.18] memory_efficient_attention NaNs when seqlen>32768 Apr 6, 2023
@danthe3rd
Contributor

I have a tentative fix - hopefully we can land it soon and release 0.0.19 early next week to address this.

@danthe3rd
Contributor

danthe3rd commented Apr 14, 2023

It should be fixed as of 68dce69, and will be included in the next release (0.0.19). In the meantime, you can also use a development build >=0.0.19.dev516.
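
To confirm which build is installed (a simple check, assuming the fix is present from 0.0.19.dev516 as stated above):

import xformers

# Should report 0.0.19.dev516 or newer (or 0.0.19 once released)
print(xformers.__version__)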

@patrickvonplaten

Thanks a mille for the fix @danthe3rd - very helpful thread here!
