
Possibly GPU memory leak? #24

Closed · kshieh1 opened this issue Apr 13, 2023 · 11 comments

Comments

@kshieh1

kshieh1 commented Apr 13, 2023

Hi,

I found a GPU out-of-memory (OOM) error when using compel in my project. I made a shorter test program out of your compel-demo.py:

import torch
from compel import Compel
from diffusers import StableDiffusionPipeline, DPMSolverMultistepScheduler
from torch import Generator

device = "cuda"
pipeline = StableDiffusionPipeline.from_pretrained("dreamlike-art/dreamlike-photoreal-2.0",
                                                   torch_dtype=torch.float16).to(device)
# dpm++
pipeline.scheduler = DPMSolverMultistepScheduler.from_config(pipeline.scheduler.config,
                                                             algorithm_type="dpmsolver++")

COMPEL = True
compel = Compel(tokenizer=pipeline.tokenizer, text_encoder=pipeline.text_encoder)

i = 0
while True:
    prompts = ["a cat playing with a ball++ in the forest", "a cat playing with a ball in the forest"]

    if COMPEL:
        prompt_embeds = torch.cat([compel.build_conditioning_tensor(prompt) for prompt in prompts])
        images = pipeline(prompt_embeds=prompt_embeds, num_inference_steps=10, width=256, height=256).images
        #del prompt_embeds # not helping
    else:
        images = pipeline(prompt=prompts, num_inference_steps=10, width=256, height=256).images
    i += 1
    print(i, images)

    images[0].save('img0.jpg')
    images[1].save('img1.jpg')

Tested on an Nvidia RTX 3050 Ti Mobile GPU with 4 GB VRAM; an OOM exception occurs after 10~20 iterations. No OOM if COMPEL = False.

@damian0815
Owner

hmm, compel is basically stateless, there isn't much that could leak that i have much control over. torch is sometimes poor at cleaning up its caches properly, you might want to try calling torch.cuda.empty_cache() occasionally
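
A minimal sketch of that suggestion, applied to the reproduction loop above (where to put the call is just an assumption, not a verified fix):

import torch

while True:
    prompt_embeds = torch.cat([compel.build_conditioning_tensor(p) for p in prompts])
    images = pipeline(prompt_embeds=prompt_embeds, num_inference_steps=10,
                      width=256, height=256).images
    # ask torch to release cached, currently-unused GPU blocks back to the driver
    torch.cuda.empty_cache()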

@kshieh1
Author

kshieh1 commented Apr 14, 2023

Thanks. I think I have pushed VRAM usage to the edge -- maybe torch needs some extra room to maneuver...

(Updated Apr. 17) OOM occurs even if the prompt embeddings are just built repeatedly without running inference (i.e., with images = pipeline(...) commented out). torch.cuda.empty_cache() does not help.

@damian0815
Owner

urgh. idk. i also don't have a local gpu to readily debug this. have you tried tearing down the compel instance and making a new one for each prompt?

@kshieh1
Author

kshieh1 commented Apr 26, 2023

Interesting. I ran the same test on Google Colab (GPU w/ 12 GB VRAM) and no OOM issue occurred. Then I updated my local environment with the exact same package versions (e.g., torch, diffusers, compel, etc.) as the Colab, but the OOM issue still occurs. Local tests were on Nvidia GPUs with 4 GB and 8 GB, btw.

Initializing & deleting the compel instance inside the loop doesn't help, fyi.

@jbhurruth

@kshieh1 Did you ever figure out a solution to this? I'm also hitting my 6GB limit as soon as I use the compel embeddings

@kshieh1
Author

kshieh1 commented May 25, 2023

@kshieh1 Did you ever figure out a solution to this? I'm also hitting my 6GB limit as soon as I use the compel embeddings

No luck so far

@kshieh1
Author

kshieh1 commented May 26, 2023

I think I have come up with a solution. After image generation, you should explicitly de-reference the tensor object (i.e., prompt_embeds = None) and call gc.collect().
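
For reference, a minimal sketch of this workaround applied to the reproduction loop above:

import gc

while True:
    prompt_embeds = torch.cat([compel.build_conditioning_tensor(p) for p in prompts])
    images = pipeline(prompt_embeds=prompt_embeds, num_inference_steps=10,
                      width=256, height=256).images

    prompt_embeds = None  # drop the last reference to the conditioning tensor
    gc.collect()          # force collection so the underlying GPU memory can actually be freed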

@damian0815
Owner

ahh nice. i'll add a note to the readme for the next version. thanks for sharing your solution!

@damian0815
Owner

The readme has been updated.

@damian0815
Owner

@kshieh1 we encountered a possibly related (possibly the same?) problem in InvokeAI, which was resolved by doing calls to Compel inside a with torch.no_grad(): block. did you try this?
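
For reference, a minimal sketch of what that looks like with the reproduction loop above (only the embedding build needs to sit inside the block):

with torch.no_grad():
    # building the embeddings without autograd avoids keeping graph state alive on the GPU
    prompt_embeds = torch.cat([compel.build_conditioning_tensor(p) for p in prompts])

images = pipeline(prompt_embeds=prompt_embeds, num_inference_steps=10,
                  width=256, height=256).images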

@kshieh1
Author

kshieh1 commented Jul 5, 2023

Yeah, I just did a quick test and found that the amount of CUDA memory allocated stays stable -- I think I can get rid of those costly gc.collect() operations in my code.

Thanks for sharing.
