
[IntegratedGradients] Memory leaks when calculating IG and LIG #866

Open
roma-glushko opened this issue Feb 16, 2022 · 4 comments

@roma-glushko

🐛 Bug

I'm experiencing RAM leaks when calculating word attributions through the https://github.com/cdpierse/transformers-interpret library, which delegates most of the heavy lifting to Captum, so I assume this issue is relevant to Captum.

To Reproduce

My setup is described in detail in the following issue: cdpierse/transformers-interpret#78

Expected behavior

Every time the model service receives a request that should include interpretability information, Captum calculates IG/LIG using some amount of RAM and then releases all of that RAM once it is done, so at the end of request processing the service uses roughly the same amount of memory as before.
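
A rough sketch of the kind of loop where I would expect RSS to stay flat (this is an approximation with a stand-in DistilBERT model and placeholder layer choice, not my exact setup — see the linked issue for that; `psutil` is only used to read the process RSS):

```python
import gc

import psutil
import torch
from captum.attr import LayerIntegratedGradients
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Stand-in model; the real setup uses a different model via transformers-interpret.
model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
model.eval()


def forward_func(input_ids, attention_mask):
    # Return logits so Captum can differentiate w.r.t. the target class.
    return model(input_ids=input_ids, attention_mask=attention_mask).logits


# Attribute to the embedding layer, as LIG is typically used with transformers.
lig = LayerIntegratedGradients(forward_func, model.distilbert.embeddings)

enc = tokenizer("Captum is great!", return_tensors="pt")
baseline_ids = torch.full_like(enc["input_ids"], tokenizer.pad_token_id)

proc = psutil.Process()
for i in range(50):
    attrs = lig.attribute(
        inputs=enc["input_ids"],
        baselines=baseline_ids,
        additional_forward_args=(enc["attention_mask"],),
        target=1,
        n_steps=50,
    )
    del attrs
    gc.collect()
    # Expectation: this number stabilises after the first few iterations.
    print(f"iteration {i}: RSS = {proc.memory_info().rss / 1e6:.1f} MB")
```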

Environment

 - Captum / PyTorch Version: 0.4.0 / 1.9.1
 - OS: registry.access.redhat.com/ubi8/python-39:latest docker image
 - How you installed Captum / PyTorch: via Poetry
 - Python version: 3.9
 - CUDA/cuDNN version: N/A, running on CPU
@NarineK
Contributor

NarineK commented Feb 17, 2022

@roma-glushko, this is interesting because LIG and IG are stateless; there shouldn't be any memory leak. Have you tried using another layer method such as captum.attr.LayerActivation? Do you see a similar issue?
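
For reference, swapping in LayerActivation for the same layer could look roughly like this (a sketch reusing the hypothetical DistilBERT setup from the issue description; since LayerActivation only runs a forward pass, if memory still grows here the gradient path is probably not the culprit):

```python
from captum.attr import LayerActivation

# forward_func, model, enc, gc and proc are assumed from the earlier sketch.
layer_act = LayerActivation(forward_func, model.distilbert.embeddings)

for i in range(50):
    acts = layer_act.attribute(
        enc["input_ids"],
        additional_forward_args=(enc["attention_mask"],),
    )
    del acts
    gc.collect()
    print(f"iteration {i}: RSS = {proc.memory_info().rss / 1e6:.1f} MB")
```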

@roma-glushko
Author

Hey @NarineK, thank you for the reply!
Unfortunately, I have no idea about that; maybe @cdpierse does.

In any case, I could not track it down on the Python side. My gut feeling is that the issue goes beyond the Python realm and may lie at the C/C++ level, although I have no direct evidence of that other than not being able to find any gradient leaks while debugging the Python codebase.
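
One way to sanity-check that (a sketch, not exactly what I ran) is to diff tracemalloc snapshots around a batch of attribute() calls: if RSS keeps climbing while the tracemalloc totals stay flat, the growth is happening in native (C/C++) allocations rather than in Python objects. This reuses the hypothetical `lig` / `enc` / `baseline_ids` from the sketch in the issue description.

```python
import tracemalloc

tracemalloc.start()
before = tracemalloc.take_snapshot()

for _ in range(10):
    lig.attribute(
        inputs=enc["input_ids"],
        baselines=baseline_ids,
        additional_forward_args=(enc["attention_mask"],),
        target=1,
    )

after = tracemalloc.take_snapshot()
# Top Python-level allocation growth between the two snapshots.
for stat in after.compare_to(before, "lineno")[:10]:
    print(stat)
```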

@jakobamb

Hi @NarineK, any news on this? I am experiencing similar issues with a transformer model. Are you planning on looking into this? I could try to create a minimal example if this helps.

@NarineK
Contributor

NarineK commented Apr 14, 2022

@jakobamb, if you could send me a minimal example, that would help me with the debugging. Thank you!
