
2.12.0: memory leak in TFLite's tflite::Interpreter::Invoke() #66736

Closed
gestalone opened this issue Apr 30, 2024 · 12 comments
Assignees
Labels: comp:lite (TF Lite related issues), TF 2.12 (for issues related to Tensorflow 2.12), TFLiteGpuDelegate (TFLite GPU delegate issue), type:bug (Bug)

Comments

@gestalone

Issue type

Bug

Have you reproduced the bug with TensorFlow Nightly?

No

Source

binary

TensorFlow version

2.12.0

Custom code

Yes

OS platform and distribution

Cross-build from 'Windows:x86_64' to 'Android:armv8'

Mobile device

Android with Snapdragon 820

Python version

No response

Bazel version

No response

GCC/compiler version

CXX compiler identification is Clang 14.0.7

CUDA/cuDNN version

no

GPU model and memory

Snapdragon 820 with Adreno 530

Current behavior?

Running the invoke for a tflite model using the gpu delegate, with opencl backend.
It runs fast and well; the problem is that there is a memory leak that keeps increasing, and I am not sure how to fix it. I am not sure whether it is an error in the OpenCL implementation, in the Adreno GPU drivers, or in the delegate implementation.

Standalone code to reproduce the issue

#include <tensorflow/lite/model.h>
#include <tensorflow/lite/interpreter.h>
#include <tensorflow/lite/kernels/register.h>
#include <tensorflow/lite/delegates/gpu/delegate.h>
#include <tensorflow/lite/c/common.h>

std::unique_ptr<tflite::FlatBufferModel> m_model;
std::unique_ptr<tflite::Interpreter> m_interpreter;

// m_modelName points at the model's flatbuffer data; m_bufferSize is its size in bytes.
m_model = tflite::FlatBufferModel::BuildFromBuffer(m_modelName, m_bufferSize);

tflite::ops::builtin::BuiltinOpResolver resolver;
tflite::InterpreterBuilder(*m_model, resolver)(&m_interpreter);

TfLiteGpuDelegateOptionsV2 gpu_options = TfLiteGpuDelegateOptionsV2Default();
auto delegategpu = tflite::Interpreter::TfLiteDelegatePtr(
    TfLiteGpuDelegateV2Create(&gpu_options), &TfLiteGpuDelegateV2Delete);

m_interpreter->ModifyGraphWithDelegate(std::move(delegategpu));
m_interpreter->AllocateTensors();

for (int i = 0; i < 1000; i++)
{
  m_interpreter->Invoke();
}

Relevant log output

No response

@gestalone
Author

I am able to reproduce this with the android_aarch64_benchmark_model benchmark from the TF nightly build.

@gestalone
Author

@sushreebarsa can you take a look?

@sushreebarsa sushreebarsa added comp:lite TF Lite related issues TF 2.12 For issues related to Tensorflow 2.12 labels May 6, 2024
@sushreebarsa
Contributor

@gestalone Could you please upgrade to the latest TF version, as memory leak issues are often addressed in subsequent releases? Kindly let us know whether the leak still appears in the latest version, and please check whether your delegate setup supports alternative backends besides OpenCL.
Thank you!

@sushreebarsa sushreebarsa added the stat:awaiting response Status - Awaiting response from author label May 8, 2024
@gestalone
Author

Hi @sushreebarsa, I was able to reproduce it with the 2.16 version of the benchmark. I tried the OpenGL backend of the GPU delegate, but unfortunately it does not work.
It is quite easy to test: just run the Android benchmark with any TensorFlow-approved model using the GPU delegate.
The tutorial can be followed from here:
https://www.tensorflow.org/lite/performance/measurement
I think this should be fixed, as memory leaks cause quite a lot of issues.

@google-ml-butler google-ml-butler bot removed the stat:awaiting response Status - Awaiting response from author label May 13, 2024
@gestalone
Author

@sawantkumar, any help here?

@sawantkumar

Hi @gestalone ,

Sure thing, let me replicate your issue. However, GPU delegates today primarily use OpenCL as their backend instead of OpenGL. I will get back to you.

@gestalone
Author

gestalone commented May 21, 2024

Hi @sawantkumar!
The issue I had was with the OpenCL backend; I was not able to use OpenGL.

@sawantkumar

sawantkumar commented May 23, 2024

Hi @gestalone ,

I used android_aarch64_benchmark_model on a Pixel 6a to test a tflite model with the command below:

adb shell am start -S \
  -n org.tensorflow.lite.benchmark/.BenchmarkModelActivity \
  --es args '"--graph=/data/local/tmp/efficientnet.tflite \
              --use_gpu=true \
              --num_runs=1000 \
              --report_peak_memory_footprint=true \
              --max_secs=30 \
              --gpu_backend=cl \
              --num_threads=4"'

I used the Android Studio profiler to check the memory used by the "tflite benchmark activity" process, and it didn't show any memory leaks. Memory usage spiked up to 130 MB while the benchmark tool was running, but it came back to normal once the benchmarking was complete. Can you please try your code on a different phone and let me know whether you can replicate the issue there? Also, if possible, can you provide your tflite model to make debugging easier?

@sawantkumar sawantkumar added TFLiteGpuDelegate TFLite Gpu delegate issue stat:awaiting response Status - Awaiting response from author labels May 23, 2024
@gestalone
Author

Hi! I am using a different device and I am not able to reproduce, so I guess it is device related. Maybe the OpenCL drivers? I will check.
Both devices were run with the same command.

@google-ml-butler google-ml-butler bot removed the stat:awaiting response Status - Awaiting response from author label May 23, 2024
@gestalone
Author

I was looking into it a bit and I found this:
https://developer.qualcomm.com/forum/qdn-forums/software/adreno-gpu-sdk/35473
For the model in question, I will try to change the OpenCL build options for the Adreno 530.
I will let you know.

@gestalone
Author

@sawantkumar Hi!
I found the issue. It seems the Snapdragon Profiler program that I use to check the memory interacts badly with the OpenCL runtime. I checked with a different method of measuring memory and the leak is not reproducible. So I guess the issue was running the benchmark (and my code) under the Snapdragon Profiler.
A really strange interaction; it should be reported to Qualcomm.

Best, you can close this, and thanks for all the help.

3 participants