
Android Tflite model fails to load on GPU Delegate: CL_OUT_OF_HOST_MEMORY #68470

Open
filip-halt opened this issue May 22, 2024 · 6 comments
Assignees: sawantkumar
Labels: comp:lite, TF 2.16, TFLiteGpuDelegate, type:bug

Comments

@filip-halt

Issue type

Bug

Have you reproduced the bug with TensorFlow Nightly?

Yes

Source

source

TensorFlow version

org.tensorflow:tensorflow-lite:2.16.1

Custom code

Yes

OS platform and distribution

Android

Mobile device

Samsung S23

Python version

No response

Bazel version

No response

GCC/compiler version

No response

CUDA/cuDNN version

No response

GPU model and memory

No response

Current behavior?

I am currently trying to get a larger model to load on an S23, but I am running into OOM errors. When initializing an Interpreter using a GpuDelegate with the factory options returned by CompatibilityList.getBestOptionsForThisDevice(), the Interpreter crashes with "Failed to apply delegate: Failed to build program executable - Out of host memoryError: Program not built!". This appears to come from an OpenCL error that is handled with:

case CL_OUT_OF_HOST_MEMORY:

My best guess is that this is due to hitting the 512 MB Dalvik heap limit that Runtime.maxMemory() reports on my device. I profiled the memory usage and it seems to crash around the 450 MB mark. Does TFLite on Android not use native memory to get around this? I seem to recall people getting 1 GB+ models running on their devices. Perhaps it is a build step that goes over the limit, and once built the model would be offloaded to native memory?

Note: I am using pyjnius to do this, which might be causing problems, but I doubt that is the cause.
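
For reference, here is a minimal pyjnius sketch of the initialization path described above. This is not the actual repro code; the class names follow the TFLite Java API, and the model path is illustrative.

from jnius import autoclass

File = autoclass('java.io.File')
Runtime = autoclass('java.lang.Runtime')
Interpreter = autoclass('org.tensorflow.lite.Interpreter')
InterpreterOptions = autoclass('org.tensorflow.lite.Interpreter$Options')
GpuDelegate = autoclass('org.tensorflow.lite.gpu.GpuDelegate')
CompatibilityList = autoclass('org.tensorflow.lite.gpu.CompatibilityList')

# The Dalvik heap limit discussed above (reported as 512 MB on this device).
print('Runtime.maxMemory():', Runtime.getRuntime().maxMemory())

compat_list = CompatibilityList()
options = InterpreterOptions()
if compat_list.isDelegateSupportedOnThisDevice():
    # Factory options recommended for this device.
    delegate_options = compat_list.getBestOptionsForThisDevice()
    options.addDelegate(GpuDelegate(delegate_options))

# The "Failed to apply delegate ... Out of host memory" error is raised while
# this constructor applies the GPU delegate to the graph.
interpreter = Interpreter(File('/sdcard/model.tflite'), options)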

Standalone code to reproduce the issue

Not sure how useful.

Relevant log output

05-22 16:31:08.869 23806 23859 I python  :  jnius.jnius.JavaException: JVM exception occurred: Internal error: Failed to apply delegate: Failed to build program executable - Out of host memoryError: Program not built!
05-22 16:31:08.869 23806 23859 I python  :  Falling back to OpenGL
05-22 16:31:08.869 23806 23859 I python  :  TfLiteGpuDelegate Init: No shader implementation for transpose
05-22 16:31:08.869 23806 23859 I python  :  TfLiteGpuDelegate Prepare: delegate is not initialized
05-22 16:31:08.869 23806 23859 I python  :  Node number 2612 (TfLiteGpuDelegateV2) failed to prepare.
google-ml-butler bot added the type:bug label on May 22, 2024
tilakrayal added the TF 2.16, comp:lite, and TFLiteGpuDelegate labels on May 23, 2024
tilakrayal assigned sawantkumar and unassigned tilakrayal on May 23, 2024
@sawantkumar

Hi @filip-halt,

Can you please provide the tflite model file so that I can replicate the issue?

sawantkumar added the stat:awaiting response label on May 23, 2024
@filip-halt
Author

> Hi @filip-halt,
>
> Can you please provide the tflite model file so that I can replicate the issue?

It was too large to attach directly to this issue, so you can find a copy here: https://github.com/filip-halt/tflite_bug

google-ml-butler bot removed the stat:awaiting response label on May 23, 2024
@filip-halt
Author

filip-halt commented May 24, 2024

It turns out that this is most likely due to a Conv2DTranspose layer in the model. I was under the impression that Conv2DTranspose was supported, but I could be wrong.

Another interesting thing that happens is that when you convert the model with:

converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]

the resulting binary is twice as large as the float32 version and about 20% slower on mobile. When I inspected the graph with Netron, it looks like nothing was converted to float16, not even the Conv2Ds.
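
For context, the two lines above sit inside the usual converter flow. A minimal end-to-end float16 conversion sketch (the saved-model path is illustrative) looks roughly like this:

import tensorflow as tf

# Illustrative path; the actual model for this issue is linked above.
converter = tf.lite.TFLiteConverter.from_saved_model('/path/to/saved_model')

# With these two settings the converter is expected to store weights as float16.
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]

tflite_model = converter.convert()
with open('model_fp16.tflite', 'wb') as f:
    f.write(tflite_model)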

@sawantkumar

Hi @filip-halt,

I ran your model with the GPU delegate on a Dimensity 9000 and it ran fine without any issues. Can you please try it on a different device and let me know whether it works there? For reference, the list of supported TFLite operators is here, and TRANSPOSE_CONV is on it.

sawantkumar added the stat:awaiting response label on May 27, 2024

github-actions bot commented Jun 4, 2024

This issue is stale because it has been open for 7 days with no activity. It will be closed if no further activity occurs. Thank you.

github-actions bot added the stale label on Jun 4, 2024
@filip-halt
Author

I believe this is a grouped TRANSPOSE_CONV conversion problem. TensorFlow seems to barely support it, and it is what breaks when converting from ONNX to TF. The default conversion creates a large number of layers that ultimately cause an OOM on the phone when the model loads.

[attached image]
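
For illustration, when there is no native grouped TRANSPOSE_CONV, the lowering typically becomes a Split, one conv2d_transpose per group, and a Concat, which is where the explosion in layer count comes from. The helper below is a hypothetical sketch of that expansion, not the converter's actual code.

import tensorflow as tf

def grouped_conv2d_transpose(x, kernels, stride=2):
    # x: [N, H, W, C] input; kernels: one [kh, kw, out_ch_per_group, C/groups]
    # filter per group. Emulates a grouped transposed conv by splitting the
    # channels, running one conv2d_transpose per group, and concatenating.
    groups = len(kernels)
    xs = tf.split(x, groups, axis=-1)
    outs = []
    for xi, ki in zip(xs, kernels):
        n, h, w, _ = xi.shape
        out_shape = [n, h * stride, w * stride, ki.shape[2]]
        outs.append(tf.nn.conv2d_transpose(
            xi, ki, out_shape, strides=[1, stride, stride, 1], padding='SAME'))
    return tf.concat(outs, axis=-1)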

google-ml-butler bot removed the stale and stat:awaiting response labels on Jun 6, 2024