Interpreter API (Java) - GpuDelegateV2 support #65114

cfasana · 2024-04-05T10:11:23Z

Hi,
I am trying to run a TFLite model on the GPU of an Android device.
According to this documentation, it is possible to use both the Interpreter API and the Native c++ API to achieve this.

At the moment, I am using the following dependencies:

implementation 'org.tensorflow:tensorflow-lite:2.15.0'
implementation 'org.tensorflow:tensorflow-lite-select-tf-ops:2.15.0'
implementation 'org.tensorflow:tensorflow-lite-support:0.4.4'
implementation 'org.tensorflow:tensorflow-lite-gpu:2.15.0'
implementation 'org.tensorflow:tensorflow-lite-gpu-api:2.15.0'
implementation 'org.tensorflow:tensorflow-lite-gpu-delegate-plugin:0.4.4'

I was able to successfully run my model using the GPUDelegate provided by the Java Interpreter API. However, this delegate does not allow to specify inference priority options (TFLITE_GPU_INFERENCE_PRIORITY_MIN_LATENCY, TFLITE_GPU_INFERENCE_PRIORITY_MIN_MEMORY_USAGE, TFLITE_GPU_INFERENCE_PRIORITY_MAX_PRECISION).

These options can be specified if the Native C++ API is used given the presence of GpuDelegateV2. However, at the moment I don't see this option in the Interpreter API since there is no class named GpuDelegateV2.

Is there a way to make use of this new delegate without the need of using the Native C++ API?

The text was updated successfully, but these errors were encountered:

LakshmiKalaKadali · 2024-04-12T05:50:17Z

Hi @cfasana,

The Java Interpreter API currently doesn't have GpuDelegateV2 directly. However if you would like to achieve the faster inference speed you can use GpuDelegate class by setting the isPrecisionLossAllowed flag to true in the following way as a workaround. But for memory usage and max precision, feature requests will be raised. Thanks for letting us know.

GpuDelegateOptions options = new GpuDelegateOptions();
options.isPrecisionLossAllowed = true;  
GpuDelegate gpuDelegate = new GpuDelegate(options);
InterpreterOptions interpreterOptions = new InterpreterOptions();
interpreterOptions.addDelegate(gpuDelegate);
Interpreter interpreter = new Interpreter(modelBuffer, interpreterOptions);

or
Also try with tflite_flutter library which will provide access to GpuDelegateV2 through DART API.

Hi @pkgoogle,
As @cfasana mentioned, GpuDelegateV2 need to be included in Java Interpreter API with the support of (TFLITE_GPU_INFERENCE_PRIORITY_MIN_LATENCY, TFLITE_GPU_INFERENCE_PRIORITY_MIN_MEMORY_USAGE, TFLITE_GPU_INFERENCE_PRIORITY_MAX_PRECISION). Raised a feature request.

Thank You

cfasana · 2024-04-12T06:38:22Z

Hi @LakshmiKalaKadali,
thanks for the feedback.
I will proceed as you suggested while awaiting the Java Interpreter API update.

pkgoogle · 2024-04-12T18:04:09Z

Hi @sirakiin, can you please take a look a this feature request? Thanks.

google-ml-butler bot assigned SuryanarayanaY Apr 5, 2024

SuryanarayanaY assigned LakshmiKalaKadali and unassigned SuryanarayanaY Apr 6, 2024

LakshmiKalaKadali added comp:lite TF Lite related issues TFLiteGpuDelegate TFLite Gpu delegate issue TF 2.15 For issues related to 2.15.x type:feature Feature requests labels Apr 8, 2024

LakshmiKalaKadali assigned pkgoogle and unassigned LakshmiKalaKadali Apr 12, 2024

pkgoogle assigned sirakiin Apr 12, 2024

pkgoogle added Android stat:awaiting tensorflower Status - Awaiting response from tensorflower labels Apr 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interpreter API (Java) - GpuDelegateV2 support #65114

Interpreter API (Java) - GpuDelegateV2 support #65114

cfasana commented Apr 5, 2024

LakshmiKalaKadali commented Apr 12, 2024

cfasana commented Apr 12, 2024

pkgoogle commented Apr 12, 2024

Interpreter API (Java) - GpuDelegateV2 support #65114

Interpreter API (Java) - GpuDelegateV2 support #65114

Comments

cfasana commented Apr 5, 2024

LakshmiKalaKadali commented Apr 12, 2024

cfasana commented Apr 12, 2024

pkgoogle commented Apr 12, 2024