
Build with XNNPACK #36

Closed · asus4 opened this issue Jul 20, 2020 · 11 comments · Fixed by #214

Labels: enhancement (New feature or request)

Comments

asus4 (Owner) commented Jul 20, 2020

https://github.com/tensorflow/tensorflow/tree/master/tensorflow/lite/delegates/xnnpack

XNNPACK is a highly optimized library of floating-point neural network inference operators for ARM, x86, and WebAssembly architectures in Android, iOS, Windows, Linux, macOS, and Emscripten environments. This document describes how to use the XNNPACK library as an inference engine for TensorFlow Lite.
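
As a concrete illustration, here is a minimal sketch of enabling the XNNPACK delegate through the TensorFlow Lite C API. The model path, thread count, and omitted error handling are placeholders, and the exact option fields can vary between TF Lite versions:

```c
#include "tensorflow/lite/c/c_api.h"
#include "tensorflow/lite/delegates/xnnpack/xnnpack_delegate.h"

int main(void) {
  TfLiteModel* model = TfLiteModelCreateFromFile("model.tflite");

  // Create the XNNPACK delegate with default options.
  TfLiteXNNPackDelegateOptions xnnpack_options =
      TfLiteXNNPackDelegateOptionsDefault();
  xnnpack_options.num_threads = 4;  // placeholder: tune for the target CPU
  TfLiteDelegate* xnnpack_delegate =
      TfLiteXNNPackDelegateCreate(&xnnpack_options);

  // Register the delegate before creating the interpreter, so that
  // supported float ops are handed off to XNNPACK.
  TfLiteInterpreterOptions* options = TfLiteInterpreterOptionsCreate();
  TfLiteInterpreterOptionsAddDelegate(options, xnnpack_delegate);

  TfLiteInterpreter* interpreter = TfLiteInterpreterCreate(model, options);
  TfLiteInterpreterAllocateTensors(interpreter);
  // ... fill input tensors, TfLiteInterpreterInvoke(interpreter),
  //     read output tensors ...

  // The delegate must outlive the interpreter and be freed explicitly.
  TfLiteInterpreterDelete(interpreter);
  TfLiteInterpreterOptionsDelete(options);
  TfLiteXNNPackDelegateDelete(xnnpack_delegate);
  TfLiteModelDelete(model);
  return 0;
}
```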

asus4 added the enhancement label Jul 20, 2020
TheBricktop commented

Could this boost performance?
I've tried MediaPipe Hands on GPU on Android (S8) and it gets about 20-25 fps, but the current hands implementation is very slow on PC (i7, 16 GB RAM, GTX 960 Ti).

asus4 (Owner) commented Oct 28, 2020

Hi @TheBricktop, this is because the GPU delegate (the GPU acceleration feature of TensorFlow Lite) is not supported on Windows at the moment. I would like to support it eventually.

TheBricktop commented

What would one need to do to implement it? Recompile the TensorFlow Lite library?

asus4 (Owner) commented Oct 28, 2020

It does not appear to be officially supported:
tensorflow/tensorflow#40325

TheBricktop commented

So the performance would be higher on Android?

asus4 (Owner) commented Oct 28, 2020

Yes. If you are interested in using MediaPipe without the GPU delegate, please refer to XNNPACK (this issue) or the integer quantized models.

TheBricktop commented

Would exporting the TFLite models to ONNX and running them in Barracuda improve performance?

asus4 (Owner) commented Nov 2, 2020

Yes, if all ops are supported in Barracuda, it could improve performance. Please refer to the list of supported ops in Barracuda.

asus4 (Owner) commented Feb 3, 2021

XNNPACK options are now enabled in the v2.4 libraries.

asus4 closed this as completed Feb 3, 2021
tonysung commented

@asus4 I suspect XNNPACK is not correctly enabled, based on the following observations:

  1. The CPU-mode performance of tf-lite-unity-sample is currently closer to what I get from the official benchmark tool (https://www.tensorflow.org/lite/performance/measurement) with use_xnnpack=false.
  2. From the TensorFlow source code, I think TfLiteXNNPackDelegateCreate must be called for XNNPACK to actually be enabled. I don't see it being called anywhere in tf-lite-unity-sample.
  3. When running the official benchmark tool with use_xnnpack=true, there is a line in the log that says "Created TensorFlow Lite XNNPACK delegate for CPU." That line is missing when tf-lite-unity-sample is run.

Would you like to take a look?
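
For reference, observation 3 can be reproduced with the official benchmark tool; a hypothetical invocation is sketched below (the binary name and paths depend on how the tool was built or downloaded):

```sh
./benchmark_model --graph=model.tflite --use_xnnpack=true --num_threads=4
# When the delegate is active, the log contains:
#   "Created TensorFlow Lite XNNPACK delegate for CPU."
```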

asus4 (Owner) commented May 2, 2022

Thanks, @tonysung

I mistakenly thought it would be automatically enabled in CPU mode. I will add the XNNPACK delegate.

https://github.com/tensorflow/tensorflow/blob/3f878cff5b698b82eea85db2b60d65a2e320850e/tensorflow/lite/delegates/xnnpack/xnnpack_delegate.h#L48
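
The linked header declares the C entry points involved; a paraphrased excerpt follows (export macros omitted, see the header for the exact signatures in the version in use):

```c
// Returns a struct with the default XNNPACK delegate options.
TfLiteXNNPackDelegateOptions TfLiteXNNPackDelegateOptionsDefault();

// Creates a delegate instance; it must be destroyed with
// TfLiteXNNPackDelegateDelete once it is no longer used.
TfLiteDelegate* TfLiteXNNPackDelegateCreate(
    const TfLiteXNNPackDelegateOptions* options);

// Destroys a delegate created with TfLiteXNNPackDelegateCreate.
void TfLiteXNNPackDelegateDelete(TfLiteDelegate* delegate);
```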

asus4 reopened this May 2, 2022
asus4 closed this as completed in #214 May 6, 2022