
the inference time of tflite_quant is larger than tflite #23

Closed
youngboy52 opened this issue Dec 2, 2020 · 1 comment

Comments

@youngboy52

I used the provided model files (model_1.tflite, model_2.tflite, model_quant_1.tflite and model_quant_2.tflite) and the script "real_time_processing_tf_lite.py" to compare inference times.
My setup: Ubuntu 18.04, TF 2.0.
The processing times are as follows:
TF-lite: 0.383403 ms; TF-lite quantized: 0.4470351 ms
It seems abnormal that the quantized TF-lite models are slower than the float TF-lite models at inference. I also noticed that the script requires TF 2.3.0 to run the TFLite models. Does that mean TF 2.0 has some limitations with your script? Looking forward to your reply.
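For reference, this is roughly how I measure the per-invocation time (a minimal sketch, not the repo's script; the model file names and run count here are my own choices):

```python
import time
import numpy as np
import tensorflow as tf

def time_tflite(path, runs=100):
    # Load the model and allocate tensors once, outside the timed loop.
    interpreter = tf.lite.Interpreter(model_path=path)
    interpreter.allocate_tensors()
    inputs = interpreter.get_input_details()
    # Zero-filled dummy inputs matching each input tensor's shape and dtype.
    dummies = [np.zeros(d["shape"], dtype=d["dtype"]) for d in inputs]
    # Warm-up run so one-time setup cost is not measured.
    for d, x in zip(inputs, dummies):
        interpreter.set_tensor(d["index"], x)
    interpreter.invoke()
    start = time.perf_counter()
    for _ in range(runs):
        for d, x in zip(inputs, dummies):
            interpreter.set_tensor(d["index"], x)
        interpreter.invoke()
    return (time.perf_counter() - start) / runs * 1000.0  # ms per invoke

for name in ["model_1.tflite", "model_quant_1.tflite"]:
    print(f"{name}: {time_tflite(name):.4f} ms")
```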

@wwbnjsace

I use the ONNX model for inference, but it has very high CPU usage. How can I get a lower CPU usage?
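Not from this repo, but a common cause: ONNX Runtime spins up a thread pool sized to the machine's cores by default, which can show up as very high CPU usage for a small real-time model. A hedged sketch of capping the thread pools (the model path here is a placeholder):

```python
import onnxruntime as ort

opts = ort.SessionOptions()
opts.intra_op_num_threads = 1  # threads used inside a single operator
opts.inter_op_num_threads = 1  # threads used across independent operators
sess = ort.InferenceSession("model_1.onnx",  # placeholder path
                            sess_options=opts,
                            providers=["CPUExecutionProvider"])
```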
