
The inference speed using the old version is 5-8 times faster than the latest version #202

Open
yingeo opened this issue Mar 8, 2024 · 7 comments

Comments

@yingeo

yingeo commented Mar 8, 2024


@alexw92

alexw92 commented Mar 15, 2024

I also noticed inference time increasing significantly, which makes live detection with anything beyond a super-primitive model very sluggish. Do you remember which version the inference time started to increase in, @yingeo?

@timukasr

I looked into it and it seems the issue is in Flutter itself. I forked an older repo and tested it:

Environment

This was tested on the following environment:

  • Built on Windows 10 with Java 11.0.12
  • Run on Pixel 5 (Android 14)

I just repeatedly detected the same blank(ish) camera image. It's not too scientific (e.g. the phone might heat up and produce lower numbers), so a few ms of difference should not be taken too seriously, but the 7x or 16x difference is significant.
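For anyone who wants to reproduce this kind of measurement, the numbers below can be collected with a simple Stopwatch around the interpreter call. This is a minimal sketch, not the benchmark code from my repo; it assumes a loaded tflite_flutter Interpreter, and `input`/`output` are placeholder buffers that must match the model's tensor shapes:

```dart
import 'package:tflite_flutter/tflite_flutter.dart';

/// Rough inference benchmark: averages [runs] calls to Interpreter.run.
/// `input` and `output` are placeholders and must be shaped for the model;
/// a few warm-up runs keep delegate initialization out of the average.
void benchmark(Interpreter interpreter, Object input, Object output,
    {int warmup = 5, int runs = 50}) {
  for (var i = 0; i < warmup; i++) {
    interpreter.run(input, output);
  }
  final sw = Stopwatch()..start();
  for (var i = 0; i < runs; i++) {
    interpreter.run(input, output);
  }
  sw.stop();
  final avgMs = sw.elapsedMicroseconds / runs / 1000;
  print('avg inference: ${avgMs.toStringAsFixed(1)} ms');
}
```

Running the same function on the same device under two Flutter versions is enough to show the gap.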

Results

Average time until the numbers settle (times in ms):

Test                                      Inference   Pre-processing   Total predict   Total elapsed
tflite_flutter 0.9.5  / Flutter 3.13.9        22            17               39              51
tflite_flutter 0.9.5  / Flutter 3.16.9       359            18              378             387
tflite_flutter 0.10.4 / Flutter 3.13.9        22            15               38              50
tflite_flutter 0.10.4 / Flutter 3.16.9       351            16              368             376

Conclusion

  • No difference between tflite_flutter 0.9.5 and 0.10.4
  • Big difference between Flutter 3.13.9 and Flutter 3.16.9 - 16x slower inference time and 7.5x slower total time.

Flutter 3.19.3 seemed to be as slow as 3.16.9, so it appears the problem is not yet fixed.

My repo: https://github.com/timukasr/object_detection_flutter

Similar results can be reproduced with the tflite_flutter sample app live_object_detection_ssd_mobilenet. That app seems a bit broken - it does not display results on screen - but it does report inference time. With Flutter 3.13.9 it was 170-200 ms; with Flutter 3.19.3 it was 500-550 ms. For some reason it is slow even with the older Flutter, but even slower with the newer one.

@alexw92

alexw92 commented Mar 16, 2024

@timukasr Thanks a lot for your insights! I didn't even try different Flutter versions. I noticed an inference-time increase just from changing the tflite_flutter version, when I upgraded to 0.10.4 (from 0.10.0) I think. I will continue to investigate when I find time.

@alexw92

alexw92 commented Mar 16, 2024

I take it all back: I was not able to reproduce significant inference differences across the 0.10.x versions. The massive slowdown between different Flutter versions should still be addressed at some point.

@yingeo

yingeo commented Mar 19, 2024

It has nothing to do with the version of tflite_flutter, only with the version of Flutter. Currently I use tflite_flutter 0.10.4 with Flutter 3.7.12, and the inference time is consistent with native Android speed. If you upgrade Flutter to the latest version, the inference speed becomes terribly slow.
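For anyone who wants to guard against accidentally building with a Flutter version beyond a known-good one, the Flutter SDK can be constrained in pubspec.yaml. This is just a sketch; the constraint values here are illustrative, not a recommendation:

```yaml
# pubspec.yaml (fragment) - restricts builds to a known-good Flutter version.
# The exact version values are placeholders.
environment:
  sdk: ">=2.19.0 <3.0.0"
  flutter: "3.7.12"
```

With this in place, `flutter pub get` fails on a mismatched SDK instead of silently building with a slower Flutter.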

@yanghoonkim

@yingeo Did you compare 3.7.12 with other versions (such as 3.13.9)? I'm having a hard time after downgrading to 3.7.12 (several configs had to be changed, lol).

@yanghoonkim

yanghoonkim commented May 3, 2024

I tried tflite_flutter 0.10.4 + Flutter 3.13.9 + video classification (MoviNet) + isolateInterpreter.runForMultipleInputs, and I couldn't find any difference between 3.13.9 and the latest version of Flutter.

My conclusion is that tflite_flutter is currently not optimized for some models/operators: one inference step (on a single frame image) takes more than 3000 ms on a Samsung Galaxy S8 and more than 2000 ms on a Samsung Galaxy Z Flip 3.
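For context, the call pattern described above looks roughly like this. This is a sketch assuming tflite_flutter 0.10.x; the model path and the tensor shapes are placeholders, not the actual MoviNet setup:

```dart
import 'package:tflite_flutter/tflite_flutter.dart';

Future<void> classifyFrame(Object frame) async {
  // 'assets/movinet.tflite' is a placeholder path, not the actual model file.
  final interpreter = await Interpreter.fromAsset('assets/movinet.tflite');

  // Wrap the interpreter so inference runs off the UI thread.
  final isolateInterpreter =
      await IsolateInterpreter.create(address: interpreter.address);

  // Output buffers keyed by output tensor index; a 600-class score vector
  // is a placeholder shape, not MoviNet's real output signature.
  final outputs = <int, Object>{
    0: [List<double>.filled(600, 0.0)],
  };
  await isolateInterpreter.runForMultipleInputs([frame], outputs);
  print(outputs[0]);
}
```

Even with the isolate-backed interpreter the heavy work is the same native inference, so the per-frame cost reported above is dominated by the model/operator support, not the isolate overhead.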
