
models works on float32 instead of uint8 #10760

Open · naarkhoo opened this issue Sep 2, 2022 · 6 comments

Comments
@naarkhoo commented Sep 2, 2022

Hi,

I am in the process of training my SSD model based on ssd_mobilenet_v2_320x320_coco17_tpu, and I noticed the model works on float32 and not uint8. I am curious how I can make that change?

I would also appreciate it if you could point me to other tricks for making my model run faster at inference time, for example a larger kernel size, a shallower model, or some threshold? I feel these recommendations/explanations would be helpful when it comes to optimization.

Here is the link to the Colab notebook: https://drive.google.com/file/d/1iqUgeabbTgfixehGomDoj5eHGfHd8Lvt/view?usp=sharing

@sushreebarsa self-assigned this Sep 7, 2022
@sushreebarsa (Contributor) commented
@naarkhoo
In order to expedite the troubleshooting process, could you please provide the full URL of the repository you are using, and more details on the issue reported here? Thank you!

@sushreebarsa added the stat:awaiting response label Sep 7, 2022
@google-ml-butler (bot) commented
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you.

@naarkhoo (Author) commented

Sorry for my late reply. Here is the Colab: https://drive.google.com/file/d/1iqUgeabbTgfixehGomDoj5eHGfHd8Lvt/view?usp=sharing. I have made sure you have access to the files.

With the current code, the model latency on Android devices (an average device) is 150 ms; my goal is to get the model to run at 50 ms. It seems I have to make sure the model works with the uint8 data type.
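One quick way to confirm which data type a converted TFLite model currently expects is to inspect its input details with tf.lite.Interpreter. This is a minimal sketch, assuming the notebook has already exported a .tflite file; the path below is a placeholder, not a name from the notebook:

```python
import tensorflow as tf

# Placeholder path; point this at the .tflite file exported in the notebook.
interpreter = tf.lite.Interpreter(model_path="model.tflite")
interpreter.allocate_tensors()

input_details = interpreter.get_input_details()[0]
# A float model reports numpy.float32 here; a fully integer-quantized model
# reports numpy.uint8 (or numpy.int8, depending on conversion settings).
print(input_details["dtype"])
```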

@google-ml-butler (bot) removed the stat:awaiting response and stale labels Sep 14, 2022
@sushreebarsa added the models:official label Oct 3, 2022
@laxmareddyp self-assigned this Nov 21, 2022
@saberkun (Member) commented

@jaeyounkim for TF-MOT problems

@jaeyounkim (Collaborator) commented

"ssd_mobilenet_v2_320x320_coco17_tpu" is what "TensorFlow Object Detection API" provides. It is not the model officially supported by the Model Garden team. Let me check if the TensorFlow Model Optimization Toolkit (https://github.com/tensorflow/model-optimization) team can provide some help.

@jaeyounkim added the models:research:odapi label and removed the models:official label Nov 28, 2022
@Petros626 commented
The model is not quantized, that's all. Read the name and compare it to the quantized model and you'll find the difference. You must do post-training quantization to get the required result.

Additionally, to make your model run faster you need a TFLite model and possibly a hardware accelerator like the Google Coral USB Accelerator.
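A minimal sketch of post-training full-integer quantization with the TFLite converter, assuming an exported SavedModel directory and a small calibration set; saved_model_dir and representative_images are placeholders, not names from the notebook:

```python
import tensorflow as tf

def representative_dataset():
    # Yield a few hundred preprocessed inputs matching the model's input
    # signature; the converter uses them to calibrate activation ranges.
    for image in representative_images:  # hypothetical iterable of float32 arrays
        yield [image[tf.newaxis, ...]]

converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
# Restrict conversion to integer ops so both weights and activations are
# quantized, and make the model's interface uint8 instead of float32.
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.uint8
converter.inference_output_type = tf.uint8

tflite_quant_model = converter.convert()
with open("model_quant_uint8.tflite", "wb") as f:
    f.write(tflite_quant_model)
```

Note that SSD models exported from the Object Detection API include a custom detection post-processing op, so the exact conversion flow may differ from this sketch. Whether quantization alone reaches the 50 ms target also depends on the device and delegate; it typically helps most on CPUs and integer accelerators such as the Coral Edge TPU, which additionally requires compiling the quantized model with the edgetpu_compiler.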
