
ImageEmbedderOption quantize behavior #5290

Closed
1cipher opened this issue Apr 2, 2024 · 5 comments
Assignees
Labels
platform:python (MediaPipe Python issues), platform:raspberry pi (Raspberry pi ARM), task:image embedding (Issues related to Image Embedding: Embed images into feature vectors), type:support (General questions)

Comments

1cipher commented Apr 2, 2024

Have I written custom code (as opposed to using a stock example script provided in MediaPipe)

Yes

OS Platform and Distribution

Raspberry pi 5

MediaPipe Tasks SDK version

No response

Task name (e.g. Image classification, Gesture recognition etc.)

ImageEmbedder

Programming Language and version (e.g. C++, Python, Java)

Python

Describe the actual behavior

Inference with the custom tflite model is slower with quantize set to False than with it set to True. In addition, with quantize = False the model outputs are floats, even though the actual model output is specified to be uint8.

Describe the expected behaviour

We expect that when quantize is False the computation should be faster, since no additional operation should be performed. We also expect a uint8 output, since our tflite model returns a uint8 embedding when called directly through the TFLite Interpreter.

Standalone code/steps you may have used to try to get what you need

# Initialize the image embedder (imports added for completeness;
# `model` is the path to the custom .tflite file, defined elsewhere).
import time

import cv2
import mediapipe as mp
import numpy as np
from mediapipe.tasks import python
from mediapipe.tasks.python import vision

base_options = python.BaseOptions(model_asset_path=model)
options = vision.ImageEmbedderOptions(base_options=base_options,
                                      running_mode=vision.RunningMode.IMAGE,
                                      l2_normalize=False, quantize=False)
detector = vision.ImageEmbedder.create_from_options(options)

# Load and preprocess a test image.
image = cv2.imread('carpet.png')
image = cv2.resize(image, (256, 256), interpolation=cv2.INTER_LINEAR)
image = cv2.flip(image, 1)
rgb_image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
print(rgb_image)
mp_image = mp.Image(image_format=mp.ImageFormat.SRGB, data=rgb_image)

for i in range(100):
    # Run the embedder and time each call.
    start = time.time()
    em = detector.embed(mp_image)
    print(time.time() - start)
    embed = em.embeddings[0].embedding
    print(np.unique(embed))  # here the output is float32, while our model gives uint8 results
    print(len(np.unique(embed)))

detector.close()

Other info / Complete Logs

No response
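For reference, the `quantize` option controls post-processing of the returned embedding (scalar quantization of the floats to bytes), not how inference itself runs. Below is a minimal sketch of what symmetric scalar quantization of a float embedding looks like; the exact scheme is an illustrative assumption, not MediaPipe's actual implementation:

```python
import numpy as np

def scalar_quantize(embedding: np.ndarray) -> np.ndarray:
    # Illustrative symmetric scheme: scale so the largest magnitude maps to 127.
    # Assumes a nonzero embedding.
    scale = 127.0 / np.max(np.abs(embedding))
    return np.clip(np.round(embedding * scale), -128, 127).astype(np.int8)

emb = np.array([0.5, -1.0, 0.25], dtype=np.float32)
print(scalar_quantize(emb))
```

The point is that quantization is extra work on top of the float embedding, which is consistent with `quantize=True` not being slower than `quantize=False`.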


1cipher commented Apr 2, 2024

Practically, my concern is to understand the implications of setting quantize to True or False in ImageEmbedderOptions when I provide a custom .tflite model that already produces a uint8 output tensor. In that case I would like to perform dequantization manually, since it appears that MediaPipe cannot read the .tflite quantization parameters.
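A sketch of that manual dequantization (the scale and zero point below are hypothetical placeholders; in practice they can be read from the model's output tensor via the TFLite Interpreter, e.g. `tf.lite.Interpreter(model_path=...).get_output_details()[0]["quantization"]`):

```python
import numpy as np

# Hypothetical quantization parameters for illustration only;
# read the real ones from the model's output tensor details.
scale, zero_point = 0.0078125, 128

def dequantize(embedding_u8: np.ndarray) -> np.ndarray:
    """Standard TFLite affine dequantization: real = scale * (q - zero_point)."""
    return scale * (embedding_u8.astype(np.int32) - zero_point)

raw = np.array([0, 128, 255], dtype=np.uint8)
print(dequantize(raw))  # floats recovered from the quantized values
```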

@1cipher 1cipher closed this as completed Apr 2, 2024
@1cipher 1cipher reopened this Apr 2, 2024
@kuaashish kuaashish assigned kuaashish and unassigned ayushgdev Apr 3, 2024
@kuaashish kuaashish added the platform:raspberry pi, task:image embedding, platform:python, and type:support labels Apr 3, 2024
@schmidt-sebastian (Collaborator) commented:

In our current pipeline, your model output has to match the output format of the models that our tasks are designed to handle. It seems pretty likely that we are just passing through quantized data from your model as floats. You might be able to read the data back as uint, but that is not officially supported.
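If the float values really are pass-through quantized integers, a guarded cast back to uint8 might look like the sketch below. This relies on the unsupported behavior described above, so it validates the values before casting:

```python
import numpy as np

def floats_to_uint8(embedding) -> np.ndarray:
    """Recover uint8 values that were passed through as floats.

    Unsupported-behavior workaround: only cast if every value is an
    integer in the uint8 range, otherwise fail loudly.
    """
    arr = np.asarray(embedding, dtype=np.float32)
    if not (np.all(arr == np.round(arr)) and arr.min() >= 0 and arr.max() <= 255):
        raise ValueError("values do not look like pass-through uint8 data")
    return arr.astype(np.uint8)

print(floats_to_uint8([0.0, 17.0, 255.0]))
```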

@kuaashish kuaashish added the stat:awaiting response (Waiting for user response) label Apr 19, 2024

This issue has been marked stale because it has had no activity for 7 days. It will be closed if no further activity occurs. Thank you.

@github-actions github-actions bot added the stale label Apr 27, 2024

github-actions bot commented May 4, 2024

This issue was closed due to lack of activity after being marked stale for the past 7 days.

@github-actions github-actions bot closed this as completed May 4, 2024

@kuaashish kuaashish removed the stat:awaiting response and stale labels May 6, 2024