When using Mediapipe Hands, it seems that mediapipe does not want to return landmarks about which it is unsure. This makes the tool barely useable for things like gesture recognition. Why don't you allow a setting in which ALL landmark guesses are returned (possible with a confidence value) so the end user can decide what to keep, what to correct and what to discard? #3871

jdambre · 2022-11-17T19:44:09Z

Please make sure that this is a feature request.

System information (Please provide as much relevant information as possible)

MediaPipe Solution (you are using):
Programming language : C++/typescript/Python/Objective C/Android Java
Are you willing to contribute it (Yes/No):

Describe the feature and the current behavior/state:

Will this change the current api? How?

Who will benefit with this feature?

Please specify the use cases for this feature:

Any Other info:

kuaashish · 2022-11-21T10:41:27Z

Hi @jdambre,
Thanks for raising this feature request! This looks like an interesting feature request. Would you also please share a use-case where this can be useful? Thank you!

google-ml-butler · 2022-11-28T10:52:14Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you.

jdambre · 2022-11-28T12:25:59Z

Hi @kuaashish: the fact that currently, no hand keypoints are returned very often currently makes MP unuseable 'in the wild' for tasks related to detailed recognition of hand movements.
This occurs mostly for fast hand movements or for occlusions, but also sometimes for no apparent reason when frames seem perfectly clear to us. This suggests that your current approach to decide which hand keypoints are 'not good enough' to return is not quite robust. In addition, in the case of occlusions, we could still use the visible keypoints in our models (e.g., swapping back to a 2D model instead of a 3D one).

If we would have all keypoints, we can make our own cutoff decisions, train our models robust against wrong predictions or train a model to correct them. So being able to turn off the 'cutoff' to alway return all keypoints is in fact our first feature request.

But since I assume MediaPipe uses an internal uncertainty measure to decide when NOT to return hands, this can just as well be given as an output, which would allow us to set our own threshold, treat inaccurate samples differently, or develop a a more targeted approach in making our models more robust.

kuaashish · 2023-01-31T09:52:07Z

Hi @bazarevsky,
Could you please look into this feature request? Thank you!

khanhlvg · 2023-02-01T15:29:27Z

The new MediaPipe now supports Gesture Recognition out of the box. Can you try it out and see if it fit your use case?
https://developers.google.com/mediapipe/solutions/vision/gesture_recognizer

kuaashish · 2023-06-16T11:25:04Z

Hello @jdambre,

Please go through the above comment. Thank you

jdambre · 2023-06-20T12:31:48Z

WE DO NOT NEED out of the box gesture recognition, we would very much appreciate keypoint confidences in order to be able to catch problematic cases more adequately and develop our own applications on top of mediapipe. As it stands now and since we receive no adequate reaction to any of our requests (there is at least one other thread related to failure cases), we see no other option than to develop our own keypoint extractor on top of more recent state-of-the-art tools!

kuaashish · 2023-06-21T13:07:20Z

@jdambre,

MP Tasks allows the user to configure detection thresholds.

kuaashish · 2023-10-11T12:26:02Z

@jdambre,

You're using an old version of MediaPipe's hand solution, and we no longer support it. Please check our guide for instructions on upgrading to the new Hands API, which also allows you to configure detection thresholds. We suggest switching to our new Hand Task API, as explained in the documentation here.

Additionally, you can find a code example in the same resource for your reference. If you face any issues with the upgraded API, please let us know. Thank you.

m-decoster · 2023-10-11T12:47:10Z

@kuaashish

It is true that is it possible to set a threshold on the palm detection, but this is different from obtaining confidence values. If I set a threshold of for example 0.1 and get 10 hands back from 15 images, I have very little information on the confidence of MediaPipe about these hands (only that the confidence should be higher than 0.1).

As far as I can tell from the documentation in the new solution, this confidence value remains an internal value in the Python API, which only returns handedness, 3D coordinates, and 3D world coordinates.

What @jdambre is requesting, is that the internal confidence value which is compared with the user-set threshold is also returned. I am referring here to this comment which mentions that you might as well return the internal confidence value to allow more flexibility in the API.

I hope this is clear.

kuaashish · 2023-10-16T08:52:19Z

Hi @m-decoster,

Thank you for providing additional information about this issue. We have marked it as a feature request and sharing it with the team. The team will prioritise the work based on our discussion.

jdambre added the type:feature Enhancement in the New Functionality or Request for a New Solution label Nov 17, 2022

kuaashish self-assigned this Nov 18, 2022

kuaashish added legacy:hands Hand tracking/gestures/etc task::all All tasks of MediaPipe labels Nov 18, 2022

kuaashish added the stat:awaiting response Waiting for user response label Nov 21, 2022

google-ml-butler bot added the stale label Nov 28, 2022

google-ml-butler bot removed stale stat:awaiting response Waiting for user response labels Nov 28, 2022

kuaashish assigned bazarevsky and unassigned kuaashish Jan 31, 2023

kuaashish added the stat:awaiting googler Waiting for Google Engineer's Response label Jan 31, 2023

kuaashish assigned kuaashish and unassigned bazarevsky Jun 16, 2023

kuaashish removed the stat:awaiting googler Waiting for Google Engineer's Response label Jun 16, 2023

kuaashish added the stat:awaiting response Waiting for user response label Jun 16, 2023

google-ml-butler bot removed the stat:awaiting response Waiting for user response label Jun 20, 2023

kuaashish assigned yichunk and unassigned kuaashish Jun 20, 2023

kuaashish added the stat:awaiting googler Waiting for Google Engineer's Response label Jun 20, 2023

kuaashish assigned kuaashish and unassigned yichunk Oct 11, 2023

kuaashish removed the stat:awaiting googler Waiting for Google Engineer's Response label Oct 11, 2023

kuaashish added the stat:awaiting response Waiting for user response label Oct 11, 2023

google-ml-butler bot removed the stat:awaiting response Waiting for user response label Oct 11, 2023

kuaashish assigned schmidt-sebastian and unassigned kuaashish Oct 16, 2023

kuaashish added the stat:awaiting googler Waiting for Google Engineer's Response label Oct 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jdambre commented Nov 17, 2022

kuaashish commented Nov 21, 2022

google-ml-butler bot commented Nov 28, 2022

jdambre commented Nov 28, 2022

kuaashish commented Jan 31, 2023

khanhlvg commented Feb 1, 2023

kuaashish commented Jun 16, 2023

jdambre commented Jun 20, 2023

kuaashish commented Jun 21, 2023

kuaashish commented Oct 11, 2023

m-decoster commented Oct 11, 2023 •

edited

kuaashish commented Oct 16, 2023

Comments