Detection Confidence Needed. #262

MagicFrogSJTU · 2021-04-01T03:46:16Z

The current code outputs grid coordinates as detection results without detection confidence. Therefore, the model often generates confusing detections for some edge-case images.
It is easy to get the face detection confidence, while it is hard to get the alignment confidence. I go through the code but it is not an easy job for new comers. Is there any approach?

1adrianb · 2021-04-01T10:44:18Z

You can use the value of the max as a network confidence measure. While this is not perfect it can be used to detect wrong points fairly accurate. To achieve this you can modify this function:

face-alignment/face_alignment/utils.py

Line 185 in 2bcfcc6

def get_preds_fromhm(hm, center=None, scale=None):

and return also the max value in addition to that of argmax.

MagicFrogSJTU · 2021-04-01T13:12:25Z

You can use the value of the max as a network confidence measure. While this is not perfect it can be used to detect wrong points fairly accurate. To achieve this you can modify this function:

face-alignment/face_alignment/utils.py

Line 185 in 2bcfcc6

def get_preds_fromhm(hm, center=None, scale=None):

and return also the max value in addition to that of argmax.

Thank you for your reply! I have already implemented the method. I wonder if there is any plan to add this as a formal feature?

1adrianb · 2021-04-02T20:54:39Z

There were a few similar questions in the past, so probably its worth adding it. Feel free to make a pull request.

MagicFrogSJTU · 2021-04-06T02:46:43Z

There were a few similar questions in the past, so probably its worth adding it. Feel free to make a pull request.

I will try to make a pull request in recent days!

MagicFrogSJTU · 2021-04-29T02:19:49Z

@1adrianb Very Sorry that these days I am quite occupied.

Before making a PR, should we discuss the API first? How about this:

Add a keyword: return_confidence=False
return landmark point confidence along with coordinates. shape_before: 68, 2. shape_after: 68, 3.

What do you think?

1adrianb · 2021-04-29T10:18:19Z

@MagicFrogSJTU No worries!

Agree
My only concern with this is that depending on the detection type (2D or 3D) the value of the 3rd column may change from representing the depth (for 3D points) to confidence for 2D. Perhaps returning a separate vector with 68 values is simpler?

MagicFrogSJTU · 2021-04-29T15:28:43Z

2. My only concern with this is that depending on the detection type (2D or 3D) the value of the 3rd column may change from representing the depth (for 3D points) to confidence for 2D. Perhaps returning a separate vector with 68 values is simpler?

My concern is:

For detected faces, both the confidence and the coordinates are combined in one variable. For instance, [p1_x, p1_y, p2_x, p2_y, confidence]. Thus, for landmarks, should we keep the same pattern?
If the last column represents landmark confidence, we can still make 2D and 3D compatible. For 2D, the 3rd column is for confidence. For 3D, the 4th column is for confidence.

What do you think?

MagicFrogSJTU · 2021-04-29T15:29:50Z

@MagicFrogSJTU No worries!

Agree

My only concern with this is that depending on the detection type (2D or 3D) the value of the 3rd column may change from representing the depth (for 3D points) to confidence for 2D. Perhaps returning a separate vector with 68 values is simpler?

I know very little about API designing, so maybe you are right. Just give me a final result and I will implement it!

1adrianb · 2021-04-29T19:18:46Z

Can we go with a separate array please? Could you also describe please both the new flag and the returned value in the function doc description? Thanks!

MagicFrogSJTU · 2021-04-30T02:00:30Z

Can we go with a separate array please? Could you also describe please both the new flag and the returned value in the function doc description? Thanks!

One last thing,
Which one to take?

return landmark, landmark_confidence, detected_faces
return (landmark, landmark_confidence), detected_faces

MagicFrogSJTU · 2021-04-30T09:42:07Z

def get_landmarks_from_image(self, image_or_path, detected_faces=None, return_bboxes=False, return_landmark_score=False,):
    """Predict the landmarks for each face present in the image.
    This function predicts a set of 68 2D or 3D images, one for each image present.
    If detect_faces is None the method will also run a face detector.
     Arguments:
        image_or_path {string or numpy.array or torch.tensor} -- The input image or path to it.
    Keyword Arguments:
        detected_faces {list of numpy.array} -- list of bounding boxes, one for each face found
        in the image (default: {None})
        return_bboxes {boolean} -- If True, return the face bounding boxes in addition to the keypoints.
        return_landmark_score {boolean} -- If True, return the keypoint scores along with the keypoints.
    Return:
        result:
            1. If both return_bboxes and return_landmark_score is True, result will be:
                (landmarks, landmarks_scores), detected_faces
            2. If only return_landmark_score is True, result will be:
                landmarks, landmarks_scores
            3. If only return_bboxes is True, result will be:
                landmarks, detected_faces
            4. Otherwise:
                landmarks
    """

It seems over complicated. Cause we will have a lot of combinations.

What about always keeping returning three objects landmark, landmark_confidence, detected_faces, and setting the latter two as None in default? Like landmark, None, None

1adrianb · 2021-04-30T10:35:05Z

Agree, let's go then with 2 cases only: if either landmarks_confidence or detected_faces is True we return 3 values as you suggested, if both are False we will return for now a single value (i.e. only the landmarks). This should simplify this conditioning while maintaining backward compatibility. At a later point in a more major code revision this can be unified.

MagicFrogSJTU · 2021-05-03T15:00:45Z

#271

hengfei-wang · 2023-01-19T22:54:17Z

What is the scale of confidence score? I got something like 1.7108647, 1.718052 , 1.6957333, 1.6364386, 1.5783452, 1.6006193. Is it not in (0,1)?

1adrianb added the question label Apr 1, 2021

1adrianb added the enhancement label Apr 28, 2021

1adrianb assigned MagicFrogSJTU Apr 29, 2021

MagicFrogSJTU mentioned this issue May 3, 2021

return_landmark_score feature implemented. #271

Merged

1adrianb closed this as completed May 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detection Confidence Needed. #262

Detection Confidence Needed. #262

MagicFrogSJTU commented Apr 1, 2021

1adrianb commented Apr 1, 2021

MagicFrogSJTU commented Apr 1, 2021

1adrianb commented Apr 2, 2021

MagicFrogSJTU commented Apr 6, 2021

MagicFrogSJTU commented Apr 29, 2021 •

edited

1adrianb commented Apr 29, 2021

MagicFrogSJTU commented Apr 29, 2021

MagicFrogSJTU commented Apr 29, 2021

1adrianb commented Apr 29, 2021

MagicFrogSJTU commented Apr 30, 2021 •

edited

MagicFrogSJTU commented Apr 30, 2021

1adrianb commented Apr 30, 2021

MagicFrogSJTU commented May 3, 2021

hengfei-wang commented Jan 19, 2023

Detection Confidence Needed. #262

Detection Confidence Needed. #262

Comments

MagicFrogSJTU commented Apr 1, 2021

1adrianb commented Apr 1, 2021

MagicFrogSJTU commented Apr 1, 2021

1adrianb commented Apr 2, 2021

MagicFrogSJTU commented Apr 6, 2021

MagicFrogSJTU commented Apr 29, 2021 • edited

1adrianb commented Apr 29, 2021

MagicFrogSJTU commented Apr 29, 2021

MagicFrogSJTU commented Apr 29, 2021

1adrianb commented Apr 29, 2021

MagicFrogSJTU commented Apr 30, 2021 • edited

MagicFrogSJTU commented Apr 30, 2021

1adrianb commented Apr 30, 2021

MagicFrogSJTU commented May 3, 2021

hengfei-wang commented Jan 19, 2023

MagicFrogSJTU commented Apr 29, 2021 •

edited

MagicFrogSJTU commented Apr 30, 2021 •

edited