
Provide additional details on HPatches evaluation #136

Open
Parskatt opened this issue Jan 12, 2022 · 8 comments

Comments

@Parskatt

Parskatt commented Jan 12, 2022

Hi, I'm interested in how you calculated the AUC for homographies on HPatches exactly.

Related:
#105

@zehongs
Member

zehongs commented Jan 18, 2022

Hi, I've checked the related issue.
We use the implementation of AUC from SuperGlue. Here's the code in our repo:

LoFTR/src/utils/metrics.py

Lines 151 to 154 in 94e98b6

last_index = np.searchsorted(errors, thr)
y = recall[:last_index] + [recall[last_index-1]]
x = errors[:last_index] + [thr]
aucs.append(np.trapz(y, x) / thr)
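For context, a self-contained version of that AUC computation might look like the following (a sketch based on the excerpt above, not the repo's exact code; `error_auc` is an assumed name, and `errors` are per-pair corner errors in pixels):

```python
import numpy as np

# np.trapz was renamed to np.trapezoid in NumPy 2.0; support both.
trapz = np.trapezoid if hasattr(np, "trapezoid") else np.trapz

def error_auc(errors, thresholds):
    """Area under the recall-vs-error curve, normalized by each threshold."""
    errors = np.sort(np.asarray(errors, dtype=float))
    # Recall at each sorted error; prepend (0, 0) so the curve starts at the origin.
    recall = (np.arange(len(errors)) + 1) / len(errors)
    errors = np.r_[0.0, errors]
    recall = np.r_[0.0, recall]
    aucs = []
    for thr in thresholds:
        last_index = np.searchsorted(errors, thr)
        y = np.r_[recall[:last_index], recall[last_index - 1]]
        x = np.r_[errors[:last_index], thr]
        aucs.append(trapz(y, x) / thr)
    return aucs
```

For example, a single pair with corner error 0.5 px at a 1 px threshold gives `error_auc([0.5], [1.0]) -> [0.75]`.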

And as stated in the paper, the error term is defined as
error = l2norm(homography_projection(corners, H_gt) - homography_projection(corners, H_est)). To compute H_est, we use cv2.findHomography(src_pts, tgt_pts, cv2.RANSAC, 3.)
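In other words, something like the following minimal numpy sketch (`project` and `corner_error` are names I'm assuming; `H_est` would come from the `cv2.findHomography` call above):

```python
import numpy as np

def project(points, H):
    # Apply a 3x3 homography to Nx2 points via homogeneous coordinates.
    pts_h = np.hstack([points, np.ones((len(points), 1))])
    warped = pts_h @ H.T
    return warped[:, :2] / warped[:, 2:3]

def corner_error(H_gt, H_est, w, h):
    # Mean L2 distance between the four image corners warped by the
    # ground-truth and the estimated homography.
    corners = np.array([[0, 0], [w, 0], [w, h], [0, h]], dtype=float)
    return np.linalg.norm(project(corners, H_gt) - project(corners, H_est), axis=1).mean()
```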

@Parskatt
Author

Thanks for the response.

From looking at your code, the thresholds are always set to [5, 10, 20].
In the paper you report [3, 5, 10]. Are [5, 10, 20] the thresholds you actually used?

If you have the time, could you help reproduce the results in this repo?

GrumpyZhou/image-matching-toolbox#20

@Parskatt
Author

There, the following results were obtained (albeit with a RANSAC threshold of 2 px):

OpenCV with RANSAC threshold = 2, at error thresholds [3, 5, 10] px:

SuperPoint: 0.37 0.51 0.68
SuperPoint+SuperGlue: 0.39 0.53 0.71
CAPS (w.SuperPoint): 0.33 0.49 0.67
LoFTR (all matches): 0.48 0.6 0.74

@Parskatt
Author

The results seem to align with your own, but with all methods scoring lower, so there appears to be some discrepancy.

@Parskatt
Author

From your paper:
[screenshot of the paper's HPatches evaluation description]

Should this be interpreted as meaning that you also scale the homography?

@zehongs
Copy link
Member

zehongs commented Jan 19, 2022

Yes, this follows Sec 7.3 of the SuperPoint paper.
By scaling the images and then comparing the corner AUCs, the evaluation results are consistent across all image pairs. Otherwise, the evaluation can easily deteriorate with large image inputs. You can also replace the pixel-level threshold with a normalized pixel-level threshold.
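For illustration, one way to adjust the ground-truth homography when both images are rescaled (a sketch under the assumption that the shorter side of each image is resized to 480 with aspect ratio preserved; `scaled_gt_homography` is an assumed name):

```python
import numpy as np

def scaled_gt_homography(H_gt, hw1, hw2, target=480):
    # Scale factors so each image's shorter side becomes `target` pixels.
    s1 = target / min(hw1)
    s2 = target / min(hw2)
    S1 = np.diag([s1, s1, 1.0])
    S2 = np.diag([s2, s2, 1.0])
    # Homography mapping the resized image 1 to the resized image 2:
    # undo the first resize, apply H_gt, then apply the second resize.
    return S2 @ H_gt @ np.linalg.inv(S1)
```

For example, a 10 px translation between two 960x960 images becomes a 5 px translation after both are downscaled to 480x480, which is why corner errors computed in the resized images differ from those in the originals.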

@TruongKhang

TruongKhang commented Mar 3, 2022

Hi @zehongs, I'm also trying to reproduce the result on HPatches. In your last comment above, did you mean that you scaled the ground-truth homography to evaluate on the scaled images? Or did you do the following: scale the images and estimate the matches, then re-scale the matches back to the original resolution and perform the evaluation with the original ground-truth homography?

@georg-bn

georg-bn commented Mar 30, 2022

@zehongs We have been able to reproduce the numbers in GrumpyZhou/image-matching-toolbox#20 , if you could comment on/clarify the below points it would be very helpful:

  • Is it correct that the corner errors are computed in the downsized images?
  • When you write "shorter dimensions equal to 480" do you mean that as in SuperPoint all images are rescaled to 480x640 (or 640x480), or do you keep the aspect ratio of the images so that the dimensions are 480x?? or ??x480?
  • Do you use any special hyperparameter settings for the HPatches experiments? For instance I found that increasing MATCH_COARSE.THR=0.5 yields better results than the default 0.2. Also I noticed this comment:

    LoFTR/test.py

    Lines 49 to 51 in 5d6c834

    # tune when testing
    if args.thr is not None:
        config.LOFTR.MATCH_COARSE.THR = args.thr
  • What settings did you use for cv2.findHomography? EDIT: I see now that you have specified this further up in this thread.

Thank you!
