Fix matcher recall threshold #2652

zhiqiangdon · 2023-01-06T07:54:21Z

Issue #, if available:

Description of changes:
Normalize the accumulated recall to (0, 1) to satisfy the stopping threshold.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

github-actions · 2023-01-06T09:43:16Z

Job PR-2652-c79bfd5 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2652/c79bfd5/index.html

sxjscience · 2023-01-06T19:11:48Z

multimodal/src/autogluon/multimodal/optimization/utils.py

@@ -228,6 +228,7 @@ def compute_hit_rate(features_a, features_b, logit_scale, top_ks=[1, 5, 10]):
        for k in top_ks:
            hit_rate += (preds < k).float().mean()

+    hit_rate /= len(top_ks) * len(logits)


Consider to add a test-case via the hit_rate implemented in torchmetrics: https://torchmetrics.readthedocs.io/en/stable/retrieval/hit_rate.html. I've implemented the test-case as follows and have verified the new implementation. Feel free to add it intest_utils.py.

from torchmetrics import RetrievalHitRate import numpy.testing as npt def ref_symmetric_hit_rate(features_a, features_b, logit_scale, top_ks=[1, 5, 10]): assert len(features_a) == len(features_b) hit_rate = 0 logits_per_a = (logit_scale * features_a @ features_b.t()).detach().cpu() logits_per_b = logits_per_a.t().detach().cpu() num_elements = len(features_a) for logits in [logits_per_a, logits_per_b]: preds = logits.reshape(-1) indexes = torch.broadcast_to(torch.arange(num_elements).reshape(-1, 1), (num_elements, num_elements)).reshape(-1) target = torch.eye(num_elements, dtype=bool).reshape(-1) for k in top_ks: hr_k = RetrievalHitRate(k=k) hit_rate += hr_k(preds, target, indexes=indexes) return hit_rate / (2 * len(top_ks)) def test_symmetric_hit_rate(): generator = torch.Generator() generator.manual_seed(0) for repeat in range(3): for top_ks in [[1, 5, 10], [20], [3, 7, 9]]: features_a = torch.randn(50, 2, generator=generator) features_b = torch.randn(50, 2, generator=generator) hit_rate_impl = compute_hit_rate(features_a, features_b, logit_scale=1.0, top_ks=top_ks) hit_rate_ref = ref_symmetric_hit_rate(features_a, features_b, logit_scale=1.0, top_ks=top_ks) npt.assert_equal(hit_rate_impl.item(), hit_rate_ref.item())

Added. Thanks.

github-actions · 2023-01-06T21:22:28Z

Job PR-2652-582d50a is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2652/582d50a/index.html

fix recall threshold

c79bfd5

zhiqiangdon requested review from bryanyzhu and sxjscience January 6, 2023 07:54

zhiqiangdon requested a review from tonyhoo January 6, 2023 18:51

sxjscience reviewed Jan 6, 2023

View reviewed changes

zhiqiangdon added 2 commits January 6, 2023 11:39

add test

c239510

lint

582d50a

tonyhoo approved these changes Jan 6, 2023

View reviewed changes

sxjscience approved these changes Jan 6, 2023

View reviewed changes

sxjscience merged commit aedb720 into autogluon:master Jan 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix matcher recall threshold #2652

Fix matcher recall threshold #2652

zhiqiangdon commented Jan 6, 2023

github-actions bot commented Jan 6, 2023

sxjscience Jan 6, 2023

zhiqiangdon Jan 6, 2023

github-actions bot commented Jan 6, 2023

Fix matcher recall threshold #2652

Fix matcher recall threshold #2652

Conversation

zhiqiangdon commented Jan 6, 2023

github-actions bot commented Jan 6, 2023

sxjscience Jan 6, 2023

Choose a reason for hiding this comment

zhiqiangdon Jan 6, 2023

Choose a reason for hiding this comment

github-actions bot commented Jan 6, 2023