Adding RankMatchFailure metric #184
Conversation
- Convert y_true to one-hot for the aux loss with categorical cross entropy (see the sketch below)
- Add basic cross entropy (still not fully working)
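For context on the first commit above, here is a minimal, self-contained sketch of converting integer labels to one-hot before applying categorical cross entropy. The shapes, variable names, and the specific tf.one_hot / tf.keras.losses.CategoricalCrossentropy calls are illustrative assumptions, not code from this PR:

import tensorflow as tf

# Hypothetical shapes: y_true_idx holds the index of the relevant record per
# query, y_pred holds a softmax score distribution over the records.
list_size = 4
y_true_idx = tf.constant([2, 0])                      # [batch_size]
y_pred = tf.constant([[0.1, 0.2, 0.6, 0.1],
                      [0.7, 0.1, 0.1, 0.1]])          # [batch_size, list_size]

# Convert the integer labels to one-hot so they line up with the per-record
# distribution expected by categorical cross entropy.
y_true_onehot = tf.one_hot(y_true_idx, depth=list_size)

aux_loss = tf.keras.losses.CategoricalCrossentropy()(y_true_onehot, y_pred)
print(float(aux_loss.numpy()))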
@mohazahran, I added the tests too.
dtype : str, optional
    data type of the metric result.
rank : Tensor object
    2D tensor representing ranks/rankitions of records in a query
A typo in 'rankitions'.
@@ -36,18 +36,24 @@ def _loss_fn(y_true, y_pred):
    mask : [batch_size, num_classes]
Only auto-formatting.
metrics: List[Union[str, Type[Metric]]],
feature_config: FeatureConfig,
metadata_features: Dict,
for_aux_output: bool = False,
This needs to be added to the Parameters list in the docstring.
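A possible wording for that docstring entry, assuming the numpydoc style seen in the other hunks of this PR; the description text is only a suggestion, not the project's actual documentation:

def get_metrics_impl(
    metrics,
    feature_config,
    metadata_features,
    for_aux_output: bool = False,
):
    """
    Parameters
    ----------
    ...
    for_aux_output : bool, optional
        Whether the metrics are being instantiated for the auxiliary output.
        Combination metrics are skipped when this is True.
    """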
from typing import Optional, Dict
class CombinationMetric: |
Can you remind me why this is needed?
This is used to distinguish single-label metrics from multi-label metrics. Multi-label metrics should be computed only for one of the outputs.
@@ -41,6 +43,9 @@ def get_metrics_impl(
    metrics_impl: List[Union[Metric, str]] = list()

    for metric in metrics:
        if isinstance(metric, ranking_metrics_impl.CombinationMetric) and for_aux_output:
Here, it's not allowing RankMatchFailure to be a metric for the aux output, but it does allow the other metrics (such as MRR) to be metrics for the aux output, right?
It seems to me that MRR and the other non-NMF metrics shouldn't be computed for the aux output because they don't convey any information. In fact, it's useless to track the value of train_aux_ranking_score_new_MRR, as it's being measured against the title scores, which are not clicks (correct me if I'm wrong here). However, the NMF metric should be part of either the primary output or the aux output; if it's allowed for both, then both should give the same metric value, right? For example, train_ranking_score_new_RankMatchFailure should be equal to train_aux_ranking_score_new_RankMatchFailure.
To keep the implementation straightforward, combination metrics are defined only for the "main output". The other metrics may or may not be relevant to the aux output (depending on the actual aux label). That's the reasoning behind the current design. Let me know if you feel another approach is more appropriate.
However, the NMF metric should be part of either the primary output or the aux output; if it's allowed for both, then both should give the same metric value, right? For example, train_ranking_score_new_RankMatchFailure should be equal to train_aux_ranking_score_new_RankMatchFailure.
This is accurate, but it's an implementation nightmare at this point. Open to suggestions here.
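To make the design described above concrete, here is a self-contained sketch of the marker-class approach: CombinationMetric only tags the multi-label metrics, and the metric builder skips them when assembling metrics for the aux output. The stub classes, the helper name get_metrics_for_output, and the issubclass check are illustrative assumptions; the actual ml4ir implementation works on its keras Metric classes, as shown in the hunk above.

from typing import List, Type


class CombinationMetric:
    """Marker for metrics that combine the primary and auxiliary labels."""


class MRR:
    """Single-label metric stub: depends only on the output it is attached to."""


class RankMatchFailure(CombinationMetric):
    """Multi-label metric stub: needs both the primary and the aux label."""


def get_metrics_for_output(metrics: List[Type], for_aux_output: bool = False) -> List[Type]:
    """Skip combination metrics when building the metric list for the aux output."""
    selected = []
    for metric in metrics:
        if for_aux_output and issubclass(metric, CombinationMetric):
            # Combination metrics are defined only for the main output.
            continue
        selected.append(metric)
    return selected


print(get_metrics_for_output([MRR, RankMatchFailure], for_aux_output=True))   # only MRR remains
print(get_metrics_for_output([MRR, RankMatchFailure], for_aux_output=False))  # both metrics remain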
In that case, it seems to me that no metrics other than the loss should be tracked for the aux output, and all the other metrics (loss, MRR, ACR and RankNMF) should be tracked for the main output. In other words, the only changes from training a model without aux_labels are that:
- we add RankNMF to the set of metrics tracked by the main output whenever an aux_label is given
- we track only the loss for the aux output
Let me know what you think.
This makes sense for the particular aux feature we're talking about. We might want to measure MRR on a different aux output, no?
Generally it's possible, but that aux source would have to be clicks, no?
If we want to account for all the different cases of possible aux targets/outputs, then perhaps we need to make the set of metrics to be tracked a user input. What do you think?
Yes, that's what I was thinking too. Will track this in a follow-up.
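Sketching that follow-up idea, user-selectable per-output metrics might look roughly like the configuration below; every key and value here is hypothetical, and nothing of the sort exists in the current config schema:

# Hypothetical sketch only: per-output metric selection exposed as user input.
per_output_metrics = {
    "ranking_score": ["MRR", "ACR", "RankMatchFailure"],  # main output
    "aux_ranking_score": [],                              # aux output: track only the loss
}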
primary_training_loss = float(ml4ir_results.loc[ml4ir_results[0] == 'train_ranking_score_loss'][1])
assert np.isclose(primary_training_loss, 1.1877643, atol=0.0001)
aux_training_loss = float(ml4ir_results.loc[ml4ir_results[0] == 'train_aux_ranking_score_loss'][1])
assert np.isclose(aux_training_loss, 2.3386843, atol=0.0001)
This expected value "2.3386843" is not correct and it's failing the test case. I checked; this value was changed from my PR. This is the value from my PR:
assert np.isclose(aux_training_loss, 1.2242277, atol=0.0001)
Thanks for pointing this out. I might have messed up a merge conflict resolution.
Thanks, Arvind, for this PR!
I recommend a follow-up story to better select the metrics for each output (primary and aux).