Add MRR to quepid as a communal scorer. #525
Conversation
I took the ticket name out of the title, because it gets confusing that the ticket isn't the PR, if that makes sense...
Is this MRR or RR? It appears to say RR in the code?
RR for a single query, MRR for a set of queries. Quepid will display MRR; the JavaScript computes RR for each individual query, which Quepid averages together. In general, the aggregate name is used when referring to a metric that is being averaged across queries. Either name is fine in the issue.
I guess I may need to take this on faith. When we refer to DCG, we have a file named DCG.js. I would assume that if we are referring to MRR, we would have a file named MRR.js, and if there was a separate metric called RR, then it would have a scorer named RR.js. I don't mean to be obtuse here, but the goal of Quepid is to make metrics simple and something I can explain to everyday users. So I feel like if we are adding MRR to Quepid, then the file should be called mrr.js.
Okay, now I am super confused. Is this MRR or RR? Or does this NOT follow the pattern that we have of P, AP, DCG, NDCG etc., and is a new naming pattern?
There is only one metric, reciprocal rank. When we average the reciprocal rank scores for multiple queries, the result is called mean reciprocal rank. From the perspective of what the code is computing, rr@10.js computes the reciprocal rank for a single query. The Quepid app then averages that RR value across the set of queries in the collection to produce the MRR score for the full set. The same is true of the computation called AP@10: it computes a single value per query, which is then averaged to produce what should be called MAP@10 in the Quepid display. One metric, averaged across multiple queries.
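To make the per-query vs. aggregate distinction concrete, here is a minimal sketch of the two steps, not the actual rr@10.js scorer from this PR; the function names, the `relevantThreshold` parameter, and the rating arrays are illustrative assumptions:

```javascript
// Illustrative sketch only, not the scorer code in this PR.
// Reciprocal rank for ONE query: ratings are listed in ranked order,
// and a rating above relevantThreshold counts as relevant.
function reciprocalRank(ratings, k = 10, relevantThreshold = 0) {
  for (let i = 0; i < Math.min(ratings.length, k); i++) {
    if (ratings[i] > relevantThreshold) {
      return 1 / (i + 1); // first relevant doc found at rank i + 1
    }
  }
  return 0; // no relevant doc in the top k
}

// MRR for a SET of queries: average the per-query RR values,
// which is the aggregate number Quepid would display.
function meanReciprocalRank(ratingsPerQuery, k = 10) {
  const rrs = ratingsPerQuery.map((ratings) => reciprocalRank(ratings, k));
  return rrs.reduce((sum, rr) => sum + rr, 0) / rrs.length;
}

// Example: first relevant doc at rank 1, rank 3, and never.
// MRR = (1 + 1/3 + 0) / 3 ≈ 0.444
console.log(meanReciprocalRank([
  [3, 0, 0], // RR = 1
  [0, 0, 2], // RR = 1/3
  [0, 0, 0], // RR = 0
]));
```

The same split applies to AP@10 vs. MAP@10: the scorer file produces the per-query value, and the averaging across the collection is what earns the "M" in the displayed name.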
AP should be named MAP, mean average precision. All of the metric names have been established by the evaluation metrics community; the names used by trec_eval should be considered canonical. Note that nDCG does not get called MnDCG when averaged, and P@k does not get called MP@k when averaged. Only Mean Reciprocal Rank and Mean Average Precision have the M-named variant for the aggregate score across a set of queries.
Description
Add reciprocal rank as a communal Scorer
Motivation and Context
Closes #523. Adds a useful metric for known-item search evaluation.
How Has This Been Tested?
Local install of Quepid started with bin/setup_docker followed by bin/docker server
Screenshots or GIFs (if appropriate):
Types of changes
Checklist: