NDCG@10 doesn't include documents rated from Explain Other #78

epugh · 2020-02-13T22:30:10Z

NDCG@10 appears to look only at the first 10 search results, which I think is the "local" NDCG implementation. We noticed that for a query with only a single result, the score is always 100 no matter the rating, because of how NDCG works. This makes sense.
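To illustrate why a single result always scores 100, here is a minimal Python sketch of a "local" NDCG. It is not the actual scorer code: the function names, the exponential gain formula, and the handling of an all-zero ideal are assumptions made for the example.

```python
import math

def dcg(ratings, k=10):
    """Discounted cumulative gain over the first k ratings.

    Assumption: exponential gain (2^rating - 1) with a log2 position discount.
    """
    return sum((2 ** r - 1) / math.log2(i + 2) for i, r in enumerate(ratings[:k]))

def ndcg_local(returned_ratings, k=10):
    """NDCG where the ideal ordering is built only from the returned documents."""
    ideal = sorted(returned_ratings, reverse=True)
    best = dcg(ideal, k)
    # Assumption: report 0.0 when the ideal DCG is 0 to avoid dividing by zero.
    return dcg(returned_ratings, k) / best if best > 0 else 0.0

# With one returned document, the actual and ideal orderings are identical,
# so every nonzero rating normalizes to 1.0 (shown as 100 in the UI).
for rating in (1, 2, 3):
    print(rating, ndcg_local([rating]))
```

Because the ideal list is just the returned list re-sorted, a lone document is always compared against itself.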

However, when we use Explain Other to find other relevant documents and rate them, the score for the single-document result stays at 100, because those documents don't show up in the search results. We think we should either include the Explain Other rated documents when there are fewer than 10 results, or always use all of the Explain Other ratings (and make them easy to find in the UI).


epugh · 2020-02-13T22:30:39Z

epugh · 2020-02-13T22:33:52Z

Looking at the ratings table for the screenshot above, we see that all four documents are rated: one returned by the query and three others found through Explain Other.

epugh · 2020-02-13T22:49:57Z

Digging through the chain, it appears that NDCG@10 uses the eachDoc scorer, which only looks at the documents returned by the search engine. However, the version that @brigaldies wrote (which isn't the default one) uses getBestDocs, which appears to use all the rated documents. See the gist here: https://gist.github.com/epugh/2e0a591e2d06b5c472bdfa704a27b142
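For comparison, a rough Python sketch of the difference this makes when the ideal ordering is built from all rated documents rather than only the returned ones. This is not the code from the gist or from Quepid: `ndcg_global`, the gain formula, and the example ratings are hypothetical and only illustrate the idea.

```python
import math

def dcg(ratings, k=10):
    """Discounted cumulative gain over the first k ratings.

    Assumption: exponential gain (2^rating - 1) with a log2 position discount.
    """
    return sum((2 ** r - 1) / math.log2(i + 2) for i, r in enumerate(ratings[:k]))

def ndcg_global(returned_ratings, all_ratings, k=10):
    """NDCG where the ideal ordering is built from every rated document,
    including ones that the query did not return."""
    ideal = sorted(all_ratings, reverse=True)
    best = dcg(ideal, k)
    # Assumption: report 0.0 when nothing has a positive rating.
    return dcg(returned_ratings, k) / best if best > 0 else 0.0

# Hypothetical ratings: one document returned by the query, plus three
# documents rated through Explain Other that the query did not return.
returned = [1]
all_rated = [1, 3, 3, 2]

score = ndcg_global(returned, all_rated)
print(score)  # well below 1.0, since better-rated documents are missing
```

With the unreturned documents counted in the ideal, the single-result query no longer gets a perfect score.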

TheSench · 2020-02-14T13:42:32Z

I can't speak to the exactness of the numbers, but overall, the scores from the linked gist look closer to what I would expect.

TheSench · 2020-02-17T17:23:11Z

What is the expected behavior when the query returns no results, and there are valid results in the "Missing documents"?

epugh · 2020-03-08T00:22:22Z

With the fix in #90, the scoring will include all the rated documents. Marking this closed; we are working on a release.

epugh mentioned this issue Feb 13, 2020

Docs rated using Explain Other get lost. #79

Closed

TheSench mentioned this issue Feb 14, 2020

"nDCG@10" scorer always returns 100 #77

Closed

nathancday mentioned this issue Mar 4, 2020

Access top ratings globally #90

Merged

5 tasks

epugh closed this as completed Mar 8, 2020
