You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Learning to rank algorithms ideally do not depend on the input order of the list of documents. DCG and other measures are over the ranking induced by the scores for the documents, and the position/rank of relevance labels within this.
Since we only keep the list_size number of documents, shuffling makes us select a random set of list_size documents for every call. Otherwise, we only keep the top list_size documents.
Using the libsvm parser I noticed my documents were getting shuffled. I found this code:
ranking/tensorflow_ranking/python/data.py
Line 411 in d7008f2
Why would one shuffle the documents? Doesn't the order (position) of your relevance labels imply meaning? Wouldn't this throw off DCG?
DCG for reference:
The text was updated successfully, but these errors were encountered: