
MDPRank

One of the central issues in learning to rank for information retrieval is to develop algorithms that construct ranking models by directly optimizing evaluation measures such as normalized discounted cumulative gain (NDCG). Existing methods usually focus on optimizing a specific evaluation measure calculated at a fixed position, e.g., NDCG at position K. In information retrieval, however, the evaluation measures, including the widely used NDCG and P@K, are designed to evaluate the document ranking at all ranking positions, which provides much richer information than measuring the ranking at a single position. Thus, it is interesting to ask whether we can devise an algorithm that leverages the measures calculated at all ranking positions to learn a better ranking model. In this paper, we propose a novel learning to rank model based on the Markov decision process (MDP), referred to as MDPRank. In the learning phase of MDPRank, the construction of a document ranking is formulated as sequential decision making, where each step corresponds to an action of selecting a document for the corresponding position. The REINFORCE policy gradient algorithm is adopted to train the model parameters. The evaluation measures calculated at every ranking position are used as immediate rewards for the corresponding actions, guiding the learning algorithm to adjust the model parameters so that the measures are optimized. Experimental results on LETOR benchmark datasets show that MDPRank can outperform the state-of-the-art baselines.

Reinforcement Learning to Rank with Markov Decision Process (SIGIR-2017)
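Below is a minimal, hypothetical sketch of the kind of training loop the abstract describes: a linear scoring function, a softmax policy over the still-unranked documents, per-position DCG gains as immediate rewards, and a REINFORCE policy-gradient update. Names such as `train_query` and `dcg_gain` are illustrative assumptions and do not refer to code in this repository.

```python
# Minimal sketch (not this repository's code) of an MDPRank-style training step:
# linear scores, softmax policy over remaining candidates, DCG gain per position
# as the immediate reward, REINFORCE update weighted by the return from each step.
import numpy as np

def dcg_gain(label, position):
    """DCG contribution of a document with relevance `label` placed at `position` (0-based)."""
    return (2.0 ** label - 1.0) / np.log2(position + 2.0)

def train_query(w, features, labels, lr=1e-3, rng=np.random):
    """Run one episode on a single query and apply a REINFORCE update to `w`.

    features: (n_docs, n_features) array; labels: (n_docs,) relevance grades.
    Each step samples a document for the next rank position from a softmax
    policy over the still-unranked documents and receives that document's
    DCG gain at the position as the immediate reward.
    """
    remaining = list(range(len(labels)))
    log_grads, rewards = [], []

    for pos in range(len(labels)):
        cand = np.array(remaining)
        scores = features[cand] @ w
        probs = np.exp(scores - scores.max())
        probs /= probs.sum()

        idx = rng.choice(len(cand), p=probs)          # action: pick a document
        doc = int(cand[idx])
        remaining.remove(doc)

        # gradient of log pi(action | state) for a softmax over linear scores
        log_grads.append(features[doc] - probs @ features[cand])
        # immediate reward: the selected document's contribution to DCG at `pos`
        rewards.append(dcg_gain(labels[doc], pos))

    # REINFORCE: weight each step's log-policy gradient by the return from that step onward
    returns = np.cumsum(rewards[::-1])[::-1]
    for g, G in zip(log_grads, returns):
        w = w + lr * G * g
    return w

# Toy usage: a query with 5 candidate documents and 10 features per document.
w = np.zeros(10)
X = np.random.randn(5, 10)
y = np.array([2, 0, 1, 0, 3])
for _ in range(200):
    w = train_query(w, X, y)
```

Because the immediate reward at each position is that position's DCG gain, the return from step t aggregates the measure over all positions from t onward, which is how the evaluation measure at every ranking position, not just a single cutoff, guides the parameter update.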

