This repository contains the code for the CIKM 2023 full paper *LambdaRank Gradients Are Incoherent*.
In Information Retrieval (IR), the Learning-to-Rank (LTR) task requires building a ranking model that optimises a specific IR metric. One of the most effective approaches to do so is the well-known LambdaRank algorithm. LambdaRank uses gradient descent optimisation, and at its core, it defines approximate gradients, the so-called lambdas, for a non-differentiable IR metric. Intuitively, each lambda describes how much a document's score should be "pushed" up/down to reduce the ranking error.
In this work, we show that lambdas may be incoherent w.r.t. the metric being optimised: e.g., a document with high relevance in the ground truth may receive a smaller gradient push than a document with lower relevance. This behaviour goes far beyond the expected degree of approximation. We analyse such behaviour of LambdaRank gradients and we introduce some strategies to reduce their incoherencies. We demonstrate through extensive experiments, conducted using publicly available datasets, that the proposed approach reduces the frequency of the incoherencies in LambdaRank and derivatives, and leads to models that achieve statistically significant improvements in the NDCG metric, without compromising the training efficiency.
Lambda-eX is a document-pairs selection strategy built on top of LightGBM.
The code implements the LambdaMART and LambdaLoss (NDCG-Loss2 and NDCG-Loss2++) algorithms, and all the combinations of Lambda-eX with the LambdaMART and LambdaLoss loss functions.
Lambda-eX is accessible through the `lambdarank` parameter `lambda_ex` (or `lambdaex`) with the following values:

- `"plain"` to enforce the original algorithm (no Lambda-eX) (default).
- `"static"` to enforce Lambda-eX static.
- `"random"` to enforce Lambda-eX random.
- `"all"` to enforce Lambda-eX all.
- `"all-static"` to enforce Lambda-eX all-static.
- `"all-random"` to enforce Lambda-eX all-random.
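As a minimal sketch (assuming the Python package is built from this fork, and using the parameter names listed above), Lambda-eX static could be enabled like any other `lambdarank` parameter:

```python
params = {
    "objective": "lambdarank",  # standard LightGBM ranking objective
    "lambda_ex": "static",      # Lambda-eX strategy provided by this fork
    "metric": "ndcg",
}
```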
The code implements three loss functions: LambdaRank, NDCG-Loss2 and NDCG-Loss2++. The three loss functions are accessible through the `lambdarank` parameters `lambdarank_weight` (or `lr_mu`) and `lambdaloss_weight` (or `ll_mu`), with the following combinations:

- `lambdarank_weight=1` and `lambdaloss_weight=0` to enforce the LambdaRank loss function (default).
- `lambdarank_weight=0` and `lambdaloss_weight=1` to enforce the NDCG-Loss2 loss function.
- `lambdarank_weight=1` and `lambdaloss_weight>0` to enforce the NDCG-Loss2++ loss function.
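For instance, a hedged sketch of a parameter dictionary selecting NDCG-Loss2++ (the value 0.5 below is only illustrative) could look like this, again assuming the Python package built from this fork:

```python
params = {
    "objective": "lambdarank",
    "lambdarank_weight": 1,    # alias: lr_mu
    "lambdaloss_weight": 0.5,  # alias: ll_mu; any value > 0 yields NDCG-Loss2++
    "metric": "ndcg",
}
```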
Examples of valid configurations:

- for LambdaMART: `objective="lambdarank"`.
- for LambdaMART-eX random: `objective="lambdarank"` and `lambda_ex="random"`.
- for LambdaLoss-eX static with NDCG-Loss2++: `objective="lambdarank"`, `lambda_ex="static"`, and `lambdaloss_weight=0.5`.
- for LambdaLoss-eX all-random with NDCG-Loss2: `objective="lambdarank"`, `lambdaex="all-random"`, `lr_mu=0`, and `ll_mu=1`.
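Putting it together, the sketch below trains the last configuration above (LambdaLoss-eX all-random with NDCG-Loss2). The synthetic data and hyper-parameter values are placeholders, and the Python package is assumed to be built from this fork:

```python
import numpy as np
import lightgbm as lgb

# Hypothetical ranking data: 10 queries with 20 documents each.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 10))       # document feature vectors
y = rng.integers(0, 5, size=200)     # graded relevance labels (0..4)
group = [20] * 10                    # number of documents per query

train_set = lgb.Dataset(X, label=y, group=group)

params = {
    "objective": "lambdarank",
    "lambdaex": "all-random",  # alias of lambda_ex
    "lr_mu": 0,                # alias of lambdarank_weight
    "ll_mu": 1,                # alias of lambdaloss_weight -> NDCG-Loss2
    "metric": "ndcg",
    "ndcg_eval_at": [10],
}

model = lgb.train(params, train_set, num_boost_round=50)
scores = model.predict(X)  # per-document scores, to be ranked within each query
```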
Follow the installation instructions in the LightGBM GitHub repository. Where needed, replace the repository https://github.com/microsoft/LightGBM with this one.
@inproceedings{10.1145/3583780.3614948,
author = {Marcuzzi, Federico and Lucchese, Claudio and Orlando, Salvatore},
title = {LambdaRank Gradients Are Incoherent},
year = {2023},
isbn = {9798400701245},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3583780.3614948},
doi = {10.1145/3583780.3614948},
abstract = {In Information Retrieval (IR), the Learning-to-Rank (LTR) task requires building a ranking model that optimises a specific IR metric. One of the most effective approaches to do so is the well-known LambdaRank algorithm. LambdaRank uses gradient descent optimisation, and at its core, it defines approximate gradients, the so-called lambdas, for a non-differentiable IR metric. Intuitively, each lambda describes how much a document's score should be "pushed" up/down to reduce the ranking error. In this work, we show that lambdas may be incoherent w.r.t. the metric being optimised: e.g., a document with high relevance in the ground truth may receive a smaller gradient push than a document with lower relevance. This behaviour goes far beyond the expected degree of approximation. We analyse such behaviour of LambdaRank gradients and we introduce some strategies to reduce their incoherencies. We demonstrate through extensive experiments, conducted using publicly available datasets, that the proposed approach reduces the frequency of the incoherencies in LambdaRank and derivatives, and leads to models that achieve statistically significant improvements in the NDCG metric, without compromising the training efficiency.},
booktitle = {Proceedings of the 32nd ACM International Conference on Information and Knowledge Management},
pages = {1777–1786},
numpages = {10},
keywords = {lambdarank, information retrieval, learning to rank},
location = {Birmingham, United Kingdom},
series = {CIKM '23}
}