Fix rankers' implementation and indexing errors #320

dipanshu124 · 2021-02-02T15:51:21Z

This PR is a followup of #278 and attempts to resolve the remaining issues.

As the implementation is not correct it is better to revisit or rewrite this in the future.

Before, we were dealing with a vector of FeatureVector objects,ie one FeatureVector per entry in the training set. Now have a separate vector per query in the training set. Before queryids were completely ignored For more info on why this change is needed look the implementation of ListNetRanker at https://www.microsoft.com/en-us/research/wp-content/ uploads/2016/02/tr-2007-40.pdf at page 5 and for the implementation of ListMleRanker look at http://icml2008.cs.helsinki.fi/papers/167.pdf page 6. If you look closely you will notice that the current implementation doesn't take query into account which is clearly wrong.

This change applies to both ListNetRanker and ListMleRanker. The motivation for Xavier initialization in Neural Networks is to initialize the weights of the network so that the neuron activation functions are not starting out in saturated or dead regions. In other words, we want to initialize the parameters with random values that are not “too small” and not “too large.”

Gradient being used to update the parameter per query is divided by the number of documents associated with the query else it will simply give more weightage to a query which has more documents associated with it.

In effect, a bias value allows you to shift the activation function to the left or right, which may be critical for successful learning.

This are due to changes made from fixing ranker implementations, fixing indexing errors, adding Xavier initialisation, adding normalisation of gradient and adding bias combined. This also fixes scorer test which was wrong earlier.

…e Rankers

…alization

ojwb · 2022-06-29T04:51:00Z

Thanks for your work on this, and sorry for having dropped the ball on getting it reviewed and merged.

We've since switched CI from travis-ci to GHA, so I've rebase your branch onto current master, updated the new CI to remove the libsvm stuff and opened a new PR: #324

Closing this, will review and try to actually get this merged via the new PR.

VaibhavKansagara and others added 10 commits January 27, 2021 16:30

Remove SVMRanker and related tests

730daf3

As the implementation is not correct it is better to revisit or rewrite this in the future.

Normalize the gradient

717ea93

Gradient being used to update the parameter per query is divided by the number of documents associated with the query else it will simply give more weightage to a query which has more documents associated with it.

Fix indexing errors

e6f1e89

Add a bias term

619ff4c

In effect, a bias value allows you to shift the activation function to the left or right, which may be critical for successful learning.

Fix training related files and tests

a4b205c

This are due to changes made from fixing ranker implementations, fixing indexing errors, adding Xavier initialisation, adding normalisation of gradient and adding bias combined. This also fixes scorer test which was wrong earlier.

fixup! Change Ranker API and fix implementation of ListNet and ListMl…

86b7f79

…e Rankers

fixup! Initialize the parameters for neural network with Xavier initi…

727c039

…alization

fixup! Add a bias term

f3876b3

dipanshu124 mentioned this pull request Feb 2, 2021

Fix implementation of rankers and indexing errors #278

Closed

ojwb mentioned this pull request Jun 29, 2022

Letor updates #324

Open

ojwb closed this Jun 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix rankers' implementation and indexing errors #320

Fix rankers' implementation and indexing errors #320

dipanshu124 commented Feb 2, 2021

ojwb commented Jun 29, 2022

Fix rankers' implementation and indexing errors #320

Fix rankers' implementation and indexing errors #320

Conversation

dipanshu124 commented Feb 2, 2021

ojwb commented Jun 29, 2022