For a problem that requires an order to be learnt, availability of the following two datasets is assumed:
-
$D_{main}$ : A dataset of$x_i$ which ultimately we would like to predict the order between any two members. -
$D_{comp}$ : A dataset consisting of ordered tuples$(x_i, x_j, o)$ , where$o \in {\geq, \leq, \sim }$
A simple toy example.
Rank = 1 | Rank = 10 | Rank = 20 | Rank = 30 | Rank = 40 |
---|---|---|---|---|
For
code for these datasets is here
A Comparator model is used to learn the ranking using
Code for the models can be found here
The training script is here
After training with the EXTREME sampling scheme the model learns the following (Column names are:
- 0 means
$x_i \leq x_j$ - 1 means
$x_i \sim x_j$ - 2 means
$x_i \geq x_j$