SoDeep: A Sorting Deep Net to Learn Ranking Loss Surrogates

Code associated with the paper SoDeep: A Sorting Deep Net to Learn Ranking Loss Surrogates

This code contains the loss functions derived from the following metrics.

Spearman correlation
Mean Average Precision
Recall

It also contains the code to train the approximation of the rank function (synthetic data generation, model architecture, training script).
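
To make the training target concrete, the sketch below draws synthetic score sequences and pairs each one with its exact ranks, using plain Python. It is only an illustration and is not taken from dataset.py: the helper names (exact_rank, make_synthetic_batch), the uniform sampling range, and the rank convention (0 for the smallest score) are all assumptions.

```python
import random


def exact_rank(scores):
    """Return the rank of each score (0 = smallest), assuming no ties."""
    # Indices of the scores, ordered from smallest to largest value.
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0] * len(scores)
    for position, index in enumerate(order):
        ranks[index] = position
    return ranks


def make_synthetic_batch(batch_size, seq_len, seed=None):
    """Draw random sequences and pair each one with its exact ranks."""
    rng = random.Random(seed)
    batch = []
    for _ in range(batch_size):
        # Hypothetical choice: scores sampled uniformly in [-1, 1].
        scores = [rng.uniform(-1.0, 1.0) for _ in range(seq_len)]
        batch.append((scores, exact_rank(scores)))
    return batch
```

For example, exact_rank([0.3, -1.2, 2.5, 0.0]) returns [2, 0, 3, 1]. A learned sorter would be trained to map each score sequence to a differentiable approximation of these ranks.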

Author and contact: Martin Engilberge

Main dependencies

This code is written in Python. To use it you will need:

Python 3.7
PyTorch 1.1
NumPy
TensorboardX

Getting started

To be able to use the loss function, the first step is to train a sorter to approximate the ranking function.

python train.py

Once training is finished, the sorter's checkpoints are stored in the weights folder. By default, the model used in the paper (lstm_large) is selected and the sequence length is set to 100. Other models are defined in model.py and can be selected with the -m argument.

python train.py -m gruc -n model_gruc

The GRU-based models were developed after the publication of the paper and might perform better. The sorter_exact model doesn't need to be trained and can be used as a reliable baseline.

By default, the training script uses the GPU. You can switch to CPU mode by uncommenting device = torch.device("cpu") at the beginning of the script.

Using the loss function

Once you have trained a sorter, or if you decide to use the algorithmic one (sorter_exact), you can use the loss functions.

There are four losses:

SpearmanLoss(sorter_type, seq_len=None, sorter_state_dict=None)
MapRankingLoss(sorter_type, seq_len=None, sorter_state_dict=None)
# Multimodal rank-based loss
RankLoss(sorter_type, seq_len=None, sorter_state_dict=None)
# Hard-negative multimodal rank-based loss
RankHardLoss(sorter_type, seq_len=None, sorter_state_dict=None, margin=0.2)

Each loss function can take three main arguments:

sorter_type: the model of sorter used
seq_len: the length of the sequences the sorter was trained on
sorter_state_dict: the state dict containing the weights of the sorter

The function load_sorter is provided to load all the required arguments at once from a sorter checkpoint.

import sys
sys.path.append("/path/to/sodeep/folder/")
from sodeep import load_sorter, SpearmanLoss

criterion = SpearmanLoss(*load_sorter(sorter_checkpoint_path))
criterion.to(device)

The syntax to use the programmed sorter is as follows:

import sys
sys.path.append("/path/to/sodeep/folder/")
from sodeep import SpearmanLoss

criterion = SpearmanLoss("exa")
criterion.to(device)
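
For intuition about what a Spearman-based loss measures, the classical Spearman correlation can be computed directly from exact ranks. The sketch below is plain Python and is not the code in sodeep.py: the function names are hypothetical and ties are assumed to be absent. A differentiable surrogate would replace the exact ranks with the output of a sorter.

```python
def _ranks(values):
    """Rank of each value (0 = smallest), assuming no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0] * len(values)
    for position, index in enumerate(order):
        ranks[index] = position
    return ranks


def spearman_correlation(predictions, targets):
    """Spearman correlation from the squared differences between ranks."""
    n = len(predictions)
    rank_p = _ranks(predictions)
    rank_t = _ranks(targets)
    squared_diff = sum((p - t) ** 2 for p, t in zip(rank_p, rank_t))
    # Standard closed form, valid when there are no ties.
    return 1.0 - 6.0 * squared_diff / (n * (n ** 2 - 1))
```

Two sequences in the same order give a correlation of 1.0 and two sequences in opposite order give -1.0, so minimizing the squared rank differences pushes the predicted ordering toward the target ordering.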

The model.py file also contains an UpdatingWrapper, which can be used to update the sorter on real data while it is used in a loss. For stability reasons, it might be necessary to use the proposed loss in combination with another loss.

On some regression tasks, we noticed that initializing with an L1 loss for a couple of epochs was required before using the SpearmanLoss.
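
One possible way to organize such a warm-up is sketched below. It is an assumption about how the two losses could be scheduled and not code from this repository: the function name scheduled_loss, the default warmup_epochs=2, and the two weights are hypothetical and would need tuning for each task.

```python
def scheduled_loss(l1_value, rank_value, epoch, warmup_epochs=2,
                   l1_weight=0.0, rank_weight=1.0):
    """Use only the L1 term during warm-up, then a weighted mix of both terms.

    l1_value and rank_value are the two loss values already computed for the
    current batch; they can be floats or scalar tensors.
    """
    if epoch < warmup_epochs:
        # Warm-up phase: train with the L1 loss only.
        return l1_value
    # After warm-up: rank-based loss, optionally combined with the L1 loss.
    return l1_weight * l1_value + rank_weight * rank_value
```

Setting l1_weight to a non-zero value keeps the auxiliary loss active after the warm-up, which corresponds to using the ranking loss in combination with another loss.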

Replicating experiments

To reproduce the results on cross-modal retrieval, we start from the model from the paper Finding beans in burgers: Deep semantic-visual embedding with localization. The code is available here.

We replace the hard negative triplet loss with our SoDeep ranking loss as follows:

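from sodeep import load_sorter, RankHardLoss

# sorter_type is a required argument; here the sorter is loaded from a trained checkpoint
criterion = RankHardLoss(*load_sorter(sorter_checkpoint_path))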

We also change the following hyper-parameters:

batch_size = 100
Preprocessing image resize:
- train: 350
- validation: 350
- test: 525

During testing the data are shuffled. Add the following line at the end of the __init__ function of CocoCaptionsRV:

random.shuffle(self.content)

Reference

If you found this code useful, please cite the following paper:

@inproceedings{engilberge2019sodeep,
	title={SoDeep: A Sorting Deep Net to Learn Ranking Loss Surrogates},
	author={Engilberge, Martin and Chevallier, Louis and P{\'e}rez, Patrick and Cord, Matthieu},
	booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
	year={2019}
}

License

This project is licensed under the terms of the BSD 3-Clause Clear License. By downloading this program, you commit to comply with the license as stated in the LICENSE.md file.
