GitHub - junkangwu/BSL: [ICDE2024] Official code of "BSL: Understanding and Improving Softmax Loss for Recommendation"

BSL: Understanding and Improving Softmax Loss for Recommendation

Performance comparison of different loss functions based on MF and LightGCN on the Yelp2018 and Amazon datasets. SL consistently achieves superior performance over other losses by a significant margin (>15% gain) across models and datasets.

This is the PyTorch implementation for our ICDE 2024 paper.

Junkang Wu, Jiawei Chen, Jiancan Wu, Wentao Shi, Jizhi Zhang, Xiang Wang 2024. BSL: Understanding and Improving Softmax Loss for Recommendation. arxiv link

Prerequisites

Python 3.9
PyTorch 1.11.0

Training & Evaluation

unzip the dataset

tar -xzvf data/yelp2018.tar.gz -C data

Reprocude the experiments: Commands for reproducing the reported results:

MF

Usage (example):

cd bash
bash lgn_Frame.sh $lr $l2 $n_negs $bsz $t1 $t2 $GPU_ID $loss_mode $DATASET_NAME $drop $loss_fn

$GPU_ID refers to the ID of the launched GPU, while $lr, $l2, and $n_negs represent the learning rate, decay, and number of negative sampling, respectively. $DATASET_NAME is the name of the dataset (e.g. yelp2018). Finally, $loss_fn denotes the loss function used.

Yelp2018

# SL
bash MF_Frame_pos.sh 1e-4 1e-3 800 1024 0.11 1.00 0 reweight yelp2018 drop
# BSL
bash MF_Frame_pos.sh 1e-4 1e-3 800 1024 0.11 1.10 0 multi yelp2018 drop

Amazon

# SL
bash MF_Frame_pos.sh 5e-4 1e-3 1024 1024 0.14 1.00 0 reweight amazon drop
# BSL
bash MF_Frame_pos.sh 5e-4 1e-3 1024 1024 0.14 1.32 0 reweight amazon drop

Gowalla

# SL
bash MF_Frame_pos.sh 1e-4 1e-9 800 1024 0.08 1.00 0 reweight gowalla drop
# BSL
bash MF_Frame_pos.sh 1e-4 1e-9 800 1024 0.08 1.22 0 multi gowalla drop

Movielens-1M

# SL
bash MF_Frame_pos.sh 1e-4 1e-3 800 2048 0.17 1.00 0 reweight ml nodrop
# BSL
bash MF_Frame_pos.sh 1e-4 1e-3 800 2048 0.17 1.06 0 multi ml nodrop

LightGCN

Usage (example):

cd bash
bash lgn_Frame_pos.sh $lr $l2 $n_negs $bsz $t1 $t2 $GPU_ID $loss_mode $drop $loss_fn $DATASET_NAME $context_hops $patience $generate_method $sampling_method

$GPU_ID refers to the ID of the launched GPU, while $lr, $l2, and $n_negs represent the learning rate, decay, and number of negative sampling, respectively. $DATASET_NAME is the name of the dataset (e.g. yelp2018), and $context_hops indicates the number of layers (1, 2, or 3). $patience, generate_method and $sampling_method refer to the patience for early stopping (50 as default ), prediction score (cosine similarity or inner product) and the sampling method (uniformly sampling and in batch sampling), respectively. Finally, $loss_fn denotes the loss function used and $loss_fn denotes the loss function used.

yelp2018

# SL
bash lgn_Frame_pos.sh 1e-3 1e-5 1024 1024 0.15 1.00 0 reweight nodrop yelp2018 3 50 no_cosine no_sample
# BSL
bash lgn_Frame_pos.sh 1e-3 1e-5 1024 1024 0.15 1.12 0 reweight nodrop yelp2018 3 50 no_cosine no_sample

Amazon

# SL
bash lgn_Frame_pos.sh 1e-3 1e-1 4096 800 0.30  1.00 0 reweight nodrop amazon 3 50 no_cosine no_sample
# BSL
bash lgn_Frame_pos.sh 1e-3 1e-1 4096 800 0.30  0.80 0 reweight nodrop amazon 3 50 no_cosine no_sample

Gowalla

# SL
bash lgn_Frame_pos.sh 1e-3 1e-5 1024 800 0.15  1.00 0 reweight nodrop ml 1 50 cosine uniform
# BSL
bash lgn_Frame_pos.sh 1e-3 1e-5 1024 800 0.15  1.10 0 reweight nodrop ml 1 50 cosine uniform

Movielens-1M

# SL
bash lgn_Frame_pos.sh 1e-3 1e-5 1024 800 0.15  1.00 0 reweight nodrop ml 1 50 cosine uniform
# BSL
bash lgn_Frame_pos.sh 1e-3 1e-5 1024 800 0.15  1.10 0 reweight nodrop ml 1 50 cosine uniform

Documentation

Thanks to their simple forms, these losses are implemented in just a few lines of code in utils/losses.py.py:

# bsz : batch size (number of positive pairs)
# y_pred[:, 0]:  prediction score of postive samples, shape=[bsz]
# y_pred[:, 1:]: prediction score of negative samples, shape=[bsz, bsz-1]
# temperature: t1
# temperature_2: t2
pos_logits = torch.exp(y_pred[:, 0] / self.temperature)
neg_logits = torch.exp(y_pred[:, 1:] / self.temperature)
neg_logits = neg_logits.sum(dim=-1)
neg_logits = torch.pow(neg_logits, self.temperature_2)
loss = - torch.log(pos_logits / neg_logits).mean()

The training log is also provided. The results fluctuate slightly under different running environment.

For any clarification, comments, or suggestions please create an issue or contact me (jkwu0909@mail.ustc.edu.cn).

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
bash		bash
config		config
data		data
log_pos		log_pos
logs		logs
modules		modules
outputs		outputs
utils		utils
.gitignore		.gitignore
README.md		README.md
main.py		main.py

junkangwu/BSL

Folders and files

Latest commit

History

Repository files navigation

BSL: Understanding and Improving Softmax Loss for Recommendation

Prerequisites

Training & Evaluation

MF

Yelp2018

Amazon

Gowalla

Movielens-1M

LightGCN

yelp2018

Amazon

Gowalla

Movielens-1M

Documentation

About

Topics

Resources

Stars

Watchers

Forks

Languages