We study sequential recommendation based on reinforcement learning. Specifically, we propose an Efficient Continuous Control (ECoC) framework that facilitates unified action learning within an actor-critic architecture.
Download the preprocessed datasets Tmall and Yelp.
- Unzip the data and put the data folder in the data/ directory.
- Run the corresponding script within runs/.
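Before launching a run, it can help to confirm that the layout described above is in place. The following is a minimal sketch, not part of the repository: the helper name `check_layout` is hypothetical, and it only assumes that data/ and runs/ sit under the directory you pass in.

```python
from pathlib import Path


def check_layout(root="."):
    """Report whether data/ exists under root and list the entries found in runs/."""
    root = Path(root)
    data_dir = root / "data"
    runs_dir = root / "runs"
    # Entry names are listed as-is; no assumption is made about script names.
    scripts = sorted(p.name for p in runs_dir.iterdir()) if runs_dir.is_dir() else []
    return {"data_present": data_dir.is_dir(), "run_scripts": scripts}


if __name__ == "__main__":
    status = check_layout()
    print("data/ found:", status["data_present"])
    print("entries in runs/:", status["run_scripts"])
```

Run it from the repository root. If data/ is reported as missing, the unzipped folder is likely in the wrong place.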
torch
numpy
pandas
tensorboard
tqdm
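To verify that the packages listed above are available in the current environment, a short check along these lines may be useful. It is a sketch only: the helper `missing_packages` is a hypothetical name, and no specific package versions are assumed because none are stated here.

```python
from importlib.util import find_spec

# Import names taken from the requirements list above.
REQUIRED = ["torch", "numpy", "pandas", "tensorboard", "tqdm"]


def missing_packages(names=REQUIRED):
    """Return the names that cannot be located in the current environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are available.")
```

`find_spec` only locates each package without importing it, so the check stays fast even for heavy dependencies such as torch.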
If you use our code, please cite the paper:
@article{wang2024efficient,
title={An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation},
author={Wang, Jun and Wu, Likang and Liu, Qi and Yang, Yu},
journal={arXiv preprint arXiv:2408.08047},
year={2024}
}