Evidential Conservative Q-learning (ECQL)

Introduction

Reinforcement learning (RL) has been leveraged in recommender systems (RS) to capture users' evolving preferences and continuously improve the quality of recommendations. In this paper, we propose a novel evidential conservative Q-learning framework (ECQL) that learns an effective and conservative recommendation policy by integrating evidence-based uncertainty and conservative learning. ECQL conducts evidence-aware exploration to discover items that lie beyond current observations but reflect users' long-term interests. It also provides an uncertainty-aware conservative view on policy evaluation to discourage deviating too far from users' current interests. The two central components of ECQL are a uniquely designed sequential state encoder and a novel conservative evidential-actor-critic (CEAC) module. The former generates the current state of the environment by aggregating historical information with a sliding window that contains the current user interactions as well as newly recommended items from RL exploration that may represent future interests. The latter performs evidence-based rating prediction by maximizing the conservative evidential Q-value and leverages an uncertainty-aware ranking score to explore the item space for more diverse and valuable recommendations. Experiments on multiple real-world dynamic datasets demonstrate the state-of-the-art performance of ECQL and its capability to capture users' long-term interests.
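To make the idea of an uncertainty-aware ranking score more concrete, here is a minimal sketch. It is not taken from this repository: it assumes the common evidential formulation in which non-negative evidence per rating level defines a Dirichlet distribution (alpha = evidence + 1) and vacuity is K divided by the total Dirichlet strength, and it assumes a simple additive score (expected rating plus a weighted vacuity). The names `expected_rating_and_vacuity`, `rank_items`, and `explore_weight` are hypothetical and chosen for illustration only.

```python
# Illustrative sketch only; the function names, the additive score, and
# explore_weight are assumptions, not the ECQL implementation.

def expected_rating_and_vacuity(evidence):
    """Summarize per-rating evidence for one item.

    evidence: non-negative values, one per rating level 1..K.
    Returns (expected rating, vacuity), with vacuity in (0, 1].
    """
    k = len(evidence)
    alpha = [e + 1.0 for e in evidence]        # Dirichlet parameters
    strength = sum(alpha)                      # total Dirichlet strength
    probs = [a / strength for a in alpha]      # expected rating probabilities
    expected = sum((i + 1) * p for i, p in enumerate(probs))
    vacuity = k / strength                     # equals 1.0 when there is no evidence
    return expected, vacuity


def rank_items(item_evidence, explore_weight=1.0, top_n=2):
    """Rank items by expected rating plus an exploration bonus for vacuity."""
    scored = []
    for item_id, evidence in item_evidence.items():
        expected, vacuity = expected_rating_and_vacuity(evidence)
        scored.append((expected + explore_weight * vacuity, item_id))
    scored.sort(reverse=True)                  # highest score first
    return [item_id for _, item_id in scored[:top_n]]


if __name__ == "__main__":
    items = {
        "well_known_hit": [0, 0, 0, 0, 20],    # strong evidence for rating 5
        "unseen_item": [0, 0, 0, 0, 0],        # no evidence at all
    }
    print(rank_items(items, explore_weight=1.0))
    print(rank_items(items, explore_weight=3.0))
```

With a small `explore_weight`, the item with strong evidence for a high rating ranks first; a larger weight moves the item with no evidence to the top, which is the exploration behavior the paragraph describes. How ECQL actually combines the predicted rating and the uncertainty is defined in the paper and the repository code, not in this sketch.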

Dataset

Inference

# To reproduce the results in our paper, just run the following file for both training and evaluation.

python evid_main_train_test.py

.idea
data
images
README.md
actor.py
bar_plot.py
bar_rating_genre_plot.py
count_plots.py
critic.py
cumulative_reward_plot.py
distribution.png
embedding.py
envs.py
evid_main_train_test.py
evidence_network.py
model.jpg
model.png
recommender.py
replay_buffer.py
replay_memory.py
result.py
result_main copy.py
result_main.png
result_main.py
result_main_pgd.png
result_main_spline.png
rnn_network.py
state_representation.py
tree.py
