AURL

This is the code for the ICASSP2023 Paper "An Asynchronous Updating Reinforcement Learning Framework for Task-oriented Dialog System". Link

Abstract

Reinforcement learning has been applied to train the dialog systems in many works. Previous approaches divide the dialog system into multiple modules including DST (dialog state tracking) and DP (dialog policy), and train these modules simultaneously. However, different modules influence each other during training. The errors from DST might misguide the dialog policy, and the system action brings extra difficulties for the DST module. To alleviate this problem, we propose Asynchronous Updating Reinforcement Learning framework (AURL) that updates the DST module and the DP module asynchronously under a cooperative setting. Furthermore, curriculum learning is implemented to address the problem of unbalanced data distribution during reinforcement learning sampling, and multiple user models are introduced to increase the dialog diversity. Results on the public SSD-PHONE dataset show that our method achieves a compelling result with a 31.37% improvement on the dialog success rate.

Dataset

Download dataset in Link, put the SSD_phone dataset into 'data' dir.

Requirements

torch
tqdm
numpy
sklearn

Train

Pretrain system model

python run.py --mode=sl_sys --device=cuda:0

Pretrain user model

python run.py --mode=sl_user --device=cuda:0

RL train

python run.py --simulator_num=2 --device=cuda:0

The template file is currently being applied for publication.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
src		src
Readme.md		Readme.md
pretrain_user.py		pretrain_user.py
requirements.txt		requirements.txt
run.py		run.py
supervised_learning.py		supervised_learning.py
test_and_analyse.py		test_and_analyse.py
test_diversity.py		test_diversity.py
train_one_to_many.py		train_one_to_many.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

src

src

Readme.md

Readme.md

pretrain_user.py

pretrain_user.py

requirements.txt

requirements.txt

run.py

run.py

supervised_learning.py

supervised_learning.py

test_and_analyse.py

test_and_analyse.py

test_diversity.py

test_diversity.py

train_one_to_many.py

train_one_to_many.py

Repository files navigation

AURL

Abstract

Dataset

Requirements

Train

Pretrain system model

Pretrain user model

RL train

About

Releases

Packages

Languages

shunjiu/AURL

Folders and files

Latest commit

History

Repository files navigation

AURL

Abstract

Dataset

Requirements

Train

Pretrain system model

Pretrain user model

RL train

About

Resources

Stars

Watchers

Forks

Languages