I-SEE

High-Quality Diversification for Task-Oriented Dialogue Systems

We follow the code structure of GDPL, but modified files for our needs.

Requirements

python 3.6

pip install -r requirements.txt

Pre-train dialogue policy

python main.py --pretrain --save_dir model

Pre-train world models

python main.py --pretrain_world --save_dir model

RL training

DQN

PPO

python main_vanilla_ppo.py --process=8 --load_model=model/best --lr_rl=1e-4 --lr_irl=1e-4 --epoch=16 --ensemble_size=5 --sim_ratio=0.05 --horizon=5 --save_dir=model_rl

GDPL

python main.py --process=8 --load_model=model/best --lr_rl=1e-4 --lr_irl=1e-4 --epoch=16 --ensemble_size=5 --sim_ratio=0.2 --horizon=5 --save_dir=model_rl

Evaluation

Citation

@inproceedings{traj_acl_2021,
    title = "High-Quality Dialogue Diversification by Intermittent Short Extension Ensemble",
    author = "Tang, Zhiwen  and
      Kulkarni, Hrishikesh  and
      Hui Yang, Grace",
    booktitle = "Proceedings of The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021) (Findings of ACL).",
    year = "2021",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
agenda.py		agenda.py
config.py		config.py
datamanager.py		datamanager.py
dbquery.py		dbquery.py
estimator.py		estimator.py
goal_generator.py		goal_generator.py
main.py		main.py
main_vanilla_ppo.py		main_vanilla_ppo.py
metrics.py		metrics.py
ppo.py		ppo.py
ppo_wm.py		ppo_wm.py
requirements.txt		requirements.txt
rlmodule.py		rlmodule.py
tracker.py		tracker.py
utils.py		utils.py
vanilla_ppo.py		vanilla_ppo.py
world_model.py		world_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

I-SEE

Requirements

Pre-train dialogue policy

Pre-train world models

RL training

DQN

PPO

GDPL

Evaluation

Citation

About

Releases

Packages

Languages

License

smt-HS/I-SEE

Folders and files

Latest commit

History

Repository files navigation

I-SEE

Requirements

Pre-train dialogue policy

Pre-train world models

RL training

DQN

PPO

GDPL

Evaluation

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages