Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management

This is the codebase for paper: "Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management".

To reproduce our results, you should follow the following two steps:

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
convlab_repo		convlab_repo
gan_v_parallel_finish		gan_v_parallel_finish
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback