DialogVCS

The data and official implementation of NAACL 2024 paper: DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade

Overview

In the constant updates of the product dialogue systems, we need to retrain the natural language understanding (NLU) model as new data from the real users would be merged into the existing data accumulated in the last updates. Within the newly added data, new intents would emerge and might have semantic entanglement with the existing intents, e.g. new intents that are semantically too specific or generic are actually a subset or superset of some existing intents in the semantic space, thus impairing the robustness of the NLU model. As the first attempt to solve this problem, we setup a new benchmark consisting of 4 Dialogue Version Control dataSets (DialogVCS). We formulate the intent detection with imperfect data in the system update as a multi-label classification task with positive but unlabeled intents, which asks the models to recognize all the proper intents, including the ones with semantic entanglement, in the inference. We also propose comprehensive baseline models and conduct in-depth analyses for the benchmark, showing that the semantically entangled intents can be effectively recognized with an automatic workflow.

Data

Our proposed data could be found at ./data

Citation

If you find DialogVCS useful for your research and applications, please kindly cite using this BibTeX:

@article{cai2023dialogvcs,
  title={Dialogvcs: Robust natural language understanding in dialogue system upgrade},
  author={Cai, Zefan and Zheng, Xin and Liu, Tianyu and Wang, Xu and Meng, Haoran and Han, Jiaqi and Yuan, Gang and Lin, Binghuai and Chang, Baobao and Cao, Yunbo},
  journal={arXiv preprint arXiv:2305.14751},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
model		model
training		training
utils		utils
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DialogVCS

Overview

Data

Citation

About

Releases

Packages

Languages

License

Zefan-Cai/DialogVCS

Folders and files

Latest commit

History

Repository files navigation

DialogVCS

Overview

Data

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages