
Deep Context Modeling for Multi-turn Response Selection in Dialogue Systems


Code for the paper Deep Context Modeling for Multi-turn Response Selection in Dialogue Systems.

Instructions

Dataset

Since the datasets are quite large and exceed the GitHub file size limit, we only upload part of the data as examples. Do not forget to switch to the full data directory after you download the complete data.

  1. Datasets can be downloaded from the Ubuntu dataset, Douban dataset, and ECD dataset links.
  2. Unzip the dataset and put the data directory into data/; a minimal shell sketch follows this list.
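
For example, assuming the downloaded archive is named ubuntu_data.zip (an assumption; the actual archive name depends on the dataset link you use):

unzip ubuntu_data.zip -d data/
ls data/ubuntu_data   # should contain train.txt and the other split files referenced by the commands below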

NUP pre-training

The steps to further pre-train BERT with the NUP strategy are described below. We also provide the NUP language model trained on the Ubuntu training set; it can be downloaded here:

https://www.dropbox.com/s/d1earb9ta6drqoy/ubuntu_nup_bert_base.zip?dl=0

You can unzip the model, put it into the ubuntu_nup_bert_base directory, and then use it during model training.
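
One possible way to fetch and unpack it (wget and the local file name are just one option; any download method works, and you may need to change dl=0 to dl=1 in the URL to get a direct download):

wget "https://www.dropbox.com/s/d1earb9ta6drqoy/ubuntu_nup_bert_base.zip?dl=1" -O ubuntu_nup_bert_base.zip
unzip ubuntu_nup_bert_base.zip -d ubuntu_nup_bert_base/   # adjust the target directory if the archive already contains a top-level folder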

  1. Run make_lm_data.py to process the original training data into a single file with one sentence (utterance) per line and one blank line between documents (dialogue contexts); an example of the output format is shown after the command.
python nup_lm_finetuning/make_lm_data.py \
--data_file ../data/ubuntu_data/train.txt \
--output_file ../data/ubuntu_data/lm_train.txt
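
The resulting lm_train.txt should look like the excerpt below (the utterances are made up for illustration): one utterance per line, with a blank line separating dialogue contexts.

hi , my usb drive will not mount
which ubuntu version are you on ?
12.04 , it worked fine yesterday

how do i upgrade the kernel ?
have you tried a dist-upgrade ?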
  2. Use pregenerate_training_data_NUP.py to pre-process the data into training examples following the NUP methodology.
python nup_lm_finetuning/pregenerate_training_data_NUP.py \
--train_corpus ../data/ubuntu_data/lm_train.txt \
--bert_model bert-base-uncased \
--do_lower_case \
--output_dir ../data/ubuntu_data/ubuntu_NUP \
--epochs_to_generate 1 \
--max_seq_len 256
  3. Train on the pregenerated data using finetune_on_pregenerated.py, pointing it to the folder created by pregenerate_training_data_NUP.py.
python nup_lm_finetuning/finetune_on_pregenerated.py \
--pregenerated_data ../data/ubuntu_data/ubuntu_NUP \
--bert_model bert-base-uncased \
--train_batch_size 12 \
--reduce_memory \
--do_lower_case \
--output_dir ubuntu_finetuned_lm \
--epochs 1

Model training

  1. Train a model

    Change the --bert_model parameter to the path of the NUP-pretrained language model if needed, e.g. ubuntu_nup_bert_base for the Ubuntu dataset. To evaluate on the test set, pass --do_eval instead of --do_train (see Evaluation below).

    An example:

python run_IE_CoAtt_CNN_DCM.py \
--data_dir data/ubuntu_data \
--task_name ubuntu \
--train_batch_size 64 \
--eval_batch_size 64 \
--max_seq_length 384 \
--max_utterance_num 20 \
--bert_model bert-base-uncased \
--cache_flag ubuntu \
--learning_rate 3e-5 \
--num_train_epochs 2 \
--do_train \
--do_lower_case \
--output_dir experiments/ubuntu
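
If you are starting from the NUP-pretrained checkpoint, only --bert_model changes (this sketch assumes the model was unzipped into ubuntu_nup_bert_base as described above; the output directory name experiments/ubuntu_nup is arbitrary):

python run_IE_CoAtt_CNN_DCM.py \
--data_dir data/ubuntu_data \
--task_name ubuntu \
--train_batch_size 64 \
--eval_batch_size 64 \
--max_seq_length 384 \
--max_utterance_num 20 \
--bert_model ubuntu_nup_bert_base \
--cache_flag ubuntu \
--learning_rate 3e-5 \
--num_train_epochs 2 \
--do_train \
--do_lower_case \
--output_dir experiments/ubuntu_nup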
  2. Evaluation
python run_IE_CoAtt_CNN_DCM.py \
--data_dir data/ubuntu_data \
--task_name ubuntu \
--train_batch_size 64 \
--eval_batch_size 64 \
--max_seq_length 384 \
--max_utterance_num 20 \
--bert_model bert-base-uncased \
--cache_flag ubuntu \
--learning_rate 3e-5 \
--num_train_epochs 2 \
--do_eval \
--do_lower_case \
--output_dir experiments/ubuntu

Requirements

Python 3.6 + PyTorch 1.0.1
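
A possible environment setup (the environment name and package list are assumptions based on the scripts in this repo, not an official requirements file):

conda create -n dcm python=3.6
conda activate dcm
pip install torch==1.0.1   # or install PyTorch 1.0.1 following the official instructions for your CUDA version
pip install pytorch-pretrained-bert   # assumed: the BERT fine-tuning scripts follow the pytorch-pretrained-bert examples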
