
PyTorch implementations of a sequence-to-sequence (seq2seq) model and an HRED model


rajatguptakgp/pretrained_dialog_system


Improving Dialog Systems with Pretrained Models


Overview

This work explores whether pretraining can improve dialog systems in terms of context relevance and natural language generation.

Results

Description       Non-pretrained HRED   Pretrained HRED
Epochs            30                    60
Training PPL      61.67                 5.53
Validation PPL    278.75                813.49
Test PPL          284.11                593.11
BLEU              0.03                  0.22

(PPL = perplexity; lower is better. For BLEU, higher is better.)
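
These perplexity numbers are presumably the exponential of the mean token-level cross-entropy loss, which is the standard definition; the repository's exact evaluation code may differ. A minimal sketch under that assumption (perplexity is a hypothetical helper, not a function from this repository):

import math
import torch.nn.functional as F

def perplexity(logits, targets, pad_idx):
    # logits: (num_tokens, vocab_size); targets: (num_tokens,) token ids.
    # Mean cross-entropy over non-padding tokens, exponentiated.
    loss = F.cross_entropy(logits, targets, ignore_index=pad_idx)
    return math.exp(loss.item())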

From the generated dialog responses, it appears that pretraining largely resolves the problem of natural language generation, while context relevance remains an issue.

Dependencies

  • PyTorch
  • TorchText
  • spaCy

Installation

  • To install PyTorch, see installation instructions on the PyTorch website.
  • To install TorchText: pip install torchtext
  • To install spaCy:
pip install -U spacy
python -m spacy download en_core_web_sm
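
As a quick sanity check after installation (a hypothetical snippet, not part of the repository), confirm that all three dependencies import and that the spaCy model was downloaded:

import torch
import torchtext
import spacy

print(torch.__version__, torchtext.__version__)
nlp = spacy.load("en_core_web_sm")  # raises OSError if the model is missing
print([token.text for token in nlp("Dialog systems are fun!")])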

Dataset

This work uses the DailyDialog dataset.

Data Preprocessing

In its original form, the data is a raw text file with __eou__ (end of utterance) as the delimiter between any two utterances.
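
For example, a single raw line can be split into its utterances as follows (a hypothetical snippet with a made-up line, not taken from the dataset):

# One dialog per raw line; utterances separated by the __eou__ token.
line = "Hi , how are you ? __eou__ I am fine , thanks . __eou__ Great to hear . __eou__"
utterances = [u.strip() for u in line.split("__eou__") if u.strip()]
print(utterances)  # ['Hi , how are you ?', 'I am fine , thanks .', 'Great to hear .']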

For Vanilla Seq2Seq

  • Use the make_data_vseq2seq.py script to process the data splits (train, validation and test) into a CSV file.
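
For intuition, vanilla seq2seq training typically pairs each utterance with the one that follows it as a (source, target) example. Below is a minimal sketch of that step under this assumption; the column names and the dialogs_to_pairs_csv helper are illustrative, not the script's actual interface:

import csv

def dialogs_to_pairs_csv(dialogs, out_path):
    # dialogs: a list of dialogs, each a list of utterance strings.
    # Every consecutive utterance pair becomes one (src, trg) row.
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["src", "trg"])
        for utterances in dialogs:
            for src, trg in zip(utterances, utterances[1:]):
                writer.writerow([src, trg])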

For HRED

  • Use the make_data_hred.py script to process the data splits (train, validation and test) into a CSV file. The script also creates additional samples from each dialog, both because responses tend to be generic and because the number of dialogs is small, which on its own may not be enough to train the model effectively.
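
A common way to expand one dialog into several training samples is to emit one (context, response) pair per turn, with the context accumulating all preceding utterances. The sketch below illustrates that idea; the exact expansion in make_data_hred.py may differ, and expand_dialog is a hypothetical helper:

def expand_dialog(utterances, sep=" __eou__ "):
    # One (context, response) sample per turn: the context is the
    # concatenation of all utterances preceding the response.
    samples = []
    for i in range(1, len(utterances)):
        samples.append((sep.join(utterances[:i]), utterances[i]))
    return samples

# A four-utterance dialog yields three samples with progressively longer context.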

Reference