An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation

Training

To train a single-decoder model, use base_trainer.py:

python base_trainer.py \
    --train-path=path-to-the-training-csv-file \
    --val-path=path-to-the-validation-csv-file \
    --model-str=t5-small

The resulting checkpoint serves as the initial checkpoint for multi-decoder training. For Weibo, set --model-str=uer/t5-small-chinese-cluecorpussmall and additionally pass --language=zh and --multi-ref.
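As an illustration of the Weibo settings, the flags can be collected in one variable and appended to the base training command. This is a minimal sketch, not a command taken from the repository: the variable name WEIBO_FLAGS is hypothetical, the paths are the same placeholders used above, and it assumes the Weibo --model-str replaces t5-small rather than being passed alongside it. The command is only printed, so nothing runs until the leading echo is removed.

```shell
# Weibo-specific flags named in this README, kept in one place (hypothetical variable name).
WEIBO_FLAGS="--model-str=uer/t5-small-chinese-cluecorpussmall --language=zh --multi-ref"

# Dry run: prints the command. Remove the leading `echo` to start training.
# $WEIBO_FLAGS is intentionally unquoted so that it splits into separate flags.
echo python base_trainer.py \
    --train-path=path-to-the-training-csv-file \
    --val-path=path-to-the-validation-csv-file \
    $WEIBO_FLAGS
```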

To train a multi-decoder model, use the script multi-decoder_trainer.py:

python multi-decoder_trainer.py \
    --train-path=path-to-the-training-csv-file \
    --val-path=path-to-the-validation-csv-file \
    --model-str=t5-small \
    --init-ckpt=path-to-warmstart-ckpt \
    --freeze \
    --num-modes=10 \
    --trainer=eqhem \
    --decoder=adapter \

where

--num-modes specifies the number of decoders.
--trainer specifies the training algorithm:
- eqhem: EqHard-EM
- sem: Soft-EM
- trick-sem: Soft-EM with recurrent dropout trick
- hem: Hard-EM
- trick-hem: Hard-EM with recurrent dropout trick
- random: EqRandom-Fixed
- drandom: EqRandom-Dynamic
--decoder specifies the decoder architecture.
--lp enables learned priors (the prior is uniform by default).
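To compare training algorithms, the same command can be repeated with a different --trainer value each time. The loop below is a sketch under assumptions: the choice of eqhem, hem, and sem is only an example subset of the names listed above, all other flags are held fixed at the values from the multi-decoder command, and the variable names are hypothetical. It prints the commands instead of running them.

```shell
# Example subset of the --trainer values listed above; edit as needed.
TRAINERS="eqhem hem sem"
COUNT=0

for TRAINER in $TRAINERS; do
    # Dry run: prints one command per trainer. Remove `echo` to launch the runs.
    echo python multi-decoder_trainer.py \
        --train-path=path-to-the-training-csv-file \
        --val-path=path-to-the-validation-csv-file \
        --model-str=t5-small \
        --init-ckpt=path-to-warmstart-ckpt \
        --freeze \
        --num-modes=10 \
        --trainer="$TRAINER" \
        --decoder=adapter
    COUNT=$((COUNT + 1))
done
```

Holding --num-modes and --init-ckpt constant across runs keeps the comparison restricted to the training algorithm.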

Monitoring Performance

The training script automatically creates a timestamped logging directory that stores the checkpoints and log files. Validation performance can be monitored during training with TensorBoard:

tensorboard --logdir=path-to-the-timestamped-logging-folder

Continue Training

If validation performance is still improving when training ends, you can resume with the following command:

python base_trainer.py \
    --the-original-arguments-that-you-started-training-with \
    --resume-path=path-to-the-timestamped-logging-folder
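For a concrete picture, here is what resuming might look like for the single-decoder run shown in the Training section. It is a sketch that assumes the original run used exactly those three arguments; ORIGINAL_ARGS and RESUME_DIR are hypothetical variable names, and the paths remain placeholders. The command is printed, not executed.

```shell
# Arguments of the original run (here: the single-decoder example from Training).
ORIGINAL_ARGS="--train-path=path-to-the-training-csv-file --val-path=path-to-the-validation-csv-file --model-str=t5-small"

# Placeholder for the timestamped directory created by that run.
RESUME_DIR=path-to-the-timestamped-logging-folder

# Dry run: prints the command. Remove the leading `echo` to resume training.
# $ORIGINAL_ARGS is intentionally unquoted so that it splits into separate flags.
echo python base_trainer.py \
    $ORIGINAL_ARGS \
    --resume-path="$RESUME_DIR"
```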

Evaluation

After the performance has peaked, you can evaluate the model using evaluate_generations.py:

python evaluate_generations.py --ckpt-path=path-to-the-best-validation-checkpoint --eval-path=path-to-the-test-csv-file

When evaluating on Weibo, additionally pass --language=zh and --multi-ref.
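Put together, a Weibo evaluation call could look like the following. This is only a sketch: WEIBO_EVAL_FLAGS is a hypothetical variable name, the paths are the placeholders from the command above, and the command is printed, not run.

```shell
# The two extra flags this README names for Weibo evaluation.
WEIBO_EVAL_FLAGS="--language=zh --multi-ref"

# Dry run: prints the command. Remove the leading `echo` to run the evaluation.
# $WEIBO_EVAL_FLAGS is intentionally unquoted so that it splits into separate flags.
echo python evaluate_generations.py \
    --ckpt-path=path-to-the-best-validation-checkpoint \
    --eval-path=path-to-the-test-csv-file \
    $WEIBO_EVAL_FLAGS
```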

Datasets

OST, Weibo
