Skip to content
Permalink
Branch: master
Find file Copy path
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
49 lines (28 sloc) 1.31 KB

Structured Fusion Networks

Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016

Code written by Shikib Mehri and Tejas Srinivasan.

Pretrained Models

The main models trained in the paper have been provided and can be used as follows:

Sequence-to-Sequence with Attention:

python3 test.py --model_name models/attn --use_attn --test_only

Structured Fusion Networks:

python3 test.py --model_name models/sfn --structured_fusion --test_only

Structured Fusion Networks with Reinforcement Learning:

python3 test.py --model_name models/sfn_rl --structured_fusion --test_only

Training

Structured Fusion Networks can be trained with the following command:

python3 train.py --model_name 'models/sfn' --structured_fusion --use_cuda True --tune_params True

SFNs can be fine-tuned with RL as follows (you may need to edit the code a bit):

python3 rl_train.py --structured_fusion --model_name models/rl_sfn --lr 0.00001

Questions

Please feel free to send any questions/concerns to amehri@andrew.cmu.edu.

Citing

The following paper should be cited if you use this code.

@article{mehri2019structured,
  title={Structured Fusion Networks for Dialog},
  author={Mehri, Shikib and Srinivasan, Tejas and Eskenazi, Maxine},
  journal={arXiv preprint arXiv:1907.10016},
  year={2019}
}
You can’t perform that action at this time.