# Enhancing AMR-to-Text Generation with Dual Graph Representations

This repository contains the code for the EMNLP-IJCNLP 2019 paper "Enhancing AMR-to-Text Generation with Dual Graph Representations".

The code is experimental software, published solely to provide additional background on the paper.

The project is implemented on top of the OpenNMT-py framework and the PyTorch Geometric library; please refer to their websites for installation details and dependencies.

## Environments and Dependencies

- Python 3
- PyTorch 1.1.0
- PyTorch Geometric 1.3.0
- nltk
- parsimonious
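
The dependencies above can be installed with pip. A minimal sketch, assuming a compatible CUDA setup: PyTorch Geometric 1.3.0 also requires its companion packages (torch-scatter, torch-sparse, torch-cluster), whose exact versions depend on your PyTorch/CUDA combination, so consult the PyTorch Geometric installation guide if the commands below fail:

```bash
# Installation sketch -- the companion-package pins are assumptions;
# match them to your PyTorch/CUDA combination.
pip install torch==1.1.0
pip install torch-scatter torch-sparse torch-cluster
pip install torch-geometric==1.3.0
pip install nltk parsimonious
```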

## Datasets

In our experiments, we use the following datasets: LDC2015E86 and LDC2017T10.

## Preprocess

First, convert each dataset into the format required by the model.

For the LDC2015E86 dataset, run:

```bash
./preprocess_LDC2015E86.sh <dataset_folder> <glove_emb_file>
```

For the LDC2017T10 dataset, run:

```bash
./preprocess_LDC2017T10.sh <dataset_folder> <glove_emb_file>
```
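
For example, assuming the corpus was extracted to `data/LDC2017T10` and GloVe embeddings were downloaded to `embeddings/glove.840B.300d.txt` (both paths are hypothetical), the invocation would look like:

```bash
# Hypothetical paths -- substitute your local dataset and embedding locations.
./preprocess_LDC2017T10.sh data/LDC2017T10 embeddings/glove.840B.300d.txt
```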

## Training

To train the model on the LDC2015E86 dataset, execute:

```bash
./train_LDC2015E86.sh <gpu_id> <gnn_type> <gnn_layers> <start_decay_steps> <decay_steps>
```

For the LDC2017T10 dataset, execute:

```bash
./train_LDC2017T10.sh <gpu_id> <gnn_type> <gnn_layers> <start_decay_steps> <decay_steps>
```

Options for `<gnn_type>` are `ggnn`, `gat`, and `gin`; `<gnn_layers>` is the number of graph encoder layers. Refer to the OpenNMT-py documentation for `<start_decay_steps>` and `<decay_steps>`.

As in Konstas et al. (2017), we lower the learning rate during training: once `<start_decay_steps>` optimizer steps are reached, the learning rate is decayed every `<decay_steps>` steps (by a factor of 0.5, OpenNMT-py's default).

Examples:

```bash
./train_LDC2015E86.sh 0 gin 2 6720 4200
./train_LDC2017T10.sh 0 ggnn 5 14640 10980
```

## Trained models

- GIN-DualGraph trained on LDC2015E86 training set - BLEU on test set: 24.60 (download)
- GAT-DualGraph trained on LDC2015E86 training set - BLEU on test set: 24.98 (download)
- GGNN-DualGraph trained on LDC2015E86 training set - BLEU on test set: 25.01 (download)
- GIN-DualGraph trained on LDC2017T10 training set - BLEU on test set: 28.05 (download)
- GAT-DualGraph trained on LDC2017T10 training set - BLEU on test set: 27.26 (download)
- GGNN-DualGraph trained on LDC2017T10 training set - BLEU on test set: 28.26 (download)

## Decoding

To decode on the test set, run:

```bash
./decode.sh <gpu_id> <model> <nodes_file> <node1_file> <node2_file> <output>
```

Example:

```bash
./decode.sh 0 model_ggnn_ldc2015e86.pt test-amr-nodes.txt test-amr-node1.txt test-amr-node2.txt output-ggnn-test-ldc2015e86.txt
```
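
To reproduce the BLEU scores reported above, the decoder output can then be scored against the tokenized test references, for instance with the `multi-bleu.perl` script that OpenNMT-py ships in its `tools/` directory (the reference file name below is hypothetical):

```bash
# test-surface.txt stands in for the tokenized reference sentences.
perl tools/multi-bleu.perl test-surface.txt < output-ggnn-test-ldc2015e86.txt
```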

## More

For more details regarding hyperparameters, please refer to OpenNMT-py and PyTorch Geometric.

Contact person: Leonardo Ribeiro, ribeiro@aiphes.tu-darmstadt.de

## Citation

```bibtex
@inproceedings{ribeiro-etal-2019-dualgraph,
    title = "Enhancing {AMR}-to-Text Generation with Dual Graph Representations",
    author = "Ribeiro, Leonardo F. R.  and
      Gardent, Claire  and
      Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)",
    month = nov,
    year = "2019",
    address = "Hong Kong, China",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/D19-1314",
    pages = "3174--3185",
}
```