This repository is currently in an early stage of development.
# PyTorch (+ AllenNLP) Reimplementation of the SPIGOT Parser
This repo reimplements a pipelined syntactic-then-semantic dependency parser, trained end-to-end using SPIGOT, the technique proposed in the ACL 2018 paper *Backpropagating through Structured Argmax using a SPIGOT* by Peng et al.
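At the heart of SPIGOT is its backward rule: the forward pass takes a hard argmax, and the backward pass moves the one-hot solution a small step along the downstream gradient, projects it back onto the feasible polytope, and uses the rescaled difference as a surrogate gradient. The sketch below is a hedged illustration only, assuming a single categorical decision and Euclidean projection onto the probability simplex; the actual parser works over structured (tree- and graph-shaped) outputs and is not this repository's code.

```python
def project_onto_simplex(v):
    """Euclidean projection of a vector onto the probability simplex
    (the standard sort-based algorithm)."""
    u = sorted(v, reverse=True)
    cumsum = 0.0
    rho, rho_sum = 0, 0.0
    for i, ui in enumerate(u):
        cumsum += ui
        # Find the largest index whose component stays positive after shifting.
        if ui - (cumsum - 1.0) / (i + 1) > 0:
            rho, rho_sum = i, cumsum
    theta = (rho_sum - 1.0) / (rho + 1)
    return [max(vi - theta, 0.0) for vi in v]

def spigot_grad(z, downstream_grad, eta=1.0):
    """SPIGOT surrogate gradient w.r.t. the argmax output z (a one-hot list):
    take a step along the downstream gradient, project back, and rescale."""
    p = [zi - eta * gi for zi, gi in zip(z, downstream_grad)]
    p_proj = project_onto_simplex(p)
    return [(zi - pi) / eta for zi, pi in zip(z, p_proj)]
```

When the downstream gradient is zero, the projected point equals the argmax solution and the surrogate gradient is zero, as one would expect.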
## Dataset

SemEval 2015 Task 18: Broad-Coverage Semantic Dependency Parsing. The dataset is available from the LDC.
## Installation

Please refer to requirements.txt for the versions of the libraries used in this reproduction.
$ pip install torch allennlp allennlp-models
$ git clone https://github.com/masashi-y/allennlp_spigot
For the SemEval datasets, please use the script in the semeval2015_data directory for preprocessing (I borrowed this from the NeurboParser repo and modified it to work with Python 3). This will create the train/dev/test split files for the respective semantic dependency types, named e.g. english_id_dm_augmented_test.sdp (English, in-domain, DM formalism, augmented with syntactic dependencies, test split).
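The naming convention above can be unpacked mechanically. The helper below is not part of the repo, just a sketch; in particular, the "ood" abbreviation for out-of-domain files is an assumption, not confirmed by the repository.

```python
def parse_sdp_filename(name):
    """Unpack a preprocessed SDP file name such as
    'english_id_dm_augmented_test.sdp' into its components."""
    stem = name[: -len(".sdp")]
    language, domain, formalism, augmented, split = stem.split("_")
    return {
        "language": language,
        # "ood" for out-of-domain is a guess; only "id" is shown in the README.
        "domain": {"id": "in-domain", "ood": "out-of-domain"}.get(domain, domain),
        "formalism": formalism.upper(),   # dm / pas / psd
        "augmented": augmented == "augmented",
        "split": split,                   # train / dev / test
    }
```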
## Training

For training, first configure the paths in configs/syntactic_then_semantic_dependencies.jsonnet, and then run (in the allennlp_spigot directory):
$ allennlp train --include-package spigot --serialization-dir results configs/syntactic_then_semantic_dependencies.jsonnet
## Prediction

When using a *.sdp file as input, with the annotated POS tags:
$ allennlp predict --use-dataset-reader --predictor semantic_dependencies_predictor --include-package spigot --silent --output-file system.sdp results/model.tar.gz english_id_dm_augmented_test.sdp
When using raw text as input, with POS tags predicted by a spaCy model (default: en_core_web_sm):
$ cat input.jsonl
{"sentence": "this is an example sentence."}
$ allennlp predict --predictor semantic_dependencies_predictor --include-package spigot --silent --output-file system.sdp results/model.tar.gz input.jsonl
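The JSON-lines input for the raw-text predictor can be generated with a few lines of Python. This is only a sketch of the one-object-per-line format shown above; the file name matches the example, and the second sentence is made up.

```python
import json

# One {"sentence": ...} object per line, as expected by the predictor.
sentences = ["this is an example sentence.", "here is another one."]
with open("input.jsonl", "w", encoding="utf-8") as f:
    for s in sentences:
        f.write(json.dumps({"sentence": s}) + "\n")
```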
## Results

In-domain results (UF/LF):

| Model | DM | PAS | PSD |
|---|---|---|---|
| Pipeline | 88.93/87.64 | 91.56/90.48 | 86.44/74.41 |
| SPIGOT | 89.11/87.89 | 91.54/90.38 | 86.45/73.92 |
Out-of-domain results (UF/LF):

| Model | DM | PAS | PSD |
|---|---|---|---|
| Pipeline | 83.03/81.08 | 87.19/85.46 | 80.97/68.79 |
| SPIGOT | 82.88/80.64 | 87.14/85.48 | 80.75/67.84 |
## Notes

- The use of AD3 for decoding semantic dependencies is left as future work; the current implementation simply outputs the edges whose probabilities exceed a predefined threshold (default: 0.5) and assigns the most probable label to each.
- Accordingly, training minimizes the negative log probabilities of these edges and labels, instead of using the SSVM loss.
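The thresholded decoding described above can be sketched as follows. This is an assumed, simplified version for illustration, not the repository's actual code; the probability-table layout is hypothetical.

```python
def decode_edges(edge_probs, label_probs, threshold=0.5):
    """Keep every edge whose probability exceeds the threshold and label it
    with its most probable tag.

    edge_probs[h][d]: probability of an edge from head h to dependent d.
    label_probs[h][d]: dict mapping label -> probability for that edge.
    """
    edges = []
    for h, row in enumerate(edge_probs):
        for d, p in enumerate(row):
            if p > threshold:
                label = max(label_probs[h][d], key=label_probs[h][d].get)
                edges.append((h, d, label))
    return edges
```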
## Citation

```bibtex
@inproceedings{peng-etal-2018-backpropagating,
    title = "Backpropagating through Structured Argmax using a {SPIGOT}",
    author = "Peng, Hao and
      Thomson, Sam and
      Smith, Noah A.",
    booktitle = "Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = jul,
    year = "2018",
    address = "Melbourne, Australia",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/P18-1173",
    doi = "10.18653/v1/P18-1173",
    pages = "1863--1873",
    abstract = "We introduce structured projection of intermediate gradients (SPIGOT), a new method for backpropagating through neural networks that include hard-decision structured predictions (e.g., parsing) in intermediate layers. SPIGOT requires no marginal inference, unlike structured attention networks and reinforcement learning-inspired solutions. Like so-called straight-through estimators, SPIGOT defines gradient-like quantities associated with intermediate nondifferentiable operations, allowing backpropagation before and after them; SPIGOT{'}s proxy aims to ensure that, after a parameter update, the intermediate structure will remain well-formed. We experiment on two structured NLP pipelines: syntactic-then-semantic dependency parsing, and semantic parsing followed by sentiment classification. We show that training with SPIGOT leads to a larger improvement on the downstream task than a modularly-trained pipeline, the straight-through estimator, and structured attention, reaching a new state of the art on semantic dependency parsing.",
}
```