Frame-Semantic Parsing with BERT Extracted Word and Span Embeddings

Romy Zilkha, Jonathan Schwartz, Jonathan Bofman, Gabriel Clinger

This project is inspired by open-SESAME, the frame-semantic parser that uses a syntactic scaffolding approach. We build on the existing scaffolding approach by incorporating an attention mechanism into two distinct embedding phases (words and spans). We use the Hugging Face implementation of BERT to integrate the attention mechanism into the existing argument identification model.
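
The two embedding phases can be illustrated with a small sketch. It assumes that sub-word vectors (for example, one BERT layer's output) are mean-pooled into word vectors, and that word vectors are mean-pooled into a span vector. Mean pooling, the function names, and the toy input are assumptions made for illustration and may not match what the code in this repository does.

```python
import numpy as np

def word_embeddings(subword_vecs, word_ids):
    """Average the sub-word vectors that belong to each word.

    subword_vecs: (num_subwords, hidden) array, e.g. one BERT layer's output.
    word_ids: for every sub-word, the index of the word it came from.
    """
    num_words = max(word_ids) + 1
    hidden = subword_vecs.shape[1]
    sums = np.zeros((num_words, hidden))
    counts = np.zeros(num_words)
    for vec, w in zip(subword_vecs, word_ids):
        sums[w] += vec
        counts[w] += 1
    return sums / counts[:, None]

def span_embedding(word_vecs, start, end):
    """Mean-pool the word vectors of the inclusive span [start, end]."""
    return word_vecs[start:end + 1].mean(axis=0)

# Toy input (hypothetical): 4 sub-words with 3-dimensional vectors forming 3 words
subword_vecs = np.array([[1.0, 0.0, 0.0],
                         [3.0, 0.0, 0.0],
                         [0.0, 2.0, 0.0],
                         [0.0, 0.0, 4.0]])
word_ids = [0, 0, 1, 2]

words = word_embeddings(subword_vecs, word_ids)  # one vector per word
span = span_embedding(words, 1, 2)               # one vector for words 1..2
```

In a real run the sub-word vectors would come from the BERT model and the word alignment from its tokenizer; summing or concatenating several layers are common alternatives to using a single layer.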

Installation

$ pip install dynet==2.1
$ pip install nltk==3.5
$ python -m nltk.downloader averaged_perceptron_tagger wordnet

Data Preprocessing

$ python -m sesame.preprocess

Training

To train a model, run the following command (currently only the token version, argid_token_bert.py, works). Specify in the code itself (line 143) whether to use PCA or autoencoders for the vector dimensionality reduction. The output of training is the trained model and a predicted conll file under logs/$MODEL_NAME/best-$MODEL-1.7-model.

$ python -m sesame.argid_token_bert --mode train --model_name $MODEL_NAME
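
For readers unfamiliar with the PCA option, the following is a minimal sketch of PCA-based dimensionality reduction using NumPy's SVD. It is not the code at line 143; the function name `pca_reduce`, the random stand-in data, and the target size of 16 are assumptions chosen for illustration (768 is the hidden size of BERT-base).

```python
import numpy as np

def pca_reduce(vectors, n_components):
    """Project row vectors onto their top principal components."""
    mean = vectors.mean(axis=0)
    centered = vectors - mean
    # Rows of vt are the principal directions, ordered by explained variance
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    return centered @ components.T, components, mean

# Hypothetical stand-in for 50 BERT token vectors of size 768
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(50, 768))

reduced, components, mean = pca_reduce(embeddings, n_components=16)
```

The returned `components` and `mean` would need to be reused to project unseen vectors at prediction time, so that training and evaluation share the same projection. An autoencoder serves the same purpose with a learned, possibly nonlinear, mapping.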

Evaluation

To evaluate the model, run the following command on the predicted conll file.

$ python -m evaluation /PATH_TO_OPEN_SESAME_FOLDER/sesame/logs/MODEL_NAME/predicted-1.7-argid-dev.conll
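
The internals of the evaluation script are not described here. Argument identification is commonly scored with precision, recall, and F1 over labeled argument spans, and the sketch below shows that computation under the assumption that each argument is a (role, start, end) tuple and that only exact matches count. The function name and the example roles are hypothetical.

```python
def span_prf(gold, predicted):
    """Precision, recall and F1 over exactly matching labeled spans."""
    gold, predicted = set(gold), set(predicted)
    matched = len(gold & predicted)
    precision = matched / len(predicted) if predicted else 0.0
    recall = matched / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if matched else 0.0
    return precision, recall, f1

# Hypothetical arguments for one frame: (role, start token, end token)
gold = [("Agent", 0, 1), ("Theme", 3, 5), ("Place", 7, 8)]
predicted = [("Agent", 0, 1), ("Theme", 3, 4)]

p, r, f = span_prf(gold, predicted)
```

Here one of the two predicted spans matches one of the three gold spans exactly; the second prediction has the right role but the wrong end token, so it does not count as a match.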

References

Frame-Semantic Parsing With Softmax-Margin Segmental RNNs and a Syntactic Scaffold paper - https://arxiv.org/pdf/1706.09528.pdf
open-SESAME - https://github.com/swabhs/open-sesame
Extracting BERT word and span representation - https://mccormickml.com/2019/05/14/BERT-word-embeddings-tutorial/#3-extracting-embeddings
Huggingface BERT model - https://huggingface.co/transformers/model_doc/bert.html
Autoencoders - https://towardsdatascience.com/dimensionality-reduction-pca-versus-autoencoders-338fcaf3297d
