Source code of ACL2022 short paper: Hierarchical Curriculum Learning for AMR Parsing
The sequence-to-sequence models have become a main stream paradigm for AMR parsing. However, there exists a gap between their flat training objective (i.e., equally treats all output tokens) and the hierarchical AMR structure, which limits the ability of model to learn the inherent hierarchical structure of AMR. To bridge this gap, we propose a Hierarchical Curriculum Learning (HCL) framework with Structure-level (SC) and Instance-level Curricula (IC). SC switches progressively from core to detail AMR semantic elements while IC transits from structure-simple to -complex AMR instances during training.
pip install -r requirements.txt
pip install -e .
Modify the data path in configs/HCL.yaml
. Then run:
bash bash/train.sh [gpu id]
bash bash/predict.sh [gpu_id] [AMR dataset version (e.g., 2)] [checpoint path]
use BLINK to post process the output of the model
bash bash/blinkified.sh [gpu_id] [AMR dataset version]
bash bash/fine-eval.sh [AMR dataset version]
To achieve scores in our paper, please use the ORIGIN scripts available at https://github.com/mdtux89/amr-evaluation, which returns results that are around ~0.3 Smatch points lower than those returned by bash bash/fine-eval.sh
If you use our code, please cite our paper:
@inproceedings{wang-etal-2022-hierarchical,
title = "Hierarchical Curriculum Learning for {AMR} Parsing",
author = "Wang, Peiyi and
Chen, Liang and
Liu, Tianyu and
Dai, Damai and
Cao, Yunbo and
Chang, Baobao and
Sui, Zhifang",
booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)",
month = may,
year = "2022",
address = "Dublin, Ireland",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.acl-short.37",
pages = "333--339",
abstract = "Abstract Meaning Representation (AMR) parsing aims to translate sentences to semantic representation with a hierarchical structure, and is recently empowered by pretrained sequence-to-sequence models. However, there exists a gap between their flat training objective (i.e., equally treats all output tokens) and the hierarchical AMR structure, which limits the model generalization. To bridge this gap, we propose a Hierarchical Curriculum Learning (HCL) framework with Structure-level (SC) and Instance-level Curricula (IC). SC switches progressively from core to detail AMR semantic elements while IC transits from structure-simple to -complex AMR instances during training. Through these two warming-up processes, HCL reduces the difficulty of learning complex structures, thus the flat model can better adapt to the AMR hierarchy. Extensive experiments on AMR2.0, AMR3.0, structure-complex and out-of-distribution situations verify the effectiveness of HCL.",
}
Our code is based on SPRING. Thanks for their high quality open codebase.
@inproceedings{bevilacqua-etal-2021-one,
title = {One {SPRING} to Rule Them Both: {S}ymmetric {AMR} Semantic Parsing and Generation without a Complex Pipeline},
author = {Bevilacqua, Michele and Blloshmi, Rexhina and Navigli, Roberto},
booktitle = {Proceedings of AAAI},
year = {2021}
}
- Todo