This is the source code for SACE, built on top of BEM modules.
SACE adds a selective attention layer on top of the original gloss encoder and a sentence selector in front of the context encoder. A try-again mechanism is also applied after training.
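As a rough illustration of the first component, the sketch below shows one way a selective attention layer can pool the gloss encoder's token states into a single sense embedding. It is a minimal sketch, not the exact layer in biencoder-context.py; the module name, tensor names, and shapes are assumptions.

```python
import torch
import torch.nn as nn

class SelectiveAttentionPooling(nn.Module):
    """Illustrative only: score each gloss token and pool by attention."""

    def __init__(self, hidden_size):
        super().__init__()
        self.scorer = nn.Linear(hidden_size, 1)  # scalar relevance score per token

    def forward(self, gloss_hidden, gloss_mask):
        # gloss_hidden: (batch, seq_len, hidden_size) token states from the gloss encoder
        # gloss_mask:   (batch, seq_len), 1 for real tokens, 0 for padding
        scores = self.scorer(gloss_hidden).squeeze(-1)                 # (batch, seq_len)
        scores = scores.masked_fill(gloss_mask == 0, float("-inf"))    # ignore padding
        weights = torch.softmax(scores, dim=-1)                        # attention over gloss tokens
        return torch.bmm(weights.unsqueeze(1), gloss_hidden).squeeze(1)  # (batch, hidden_size)
```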
To run this code, you'll need the following libraries:
- Python 3
- PyTorch 1.6.0
- Pytorch-Transformers 1.2.0
- Transformers 3.2.0
- NumPy 1.17.2
- NLTK 3.5
- tqdm
- apex 0.1
We use the WSD Evaluation Framework to train and evaluate our model; download it before running the commands below.
For cross-lingual datasets, we use mwsd-datasets.
For WordNet Tagged Gloss (WNGT), we use UFSAC.
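If you want to script the data setup, the sketch below downloads and unpacks a zip archive with the standard library. The URL, archive name, and extraction path are placeholders, not values taken from this repository; use the links above.

```python
import urllib.request
import zipfile

def fetch_and_unpack(url, archive_path, dest="."):
    # Download the archive and extract it next to the code.
    urllib.request.urlretrieve(url, archive_path)
    with zipfile.ZipFile(archive_path) as zf:
        zf.extractall(dest)

# Example (placeholder URL and archive name):
# fetch_and_unpack("<WSD Evaluation Framework download link>", "WSD_Evaluation_Framework.zip")
```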
Use the following command to train the base model. Training takes about 6 hours with early stopping (stop after 3 epochs without improvement).
python biencoder-context.py --gloss-bsz 400 --epoch 10 --gloss_max_length 32 --step_mul 50 --warmup 10000 --gloss_mode sense-pred --lr 1e-5 --word word --encoder-name roberta-base --train_mode roberta-base --context_len 2 --train_data semcor --same --sec_wsd
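The early stopping mentioned above follows the usual pattern: keep the best development score seen so far and stop once it has not improved for 3 consecutive epochs. The loop below is a generic sketch with placeholder training and evaluation functions, not the exact logic in biencoder-context.py.

```python
import random

def train_one_epoch():
    pass  # placeholder for one pass over the training data

def evaluate_dev():
    return random.random()  # placeholder for the dev-set score (e.g. F1)

def save_checkpoint():
    pass  # placeholder for persisting the best weights

best_score, stale_epochs, patience = 0.0, 0, 3
for epoch in range(10):
    train_one_epoch()
    dev_score = evaluate_dev()
    if dev_score > best_score:
        best_score, stale_epochs = dev_score, 0
        save_checkpoint()               # keep the best model so far
    else:
        stale_epochs += 1
        if stale_epochs >= patience:
            break                       # 3 epochs without improvement: stop
```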
For the large model, run the following command.
python biencoder-context.py --gloss-bsz 150 --epoch 10 --gloss_max_length 32 --step_mul 50 --warmup 10000 --gloss_mode sense-pred --lr 1e-6 --word word --encoder-name roberta-large --train_mode roberta-large --context_len 2 --train_data semcor --same --sec_wsd
For the large model trained with additional data (SemCor+WNGT+WNE), run the following command.
python biencoder-context.py --gloss-bsz 150 --epoch 10 --gloss_max_length 48 --step_mul 50 --warmup 10000 --gloss_mode sense-pred --lr 1e-6 --word non --encoder-name roberta-large --train_mode roberta-large --context_len 2 --train_data semcor-wngt --same
For the multilingual model trained with additional data (SemCor+WNGT+WNE), run the following command.
python biencoder-context.py --gloss-bsz 400 --epoch 10 --gloss_max_length 48 --step_mul 50 --warmup 10000 --gloss_mode sense-pred --lr 5e-6 --word non --encoder-name xlmroberta-base --train_mode xlmroberta-base --context_len 2 --train_data semcor-wngt
To evaluate the base model, run:
python biencoder-context.py --gloss-bsz 400 --epoch 10 --gloss_max_length 32 --step_mul 50 --warmup 10000 --gloss_mode sense-pred --lr 1e-5 --word word --encoder-name roberta-base --train_mode roberta-base --context_len 2 --train_data semcor --same --sec_wsd --eval
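For a quick sanity check outside the evaluation framework's own scorer, the sketch below computes F1 from a gold key file and a prediction key file. It assumes the usual `instance_id sense_key [sense_key ...]` line format; the file names in the commented example are placeholders.

```python
def load_keys(path):
    # Map each instance id to its set of sense keys.
    keys = {}
    with open(path) as f:
        for line in f:
            parts = line.split()
            if parts:
                keys[parts[0]] = set(parts[1:])
    return keys

def f1_score(gold_path, pred_path):
    gold, pred = load_keys(gold_path), load_keys(pred_path)
    correct = sum(1 for inst, answers in pred.items()
                  if inst in gold and answers & gold[inst])
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    return 2 * precision * recall / (precision + recall) if precision + recall else 0.0

# Example (placeholder file names):
# print(f1_score("ALL.gold.key.txt", "predictions.key.txt"))
```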
This codebase is licensed under the Attribution-NonCommercial 4.0 International license, as found in the LICENSE file.
Results (F1-score, %) on the English all-words WSD evaluation datasets:

Systems | SE2 | SE3 | SE07 | SE13 | SE15 | ALL | N | V | A | R
---|---|---|---|---|---|---|---|---|---|---
SACE-base | 80.9 | 79.1 | 74.7* | 82.4 | 84.6 | 80.9* | 83.2 | 71.1 | 85.4 | 87.9
SACE-large | 82.4 | 81.1 | 76.3* | 82.5 | 83.7 | 81.9* | 84.1 | 72.2 | 86.4 | 89.0
SACE-large+ | 83.6 | 81.4 | 77.8 | 82.4 | 87.3* | 82.9* | 85.3 | 74.2 | 85.9 | 87.3
Please cite:
@inproceedings{wang-wang-2021-word,
title = "Word Sense Disambiguation: Towards Interactive Context Exploitation from Both Word and Sense Perspectives",
author = "Wang, Ming and
Wang, Yinglin",
booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)",
month = aug,
year = "2021",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2021.acl-long.406",
doi = "10.18653/v1/2021.acl-long.406",
pages = "5218--5229"
}