
Interpretability for Morphological Inflection: from Character-level Predictions to Subword-level Rules

This repo contains seq2seq models for the task of morphological inflection and for the extraction of inflection patterns (see the EACL 2021 paper). The system is implemented in PyTorch, with a codebase derived from the OpenNMT-py adaptation for this task at http://github.com/deep-spin/SIGMORPHON2019.

Supported Models

  • chED: character-level encoder-decoder model with sparse two-headed gated attention (see Peters and Martins, ACL 2019); a toy sketch of this attention mechanism follows the figure below
  • chED+subwSELF-ATT: chED model with additional self-attention mechanism over subwords
  • chED+chSELF-ATT: chED model with additional self-attention mechanism over characters

[Figure: chED+subwSELF-ATT model architecture]
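For intuition, here is a minimal, self-contained PyTorch sketch of sparsemax attention with two gated heads, in the spirit of Peters and Martins (2019). This is an illustration only, not this repo's implementation: the function names, the single-query simplification, and the plain dot-product scores are all assumptions.

import torch

def sparsemax(z):
    # Euclidean projection of the scores onto the probability simplex
    # (Martins and Astudillo, 2016): like softmax, but many of the
    # resulting attention weights are exactly zero
    z_sorted, _ = torch.sort(z, dim=-1, descending=True)
    k = torch.arange(1, z.size(-1) + 1, device=z.device, dtype=z.dtype)
    cumsum = z_sorted.cumsum(-1) - 1
    support = (k * z_sorted) > cumsum        # entries that stay nonzero
    k_z = support.sum(-1, keepdim=True)      # support size
    tau = cumsum.gather(-1, k_z - 1) / k_z.to(z.dtype)
    return torch.clamp(z - tau, min=0)

def two_headed_gated_attention(q, lemma_h, tag_h, gate):
    # one sparse head over lemma-character states, one over tag states;
    # a sigmoid gate mixes the two context vectors per decoding step
    c_lemma = sparsemax(q @ lemma_h.T) @ lemma_h
    c_tag = sparsemax(q @ tag_h.T) @ tag_h
    g = torch.sigmoid(gate(q))               # gate value in (0, 1)
    return g * c_lemma + (1 - g) * c_tag

# toy usage with random encoder states (dimensions are arbitrary)
d = 8
q = torch.randn(1, d)                        # one decoder query
lemma_h, tag_h = torch.randn(5, d), torch.randn(3, d)
gate = torch.nn.Linear(d, 1)
context = two_headed_gated_attention(q, lemma_h, tag_h, gate)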

Installation

Install PyTorch (v1.2 or later) and the additional Python requirements:

pip install -r requirements.txt
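PyTorch itself can also be installed from PyPI, e.g. (assuming a standard pip build is sufficient for your setup):

pip install "torch>=1.2"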

Training models

Use the provided train-subw.sh script to train the subword-level model (chED+subwSELF-ATT) and train-ch.sh to train the character-level models (chED, chED+chSELF-ATT). Both scripts take as arguments the paths to the model directory (where trained models will be saved), the model configuration, and the train and development files. In the subword-level setting, an additional segmented form of the data is required, which can be produced with the provided bpe-preprocess.sh script. An example of training all three models:

 
#### PREPARE SEGMENTED DATA ####
bpemerges=1000 # number of BPE merge operations

# pointers to data files - original (3-columns: lemma features inflected form)
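# a hypothetical example line (tab-separated, UniMorph-style tags):
#   walk	V;PST	walked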
train=path/to/train/file
dev=path/to/dev/file
test=path/to/test/file
	
# pointer to corpus to train BPE segmentation
bpe_train_corpus=path/to/BPE/corpus

# pointers to paths for saving segmentation model and segmented files
data_dir_bpe=path/to/save/BPE/model
train_segm=path/to/save/segmented/train/file
dev_segm=path/to/save/segmented/dev/file
test_segm=path/to/save/segmented/test/file

# Train BPE model; the segmentation model is saved under $data_dir_bpe and the segmented files to the paths above
./bpe-preprocess.sh $bpemerges $bpe_train_corpus $data_dir_bpe $train $train_segm $dev $dev_segm $test $test_segm
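# NOTE: the exact segmented output format depends on bpe-preprocess.sh; with
# subword-nmt defaults (an assumption), "walked" might appear as "walk@@ ed"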

#### TRAIN MODELS ####

##### subword-level model (chED+subwSELF-ATT) #####
model_dir_subw=path/to/save/model
./train-subw.sh $model_dir_subw $train $train_segm $dev $dev_segm config/gate-sparse-enc-static-head.yml

##### character-level models (chED, chED+chSELF-ATT) #####
model_dir_ch=path/to/save/model
./train-ch.sh $model_dir_ch $train $dev config/gate-sparse.yml # chED model
./train-ch.sh $model_dir_ch $train $dev config/gate-sparse-enc-static-head.yml # chED+chSELF-ATT model

Evaluating models

Use the provided translate-subw.sh script to evaluate the subword-level model (chED+subwSELF-ATT) and translate-ch.sh to evaluate the character-level models (chED, chED+chSELF-ATT). Both scripts take as arguments the paths to the pretrained models and the test file. In the subword-level setting, the additional segmented form of the data is required. An example of running evaluation for all three models:

beam=1 # beam size for decoding (beam=1 is greedy search)

# pointers to test data - original (3-columns: lemma features inflected form)
test=path/to/test/file

# pointers to segmented test data
test_segm=path/to/segmented/test/file

##### subword-level model (chED+subwSELF-ATT) #####
# pointers to pretrained models
modeldir=/path/to/models
./translate-subw.sh $modeldir $test $test_segm $beam

##### character-level models (chED, chED+chSELF-ATT)  #####
# pointers to pretrained models
modeldir=/path/to/models
./translate-ch.sh $modeldir $test $beam 

Pattern extraction

Inflection patterns can be extracted with the msd2pattern function for 'transformation' patterns and the msd2decision function for 'lemma' patterns, both defined in pattern-extraction/patterns.py. See usage examples in the pattern-extraction/Pattern-extr-* notebooks, which replicate the examples from the paper.
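As a toy illustration of what a suffix-replacement 'transformation' pattern looks like, the sketch below derives one from a (lemma, form) pair. It is a hypothetical simplification for intuition only, not the repo's msd2pattern function.

import os

def toy_transformation_pattern(lemma: str, form: str) -> str:
    # strip the longest common prefix and express the inflection as a
    # suffix-replacement rule (a crude stand-in for the real patterns)
    i = len(os.path.commonprefix([lemma, form]))
    return f"-{lemma[i:] or 'ε'} → -{form[i:] or 'ε'}"

print(toy_transformation_pattern("walk", "walked"))  # -ε → -ed
print(toy_transformation_pattern("sing", "sang"))    # -ing → -ang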

EACL 2021 experiments

Instructions for training and evaluating the models from the paper are provided in commands-eacl-exp-train.sh and commands-eacl-exp-eval.sh. Replicated results may differ from those reported because the seed was not fixed in the original experiments (a behavior inherited from the code base); we therefore provide the best model checkpoints in this repo. In the current implementation, the seed is fixed by default.
