The Sequential and Intensive Weighted Language Modeling for Natural Language Understanding

Note that we used three V100 GPUs and the same hyperparameters reported in the MT-DNN paper for multi-task learning: a learning rate of 5e-5, a batch size of 32, and the Adamax optimizer.

Also, for the fine-tuning stage, we followed the hyperparameter ranges suggested in the SMART paper: a learning rate in {1e-5, 2e-5, 3e-5, 5e-5}, a batch size in {16, 32, 64}, and the Adam optimizer.
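
As a rough illustration of how these settings map onto PyTorch optimizers (a minimal sketch, not the repo's actual training loop; the model below is a placeholder):

    import torch

    # Placeholder network; the actual model is an MT-DNN-style BERT
    # encoder with task-specific heads.
    model = torch.nn.Linear(768, 2)

    # Multi-task learning stage: Adamax with a learning rate of 5e-5.
    mtl_optimizer = torch.optim.Adamax(model.parameters(), lr=5e-5)

    # Fine-tuning stage: Adam, swept over the SMART-suggested grid.
    for lr in (1e-5, 2e-5, 3e-5, 5e-5):
        ft_optimizer = torch.optim.Adam(model.parameters(), lr=lr)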

Citation

Son, S.; Hwang, S.; Bae, S.; Park, S.J.; Choi, J.-H. A Sequential and Intensive Weighted Language Modeling Scheme for Multi-Task Learning-Based Natural Language Understanding. Appl. Sci. 2021, 11, 3095. https://doi.org/10.3390/app11073095

Setup Environment

  1. Python 3.6

    Download and installation reference: https://www.python.org/downloads/release/python-360/


  2. Install requirements

    > pip install -r requirements.txt


Train a SIWLM model

  1. Download data

    > sh download.sh

    For details on downloading the GLUE dataset, see: https://gluebenchmark.com/


  2. Preprocess data

    > sh experiments/glue/prepro.sh


  3. Set the task weights

    In experiments/glue/glue_task_def.yml, you can set the weight of each task.

    We set the initial task weights as {1:1:1:1:1:1:1:1, 3:1:1:1:1:1:1:1, 6:1:1:1:1:1:1:1, 9:1:1:1:1:1:1:1, 12:1:1:1:1:1:1:1, 15:1:1:1:1:1:1:1}, where the first value in each setting is the weight of the central task (see the sketch below).
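
    As a purely hypothetical illustration (the actual field names in glue_task_def.yml follow the MT-DNN task-definition schema and may differ; check the file itself for the real keys), a weighted task entry might look like:

        # Hypothetical excerpt of experiments/glue/glue_task_def.yml;
        # the `task_weight` field name is an assumption.
        mrpc:
          task_weight: 3   # central task, weighted 3x
        cola:
          task_weight: 1
        rte:
          task_weight: 1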


  4. Multi-task learning with the Sequential and Intensive Weighted language modeling method

    Using the provided scripts:

    > sh scripts/run_mtdnn_{task_name}.sh {batch_size}

    We provide an example for MRPC; you can use similar scripts to train on other GLUE tasks.

    > sh scripts/run_mtdnn_mrpc.sh 32


  5. Strip the task-specific layers for fine-tuning

    > python strip_model.py --model_path {multi-task learned model path} --fout {stripped model path}
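
    Conceptually, this step drops the task-specific head weights from the multi-task checkpoint and keeps the shared encoder. A minimal sketch of the idea (not the repo's actual strip_model.py; the checkpoint layout and the "scoring_list" key prefix are assumptions):

        import torch

        # Load the multi-task learned checkpoint on CPU.
        state = torch.load("mtl_model.pt", map_location="cpu")

        # Drop parameters belonging to task-specific heads (assumed key
        # prefix) and keep the shared encoder weights.
        stripped = {k: v for k, v in state.items()
                    if not k.startswith("scoring_list")}

        torch.save(stripped, "stripped_model.pt")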


  6. Fine-tuning

    > sh scripts/run_{task_name}.sh {batch_size}

    We provide an example for MRPC; you can use similar scripts to fine-tune on other GLUE tasks.

    > sh scripts/run_mrpc.sh 32
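
    To sweep the SMART-suggested batch sizes, for instance (a sketch; the learning-rate grid is presumably handled inside the script itself):

        import subprocess

        # Run the MRPC fine-tuning script once per candidate batch size.
        for bs in (16, 32, 64):
            subprocess.run(["sh", "scripts/run_mrpc.sh", str(bs)], check=True)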


Codebase

Our code is based on the MT-DNN repo: https://github.com/namisan/mt-dnn


Contact

For help or issues using SIWLM, please submit a GitHub issue.

For personal communication related to this package, please contact Suhyune Son (handsuhyun@gmail.com), Seonjeong Hwang (tjswjd0228@gmail.com), or Sohyeun Bae (webby1815@gmail.com).
