
Dependency-based Mixture Language Models

Code for the paper Dependency-based Mixture Language Models by Zhixian Yang and Xiaojun Wan, accepted to ACL 2022.

Table of Contents
Setup
Training
Generation
Evaluation

Setup


Dependencies

Install fairseq, and place the code files in the custom folder into the corresponding paths of fairseq's source code.

Download GPT2-base, and place the files in ~/pretrained_model/gpt2-base.
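If "GPT2-base" refers to the standard gpt2 checkpoint on Hugging Face (an assumption on our part; adjust if the repo expects a different file layout, e.g. a fairseq-converted checkpoint), one way to fetch and place the files is this Python sketch:

# Hedged sketch: download the Hugging Face "gpt2" checkpoint and save it where
# the scripts expect it. Adjust if a different file layout is required.
import os
from transformers import GPT2LMHeadModel, GPT2Tokenizer

target_dir = os.path.expanduser("~/pretrained_model/gpt2-base")
GPT2Tokenizer.from_pretrained("gpt2").save_pretrained(target_dir)
GPT2LMHeadModel.from_pretrained("gpt2").save_pretrained(target_dir)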

Then, to run the Transformer-based models' scripts, enter the DPLM-Transformer folder:

cd DPLM-Transformer/

Data preprocess

First, you can download the processed data from here

Then, extract the archive and place its contents in the parent directory data/

For a custom dataset, you can use the HPSG-Neural-Parser to obtain the dependency parse tree for each sentence in the dataset. For the train/valid/test data, rename the dependency head files as train/valid/test.head and place them in data/YOUR_DATASET/dependency (see the sketch at the end of this subsection for a sanity check on the assumed head-file format). Then, preprocess the data for Transformer with fairseq:

sh scripts/preprocess_data.sh

To preprocess the data for GPT-2 with fairseq:

sh scripts/encode_data_gpt2.sh
sh scripts/preprocess_gpt2_data.sh

Please set TEXT in the scripts to the path of your data.
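The exact format of the *.head files is determined by the parser output; assuming one line per sentence with one whitespace-separated integer head index per token (our assumption, not documented here), a quick sanity check that a head file lines up with its text file could look like this:

# Hypothetical sanity check: assumes each line of a .head file holds one integer
# per token of the corresponding sentence. File paths are placeholders.
import sys

def check_heads(text_path, head_path):
    with open(text_path, encoding="utf-8") as ft, open(head_path, encoding="utf-8") as fh:
        for i, (sent, heads) in enumerate(zip(ft, fh), start=1):
            tokens, head_ids = sent.split(), heads.split()
            if len(tokens) != len(head_ids):
                print(f"line {i}: {len(tokens)} tokens vs {len(head_ids)} head indices")
            if any(not h.lstrip("-").isdigit() for h in head_ids):
                print(f"line {i}: non-integer head entry")

if __name__ == "__main__":
    # e.g. python check_heads.py data/YOUR_DATASET/train.txt data/YOUR_DATASET/dependency/train.head
    check_heads(sys.argv[1], sys.argv[2])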

Training


Note: We tested these scripts on a machine with NVIDIA GTX 1080 Ti 11 GB GPUs. If you get OOM errors, try decreasing MAX_TOKEN in the training scripts.

To train base Transformer LM:

sh scripts/train_base_transformer_lm.sh

To train base GPT-2:

sh scripts/train_gpt2_base.sh

To train DM-Transformer, first train Transformer by Dependency Modeling:

sh scripts/train_dependency_decoder.sh

Then fine-tune with MLE:

sh scripts/train_dp_transformer_lm.sh

To train DM-GPT-2:

sh scripts/train_dpgpt2.sh

Please set TEXT in the scripts to the path of your data.

Generation


To sample from Transformer or DM-Transformer:

sh scripts/sampling.sh

To sample from GPT-2 or DM-GPT-2:

sh scripts/gpt2_sampling.sh

To extract generated text from the sample output:

sh scripts/cut_samples.sh
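For a rough picture of what this extraction step does, assuming fairseq-style sampling output where each hypothesis line looks like "H-<id><TAB><score><TAB><text>" (an assumption; the actual cut_samples.sh may differ), a minimal sketch is:

# Sketch only: pull the generated text out of fairseq-style sampling output.
# File names are placeholders.
import sys

def extract_hypotheses(sample_path, out_path):
    with open(sample_path, encoding="utf-8") as fin, open(out_path, "w", encoding="utf-8") as fout:
        for line in fin:
            if line.startswith("H-"):
                parts = line.rstrip("\n").split("\t")  # id, score, text
                if len(parts) >= 3:
                    fout.write(parts[2] + "\n")

if __name__ == "__main__":
    extract_hypotheses(sys.argv[1], sys.argv[2])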

If you want to decode text generated by GPT-2 or DM-GPT-2, run:

sh scripts/decode_gpt2_txt.sh
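Assuming the sampled GPT-2 files contain space-separated GPT-2 BPE token ids (the format fairseq's GPT-2 encoding typically produces; this is an assumption, and the repo's decode_gpt2_txt.sh may work differently), a minimal decoding sketch with the Hugging Face tokenizer is:

# Sketch only: decode space-separated GPT-2 BPE ids back to plain text.
# File names are placeholders.
from transformers import GPT2Tokenizer

tok = GPT2Tokenizer.from_pretrained("gpt2")
with open("samples.bpe.txt", encoding="utf-8") as fin, \
     open("samples.decoded.txt", "w", encoding="utf-8") as fout:
    for line in fin:
        ids = [int(t) for t in line.split()]
        fout.write(tok.decode(ids).strip() + "\n")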

Please set Model in the scripts to the model checkpoint path, and TEXT to the path of your data.

Evaluation


Code (eval_ppl_by_gpt2.py) is used to calculate the GPT-2 perplexity of generated sentences, and it can evaluate all the sentence files in one folder:

python eval_ppl_by_gpt2.py --folderpath FOLDER_FOR_EVALUATION --model_file PATH_TO_GPT2CHECKPOINT
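For reference, the computation involved is roughly the following (a minimal sketch using Hugging Face transformers, not the repo's eval_ppl_by_gpt2.py; the paths and the per-file averaging are assumptions):

# Minimal sketch of sentence-level GPT-2 perplexity; not the repo's script.
import glob, math, torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

model_path = "PATH_TO_GPT2CHECKPOINT"  # placeholder
tok = GPT2Tokenizer.from_pretrained(model_path)
model = GPT2LMHeadModel.from_pretrained(model_path).eval()

@torch.no_grad()
def sentence_ppl(sentence):
    ids = tok(sentence, return_tensors="pt").input_ids
    loss = model(ids, labels=ids).loss  # mean token-level negative log-likelihood
    return math.exp(loss.item())

for path in glob.glob("FOLDER_FOR_EVALUATION/*.txt"):
    ppls = [sentence_ppl(s.strip()) for s in open(path, encoding="utf-8") if s.strip()]
    print(path, sum(ppls) / len(ppls))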

Code (eval_sentences.py) is used to evaluate the automatic metrics for the unconditional text generation task:

python eval_sentences.py --folderpath FOLDER_FOR_EVALUATION 
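The exact metrics are defined in eval_sentences.py; as one illustrative example of a standard automatic metric for unconditional generation, Distinct-n (the ratio of unique n-grams among the generations) can be computed as follows (the file name is a placeholder):

# Illustrative only: Distinct-n over a file of generated sentences, one per line.
def distinct_n(sentences, n):
    total, unique = 0, set()
    for sent in sentences:
        tokens = sent.split()
        ngrams = list(zip(*(tokens[i:] for i in range(n))))
        total += len(ngrams)
        unique.update(ngrams)
    return len(unique) / max(total, 1)

sents = [line.strip() for line in open("generated.txt", encoding="utf-8") if line.strip()]
print("Distinct-1:", distinct_n(sents, 1), "Distinct-2:", distinct_n(sents, 2))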

Code (eval_stories.py) is used to evaluate the automatic metrics of generated story endings:

python eval_stories.py --folderpath FOLDER_FOR_EVALUATION 

Besides, we also calculate UNION and BERTScore for the story ending generation task.
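UNION is a learned metric that needs its own trained evaluator, but BERTScore can be reproduced with the bert-score package (pip install bert-score); the file names below are placeholders:

# Sketch: BERTScore between generated endings and references, one per line.
from bert_score import score

cands = [l.strip() for l in open("generated_endings.txt", encoding="utf-8")]
refs = [l.strip() for l in open("reference_endings.txt", encoding="utf-8")]
P, R, F1 = score(cands, refs, lang="en")
print("BERTScore F1:", F1.mean().item())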

Citation

If you find our paper useful in your work, please cite it:

@inproceedings{yang-wan-2022-dependency,
    title = "Dependency-based Mixture Language Models",
    author = "Yang, Zhixian  and
      Wan, Xiaojun",
    booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = may,
    year = "2022",
    address = "Dublin, Ireland",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.acl-long.535",
    pages = "7758--7773",
}
