ASTRAPOP

The official repository for the paper "Authorship Style Transfer with Policy Optimization".

Installation

Commands for environment setup with conda:

conda create --name astrapop python=3.8
conda activate astrapop
pip install -U pip
pip install -r requirements.txt

Data

Please download the original Reddit Million User Dataset (MUD) from here and the original ETS Corpus of Non-Native Written English from here. We will publish the data preprocessing code soon.

Reproduce Results

Reddit

To reproduce the results on the Reddit dataset, run the scripts in scripts/reddit in the order below.

Train the paraphrase model and the reference SFT model by running 00_train_paraphraser.sh and 00_train_sft.sh.
Generate the data for DPO and CPO training by running 01_generate_dpo_cpo_data.sh.
Train the PO models using PPO/DPO/CPO by running 02_train_ppo.sh/02_train_dpo.sh/02_train_cpo.sh.
Transfer the texts in the test set by running 03_generate.sh.
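
The four Reddit steps above can be chained in one wrapper. The following is a minimal sketch, not part of the repository: the names run_stage, reddit_pipeline, and DRY_RUN are hypothetical, and it assumes the scripts are invoked with bash from the repository root and take no extra arguments, which the README does not state.

```shell
#!/usr/bin/env bash
# Sketch only: chains the Reddit steps in the order listed in this README.
# Assumptions (not confirmed by the repository): scripts are run with bash
# from the repository root and need no command-line arguments.

# run_stage DIR SCRIPT...: run each script in DIR in order, stop on first failure.
# With DRY_RUN=1 the commands are printed instead of executed.
run_stage() {
  local dir="$1"; shift
  local script
  for script in "$@"; do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      echo "bash ${dir}/${script}"
    else
      bash "${dir}/${script}" || return 1
    fi
  done
}

# reddit_pipeline METHOD: METHOD must be one of ppo, dpo, cpo.
reddit_pipeline() {
  local method="$1"
  local dir="scripts/reddit"
  case "$method" in
    ppo|dpo|cpo) ;;
    *) echo "usage: reddit_pipeline ppo|dpo|cpo" >&2; return 2 ;;
  esac
  # Step 1: paraphrase model and reference SFT model.
  run_stage "$dir" 00_train_paraphraser.sh 00_train_sft.sh || return 1
  # Step 2: data for DPO and CPO training (kept for every method so the
  # order matches the README; it may be unnecessary for PPO).
  run_stage "$dir" 01_generate_dpo_cpo_data.sh || return 1
  # Step 3: policy optimization with the chosen method.
  run_stage "$dir" "02_train_${method}.sh" || return 1
  # Step 4: transfer the test set.
  run_stage "$dir" 03_generate.sh
}
```

After sourcing the file, `DRY_RUN=1 reddit_pipeline dpo` prints the five commands it would run without starting any training, and `reddit_pipeline dpo` executes them.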

ETS

To reproduce the results on the ETS dataset, run the scripts in scripts/ets in the order below.

Train the style reward model, the paraphrase model, and the reference SFT model by running 00_train_cls.sh, 00_train_paraphraser.sh, and 00_train_sft.sh.
Generate the data for DPO and CPO training by running 01_generate_dpo_cpo_data.sh.
Train the PO models using PPO/DPO/CPO by running 02_train_ppo.sh/02_train_dpo.sh/02_train_cpo.sh.
Transfer the texts in the test set by running 03_generate.sh.
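
Before starting a long training run, it can help to confirm that every script named in the ETS steps is present. The following is a minimal sketch, not part of the repository: missing_scripts and check_ets_scripts are hypothetical helper names, and the default path scripts/ets assumes the check is run from the repository root.

```shell
#!/usr/bin/env bash
# Sketch only: a preflight check for the ETS scripts named in this README.

# missing_scripts DIR NAME...: print each NAME that is not a regular file in DIR.
# Returns 0 when nothing is missing, 1 otherwise.
missing_scripts() {
  local dir="$1"; shift
  local name status=0
  for name in "$@"; do
    if [ ! -f "${dir}/${name}" ]; then
      echo "$name"
      status=1
    fi
  done
  return "$status"
}

# check_ets_scripts [DIR]: check the eight script names listed in the ETS steps.
# DIR defaults to scripts/ets (assumes the repository root as working directory).
check_ets_scripts() {
  missing_scripts "${1:-scripts/ets}" \
    00_train_cls.sh 00_train_paraphraser.sh 00_train_sft.sh \
    01_generate_dpo_cpo_data.sh \
    02_train_ppo.sh 02_train_dpo.sh 02_train_cpo.sh \
    03_generate.sh
}
```

After sourcing the file, `check_ets_scripts && echo "all ETS scripts found"` lists any missing script names and prints the confirmation only when none are missing.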
