Learning Disentangled Meaning and Style Representations for Positive Text Reframing

Natural Language Process System-set (NLPx)

The code of paper is developed on a python package for processing NLP tasks based on transformers, pytorch, datasets etc. libraries. Its mission is to reduce the duplicate labors when we set up and NLP models or framework in current popular deep learning framework or methodology cross multiple nodes.

Data

MSCOCO: used to create the pseudo paraphrase generation dateset for PTR.
Yelp: used to create the pseudo sentiment transfer dateset for PTR.
PPF: used to evaluate methods for PTR.

Source of data: MSCOCO (version from here), Yelp (version from here), PPF

Installation

Install from source

Clone the repository and install NLPx with the following commands

git clone git@github.com:codesedoc/DMSR.git
cd DMSR
pip install -e .

Install with Docker

Preparation

Ubuntu (22.04 LTS)
Docker (>= 23.0.5)

To protect system data during running docker container, it is recommended to creat a user belong to docker group, but without root permission. Running follow command can create an account name "docker-1024"!

bash sh/docker-1024

Running follow command to build the image of basic environment of NLPx.

docker compose build nlpx-env

To use Nvidia GPU in docker containers, please install the "NVIDIA Container Toolkit" referring to here.

Environment Variables

There are some necessary variables used during building images. They are defined in the file ".env"

Conduct Experiments

Running follow command to start

bash sh/docker-run

To conduct different variants of method in paper, please define the value of variable "ARGS_FILE" to the path of experiment argument file.

Citation

@inproceedings{sheng-etal-2023-learning,
    title = "Learning Disentangled Meaning and Style Representations for Positive Text Reframing",
    author = "Sheng, Xu  and
      Fukumoto, Fumiyo  and
      Li, Jiyi  and
      Kentaro, Go  and
      Suzuki, Yoshimi",
    booktitle = "Proceedings of the 16th International Natural Language Generation Conference",
    month = sep,
    year = "2023",
    address = "Prague, Czechia",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.inlg-main.31",
    pages = "424--430",
    abstract = "The positive text reframing (PTR) task which generates a text giving a positive perspective with preserving the sense of the input text, has attracted considerable attention as one of the NLP applications. Due to the significant representation capability of the pre-trained language model (PLM), a beneficial baseline can be easily obtained by just fine-tuning the PLM. However, how to interpret a diversity of contexts to give a positive perspective is still an open problem. Especially, it is more serious when the size of the training data is limited. In this paper, we present a PTR framework, that learns representations where the meaning and style of text are structurally disentangled. The method utilizes pseudo-positive reframing datasets which are generated with two augmentation strategies. A simple but effective multi-task learning-based model is learned to fuse the generation capabilities from these datasets. Experimental results on Positive Psychology Frames (PPF) dataset, show that our approach outperforms the baselines, BART by five and T5 by six evaluation metrics. Our source codes and data are available online.",
}

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
docker		docker
environment		environment
experiment		experiment
output/dps		output/dps
sh		sh
src		src
storage		storage
.env		.env
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
example.py		example.py
main.py		main.py
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning Disentangled Meaning and Style Representations for Positive Text Reframing

Natural Language Process System-set (NLPx)

Data

Installation

Install from source

Install with Docker

Preparation

Environment Variables

Conduct Experiments

Citation

About

Releases

Packages

Languages

License

codesedoc/DMSR

Folders and files

Latest commit

History

Repository files navigation

Learning Disentangled Meaning and Style Representations for Positive Text Reframing

Natural Language Process System-set (NLPx)

Data

Installation

Install from source

Install with Docker

Preparation

Environment Variables

Conduct Experiments

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages