
Mixture-of-Domain-Adapters

Code for the ACL 2023 paper Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memories (camera-ready version).

Incompatibilities with New Versions of PyTorch Lightning

PyTorch Lightning 2.0.0 introduced several breaking changes that are incompatible with this codebase. For now, please run the code with pytorch_lightning==1.9.0.
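
As a quick safeguard, a minimal sketch like the following (not part of the original codebase) can fail fast if an incompatible Lightning version is installed:

```python
# Sketch: abort early if an incompatible PyTorch Lightning version is installed.
# The pinned version (1.9.0) follows the note above; this check itself is an
# illustrative addition, not part of the repository.
import pytorch_lightning as pl

major = int(pl.__version__.split(".")[0])
if major >= 2:
    raise RuntimeError(
        f"Found pytorch_lightning=={pl.__version__}; "
        "this code expects pytorch_lightning==1.9.0."
    )
```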

Datasets

Datasets can be found here. Pretrained Stage 1 models are here.

For Stage 1, datasets are JSON files structured like this:

[
  {
    "prompt": "This is a prompt with targets like this to be {} .",
    "targets": [
      "filled"
    ]
  }
]
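
For reference, here is a minimal sketch of reading such a file and filling each prompt with its targets; the file name `stage1_data.json` is a placeholder, not an actual path in the repository:

```python
# Sketch: load a Stage 1 JSON file and expand each prompt with its targets.
# "stage1_data.json" is a placeholder path; point it at your own dataset.
import json

with open("stage1_data.json") as f:
    examples = json.load(f)

for example in examples:
    for target in example["targets"]:
        # Each prompt contains a "{}" slot that the target fills in.
        print(example["prompt"].format(target))
```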

For Stage 2 (classification tasks), datasets are JSONL files with one example per line:

{"text": "This is a helpful sentence.", "label": "helpful"}

You can modify the data-loading code to adapt the model to your own dataset.

Running the Code

Please refer to the instructions in stage_one_pretrain.sh and stage_two.sh, which give examples of how to run Stage 1 and Stage 2 training, respectively.

Citation

If you use or extend our work, please cite the following paper:

@article{diao2023mixture,
  title={Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memories},
  author={Diao, Shizhe and Xu, Tianyang and Xu, Ruijia and Wang, Jiawei and Zhang, Tong},
  journal={arXiv preprint arXiv:2306.05406},
  year={2023}
}

Questions?

Please raise your questions by opening an issue or direct them to xu1868@purdue.edu.
