Code for *Self-Exploring Language Models: Active Preference Elicitation for Online Alignment* (SELM).
Authors: Shenao Zhang¹, Donghan Yu², Hiteshi Sharma², Ziyi Yang², Shuohang Wang², Hany Hassan², Zhaoran Wang¹.
¹Northwestern University, ²Microsoft
Released SELM models on the Hugging Face Hub:
- 🤗 Zephyr Models
- 🤗 Llama-3 Models
- 🤗 Phi-3 Models
To run the code in this project, first create a Python virtual environment, e.g. with Conda:

```shell
conda create -n selm python=3.10 && conda activate selm
```
You can then install the remaining package dependencies as follows:

```shell
python -m pip install .
```
You will also need Flash Attention 2 installed, which can be done by running:

```shell
python -m pip install flash-attn==2.3.6 --no-build-isolation
```
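Because Flash Attention compiles against your local CUDA toolchain, it is worth a quick sanity check that the build succeeded before training, e.g.:

```shell
# Should print the installed flash-attn version (e.g. 2.3.6) without an ImportError.
python -c "import flash_attn; print(flash_attn.__version__)"
```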
Next, log into your Hugging Face account as follows:

```shell
huggingface-cli login
```
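If you prefer a non-interactive login (e.g. on a remote machine or in a job script), `huggingface-cli` also accepts a token directly; `HF_TOKEN` below is a placeholder for an access token created in your Hugging Face account settings:

```shell
# Non-interactive alternative; assumes HF_TOKEN holds a valid access token.
huggingface-cli login --token "$HF_TOKEN"
```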
Finally, install Git LFS so that you can push models to the Hugging Face Hub:

```shell
sudo apt-get install git-lfs
```
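Git LFS also needs to be initialized once per machine after the package is installed:

```shell
# One-time setup: registers the LFS filters in your global Git config.
git lfs install
```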
To train SELM on Meta-Llama-3-8B-Instruct, you first need to apply for access to the gated model on the Hugging Face Hub. To train SELM on Phi-3-mini-4k-instruct, upgrade vLLM:

```shell
pip install vllm==0.4.2
```
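Once your access request is approved, you can verify that your logged-in account can reach the gated repo, for example by fetching a single small file with `huggingface-cli` (the repo id matches the official Meta release):

```shell
# Succeeds only if your account has been granted access to the gated model.
huggingface-cli download meta-llama/Meta-Llama-3-8B-Instruct config.json
```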
Replace `HF_USERNAME` in `train_zephyr.sh`, `train_llama.sh`, and `train_phi.sh` with your Hugging Face username, e.g. as shown below.
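One way to do this in a single command, assuming GNU sed and using `your-hf-username` as a placeholder for your actual username:

```shell
# Substitute the HF_USERNAME placeholder in all three training scripts in place.
sed -i 's/HF_USERNAME/your-hf-username/g' train_zephyr.sh train_llama.sh train_phi.sh
```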
After the above preparation, run the following commands.

Train SELM on Zephyr-SFT:

```shell
sh run_zephyr.sh
```

Train SELM on Meta-Llama-3-8B-Instruct:

```shell
sh run_llama.sh
```

Train SELM on Phi-3-mini-4k-instruct:

```shell
sh run_phi.sh
```
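Since the setup above logs into the Hub and sets `HF_USERNAME`, the trained checkpoints should end up under your Hub account. This is a minimal sketch of loading one with `transformers`; the repo id is a hypothetical placeholder for whatever name your run actually pushed:

```shell
python - <<'EOF'
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repo id: replace with the model your run pushed to the Hub.
repo_id = "your-hf-username/selm-zephyr-7b"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)
print(model.config.model_type)
EOF
```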
To cite this work:

```bibtex
@article{zhang2024self,
  title={Self-Exploring Language Models: Active Preference Elicitation for Online Alignment},
  author={Zhang, Shenao and Yu, Donghan and Sharma, Hiteshi and Yang, Ziyi and Wang, Shuohang and Hassan, Hany and Wang, Zhaoran},
  journal={arXiv preprint arXiv:2405.19332},
  year={2024}
}
```
This repo is built upon The Alignment Handbook. We thank the authors for their great work.