Yuyan-B/MLC
Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

Requirement & Installation

This repository is built upon LLaMA-Factory. Simply install all dependencies via:

conda create -n mlc python=3.10
conda activate mlc
pip install -r requirements.txt

For detailed environment setup (CUDA, deepspeed, vLLM, etc.), please also refer to the LLaMA-Factory installation guide.

Dataset

The data has been placed in the /data directory and registered in data_info.json; specifically, we use multilin-pku-saferlhf-alpaca3-8b-train.json for training.
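As a minimal sketch of the data layout, the snippet below builds and validates a record in the Alpaca instruction format that LLaMA-Factory-style SFT datasets typically use. The field names and the example content are assumptions for illustration; the actual schema of multilin-pku-saferlhf-alpaca3-8b-train.json is defined by the files in /data and data_info.json.

```python
import json

# Hypothetical record in the Alpaca instruction format; the actual
# fields of multilin-pku-saferlhf-alpaca3-8b-train.json may differ.
record = {
    "instruction": "How do I safely dispose of old batteries?",
    "input": "",
    "output": "Take them to a certified battery recycling point.",
}

def validate_alpaca_record(rec: dict) -> bool:
    """Check that a record carries the minimal Alpaca-style fields."""
    required = ("instruction", "output")
    return all(isinstance(rec.get(k), str) and rec[k] for k in required)

# A dataset file of this kind is simply a JSON list of such records.
dataset = [record]
print(validate_alpaca_record(record))   # expect True for a well-formed record
print(json.dumps(dataset, indent=2)[:60])
```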

Training

Please run the following command to start the training process.

llamafactory-cli train examples/MLC/{model}.yaml

where {model} is gemma2-9b or qwen2.5-7b.
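For orientation, the sketch below shows the general shape of a LLaMA-Factory training YAML. It is illustrative only: the actual files in examples/MLC/ are authoritative, and the stage, model id, dataset name, template, and hyperparameters here are all assumptions.

```yaml
### model
model_name_or_path: google/gemma-2-9b   # assumption: Hugging Face model id

### method
stage: sft            # assumption: the MLC objective may define its own stage
do_train: true
finetuning_type: full

### dataset
dataset: multilin-pku-saferlhf-alpaca3-8b-train   # assumption: name registered in data_info.json
template: gemma
cutoff_len: 2048

### output
output_dir: saves/gemma2-9b/mlc
logging_steps: 10
save_steps: 500

### train (illustrative values only)
per_device_train_batch_size: 1
gradient_accumulation_steps: 8
learning_rate: 1.0e-5
num_train_epochs: 3.0
bf16: true
```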

Evaluation

We provide the safety evaluation data in the /safe_eval directory. We adopt GPT-4o as the evaluation model and use deterministic greedy decoding (temperature = 0, top-k = 1). The evaluation prompts are adapted from those used in the original paper for each dataset. For more details, please refer to the appendix of the paper.
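A minimal sketch of aggregating judge outputs into a safety score. The "safe"/"unsafe" labels and the `safety_rate` helper are illustrative assumptions; the actual verdict format is determined by the prompts in /safe_eval.

```python
from collections import Counter

def safety_rate(verdicts: list[str]) -> float:
    """Fraction of model responses the judge labeled safe.

    `verdicts` is a hypothetical list of per-response judge labels;
    case and surrounding whitespace are normalized before counting.
    """
    counts = Counter(v.strip().lower() for v in verdicts)
    total = sum(counts.values())
    return counts["safe"] / total if total else 0.0

verdicts = ["safe", "unsafe", "safe", "Safe"]
print(f"{safety_rate(verdicts):.2f}")  # 3 of 4 responses judged safe -> 0.75
```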

We conduct general-capability evaluation (MMLU, MMMLU-lite) with OpenCompass.

Credits

The code in this repository builds on LLaMA-Factory, and we sincerely thank its authors.
