LifeAlign

Official codebase for the paper LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization.

This project is built on top of LlamaFactory and extends its DPO/SFT training pipeline for lifelong alignment. Following the paper, the implementation centers on two ideas:

Focalized Preference Optimization (FPO): Adaptively allocates learning intensity based on sample difficulty during the preference optimization process.
Short-to-Long Memory Consolidation (SLMC): Denoises task-specific updates and projects them into a stable long-term memory space to reduce forgetting.

At the code level, the main components are:

src/llamafactory/train/dpo/MyDPOTrainer.py: FPO-style preference loss.
src/llamafactory/train/dpo/CL_methods/CLManager.py: memory consolidation with denoising and subspace projection.
src/llamafactory/train/dpo/BaseTrainerCL.py: continual training pipeline and the built-in SFT initialization stage.

Environment Setup

Because this repo is based on LlamaFactory, the environment preparation can largely follow the standard LlamaFactory setup.

1. Create a Python environment

conda create -n lifealign python=3.10 -y
conda activate lifealign
pip install --upgrade pip

2. Install PyTorch

Please install the PyTorch version that matches your CUDA environment first. For example:

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

If you use another CUDA version, please replace the wheel index accordingly.

3. Install this project

pip install -e ".[metrics]"

If you want to use DeepSpeed, install the optional dependency as well:

pip install -e ".[deepspeed]"

After installation, the main entry point is:

llamafactory-cli train ...

Data Format

The datasets used in this repo are registered in data/dataset_info.json. The current continual alignment setup includes:

Capybara-Preferences
HC3
hh-rlhf-harmless-base
hh-rlhf-helpful-base
safe-rlhf
TruthfulQA

Each task also has corresponding -sft and -test-sft variants for initialization and evaluation.

Training and Evaluation

This repo supports three launch modes.

1. Fully automated script via `bash.sh`

bash.sh runs the whole pipeline automatically:

builds the task sequence
launches alignment training
evaluates the resulting adapter on current and previous tasks

Run it with:

bash bash.sh

You can adjust the following fields in bash.sh before running:

ORDER: task order such as order1, order2, order3
GPU_ID
MODEL_NAME / MODEL_PATH
BASE_ADAPTER_PATH
BASE_OUTPUT_DIR

The repo also includes:

bash-MTL.sh: multi-task baseline
bash-Sep.sh: single-task / separate-task baseline
predict.sh: prediction-only evaluation script

Important:

The current bash.sh is a runnable automation template in this repo.
If you want to explicitly reproduce the LifeAlign-style setting from the paper, pay attention to arguments such as --CL_method, --loss_func, --use_replay, --denoising_threshold, and --projection_gamma.

2. Launch with `llamafactory-cli train` and explicit arguments

Below is an example of continual alignment training with a LifeAlign-style configuration:

CUDA_VISIBLE_DEVICES=0,1 llamafactory-cli train \
  --model_name_or_path /path/to/Qwen2.5-7B-Instruct \
  --stage dpo \
  --do_train \
  --finetuning_type lora \
  --lora_target q_proj,v_proj \
  --pref_beta 0.1 \
  --pref_loss sigmoid \
  --CL_method my_method \
  --loss_func FPO \
  --use_replay \
  --denoising_threshold 0.9 \
  --projection_gamma 0.5 \
  --dataset Capybara-Preferences,HC3,hh-rlhf-harmless-base,hh-rlhf-helpful-base,safe-rlhf,TruthfulQA \
  --template qwen \
  --cutoff_len 2048 \
  --overwrite_cache \
  --preprocessing_num_workers 4 \
  --output_dir saves/Qwen2.5-7B-Instruct/lora/LifeAlign-order1 \
  --logging_steps 10 \
  --save_steps 1000 \
  --plot_loss \
  --overwrite_output_dir \
  --report_to none \
  --per_device_train_batch_size 1 \
  --gradient_accumulation_steps 16 \
  --learning_rate 5.0e-6 \
  --num_train_epochs 3.0 \
  --lr_scheduler_type cosine \
  --warmup_ratio 0.1 \
  --bf16

Prediction / evaluation example:

CUDA_VISIBLE_DEVICES=0 llamafactory-cli train \
  --model_name_or_path /path/to/Qwen2.5-7B-Instruct \
  --adapter_name_or_path saves/Qwen2.5-7B-Instruct/lora/LifeAlign-order1/TruthfulQA \
  --stage sft \
  --do_predict \
  --finetuning_type lora \
  --eval_dataset Capybara-Preferences-test-sft,HC3-test-sft,hh-rlhf-harmless-base-test-sft,hh-rlhf-helpful-base-test-sft,safe-rlhf-test-sft,TruthfulQA-test-sft \
  --template qwen \
  --cutoff_len 2048 \
  --overwrite_cache \
  --preprocessing_num_workers 4 \
  --max_new_tokens 1024 \
  --output_dir save_test/Qwen2.5-7B-Instruct/lora/LifeAlign-order1/TruthfulQA \
  --overwrite_output_dir \
  --report_to none \
  --per_device_eval_batch_size 4 \
  --predict_with_generate

3. Launch with a YAML config file

You can also run training by passing a YAML file directly:

llamafactory-cli train examples/train_lora/llama3_lora_dpo.yaml

Or prepare your own YAML and run:

llamafactory-cli train /path/to/your_config.yaml

The repo already contains several examples under examples/train_lora/, such as:

examples/train_lora/llama3_lora_dpo.yaml
examples/train_lora/llama3_lora_sft.yaml
examples/train_lora/llama3_lora_sft_initialize.yaml

Built-in SFT Initialization

examples/train_lora/llama3_lora_sft_initialize.yaml is the configuration file for SFT initialization.

In the current implementation, this step is built into the code path:

every continual alignment training job first performs SFT initialization
then the alignment stage starts

This means the initialization is not merely an optional example YAML; it is part of the actual training pipeline.

If you want to modify:

the base model
template
LoRA target modules
datasets used in initialization
output path
batch size, learning rate, or other SFT hyperparameters

you need to edit:

examples/train_lora/llama3_lora_sft_initialize.yaml

before launching training.

Output Structure

Typical outputs are:

saves/...: checkpoints and LoRA adapters
save_test/...: prediction and evaluation results

For continual learning runs, each task usually has its own subdirectory under the main output directory.

Notes

This repo inherits most training conventions from LlamaFactory.
Multi-GPU training can be launched directly with llamafactory-cli train; LlamaFactory will invoke distributed training automatically when multiple devices are visible.
Before large-scale experiments, it is recommended to first check model path, dataset registration, template name, and output directory settings.

Citation

If you find this project useful, please cite the paper:

@inproceedings{li2026lifealign,
  title={Lifealign: Lifelong alignment for large language models with memory-augmented focalized preference optimization},
  author={Li, Junsong and Zhou, Jie and Zhan, Bihao and Yang, Yutao and Pan, Qianjun and Chen, Shilian and Huai, Tianyu and Li, Xin and Chen, Qin and He, Liang},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={40},
  number={37},
  pages={31618--31626},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
Images		Images
data		data
evaluation		evaluation
examples		examples
fig		fig
llm_judge		llm_judge
poster		poster
src		src
.gitignore		.gitignore
README.md		README.md
README_llamafactory_base.md		README_llamafactory_base.md
README_llamafactory_base_zh.md		README_llamafactory_base_zh.md
README_zh.md		README_zh.md
bash-MTL.sh		bash-MTL.sh
bash-Sep.sh		bash-Sep.sh
bash.sh		bash.sh
predict.sh		predict.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LifeAlign

Environment Setup

1. Create a Python environment

2. Install PyTorch

3. Install this project

Data Format

Training and Evaluation

1. Fully automated script via `bash.sh`

2. Launch with `llamafactory-cli train` and explicit arguments

3. Launch with a YAML config file

Built-in SFT Initialization

Output Structure

Notes

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LifeAlign

Environment Setup

1. Create a Python environment

2. Install PyTorch

3. Install this project

Data Format

Training and Evaluation

1. Fully automated script via bash.sh

2. Launch with llamafactory-cli train and explicit arguments

3. Launch with a YAML config file

Built-in SFT Initialization

Output Structure

Notes

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. Fully automated script via `bash.sh`

2. Launch with `llamafactory-cli train` and explicit arguments

Packages