SHIFT: Gate-Modulated Activation Steering for Knowledge Conflict Mitigation in Retrieval-Augmented Generation

Ruochang Li^1, Pengcheng Huang^1, Zhenghao Liu^1†, Yukun Yan²,
Huiyuan Xie², Yu Gu¹, Ge Yu¹, Maosong Sun²

¹Northeastern University, ²Tsinghua University

📖 Overview

SHIFT is a lightweight framework for resolving knowledge conflicts in retrieval-augmented generation. Instead of directly editing internal neurons, SHIFT adds a small learnable gate module to frozen LLMs, allowing them to adaptively balance retrieved context and parametric knowledge during generation. With fewer than 0.01% trainable parameters, SHIFT improves context reliance while minimizing unintended effects on general model capabilities.

⚙️ Setup

Create Conda Environment

conda create --name shift python==3.10.0
conda activate shift
git clone https://github.com/OpenBMB/SHIFT.git
cd SHIFT

Install PyTorch

pip install torch==2.6.0 --index-url https://download.pytorch.org/whl/cu124

Install Flash Attention

pip install https://github.com/Dao-AILab/flash-attention/releases/download/v2.7.4.post1/flash_attn-2.7.4.post1+cu12torch2.6cxx11abiFALSE-cp310-cp310-linux_x86_64.whl

Install the rest of the dependencies

pip install -r requirements.txt

Patch vLLM model files
This project requires modifications to the vLLM implementations of Qwen3 and LLaMA. After installing the requirements, run the patch script:

bash vllm/patch_vllm.sh

Note: Please run this script after 'pip install -r requirements.', because installing or reinstalling vLLM may overwrite the patched files.

⚡ Data

Our training data can be downloaded from SHIFT_Training_Data. After downloading, place the files into the dataset folder.

To construct the data from scratch, download the files from MRQA-Shared-Task-2019.
Use the downloaded data to synthesize it with FlashRAG.

🔥 Training

GRPO with a single GPU:

python single_gpu.py

GRPO with multiple GPUs:

python multi_gpu.py

📊 Evaluation

For MRQA and ConfiQA:

python eval.py

For MMLU, use lm-evaluation-harness

🎯 Analysis

We also provide the t-SNE visualization pipeline for gates in SHIFT, with corresponding figures available under the figs folder:

python run_batch_tsne.py

For Qwen-3-0.6B:

For Qwen-3-8B:

🎉 Acknowledgement

Our work is built on the following codebases, and we are deeply grateful for their contributions.

🥰 Citation

If you find this work useful, please cite our paper and give us a shining star 🌟

@article{Li2026shift,
      title={SHIFT: Gate-Modulated Activation Steering for Knowledge Conflict Mitigation in Retrieval-Augmented Generation},
      author={Li, Ruochang and Huang, Pengcheng and Liu, Zhenghao and Yan, Yukun and Xie, Huiyuan and Gu, Yu and Yu, Ge and Sun, Maosong},
      year={2026}
      url={}, 
}

📧 Contact

If you have questions, collaboration opportunities, or potential PhD opportunities in the United States, please feel free to email:

ruochangli@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
datasets		datasets
figs		figs
models		models
src		src
vLLM		vLLM
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SHIFT: Gate-Modulated Activation Steering for Knowledge Conflict Mitigation in Retrieval-Augmented Generation

Ruochang Li^1, Pengcheng Huang^1, Zhenghao Liu^1†, Yukun Yan²,
Huiyuan Xie², Yu Gu¹, Ge Yu¹, Maosong Sun²

¹Northeastern University, ²Tsinghua University

📖 Overview

⚙️ Setup

⚡ Data

🔥 Training

📊 Evaluation

🎯 Analysis

🎉 Acknowledgement

🥰 Citation

📧 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SHIFT: Gate-Modulated Activation Steering for Knowledge Conflict Mitigation in Retrieval-Augmented Generation

Ruochang Li1*, Pengcheng Huang1*, Zhenghao Liu1†, Yukun Yan2, Huiyuan Xie2, Yu Gu1, Ge Yu1, Maosong Sun2 1Northeastern University, 2Tsinghua University

📖 Overview

⚙️ Setup

⚡ Data

🔥 Training

📊 Evaluation

🎯 Analysis

🎉 Acknowledgement

🥰 Citation

📧 Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Ruochang Li^1, Pengcheng Huang^1, Zhenghao Liu^1†, Yukun Yan²,
Huiyuan Xie², Yu Gu¹, Ge Yu¹, Maosong Sun²

¹Northeastern University, ²Tsinghua University

Packages