Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning

ICML 2026

State-Adaptive Prompt Optimization (SAPO)

This repository contains the official implementation for our work "Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning".

💡Introduction

Unlike previous prompt engineering works that solely focus on evaluation performance during inference, our work is the first to systematically investigate the impact of training prompts on model capabilities after fine-tuning. We discover that:

While different paraphrased prompts have a negligible impact on the current task's performance, they exert a profound influence on catastrophic forgetting of previously trained tasks and generalization to unseen tasks.
These cross-task impacts are positively correlated, indicating that training prompt formulation is a tractable optimization objective.
There exists an effective pre-indicator (pre-update loss) to reliably identify superior prompts prior to learning.

Based on these findings, we propose SAPO (State-Adaptive Prompt Optimization), a lightweight and dynamic training strategy.

🛠 Setup

1. Environment

You can install the required libraries by running the following command:

pip install -r requirements.txt

2. API Configuration for Prompt Generation

Before training with SAPO, you need to configure the LLM used for generating paraphrase prompts.

Please modify the API_KEY and URL in the following file: src/prompt_gen/API.py

📂 Data Preparation

The datasets used in our experiments are organized in the following directories:

SUPERNI: Located in SUPERNI/
TRACE: Located in TRACE/

🚀 Training

We provide training scripts for 4 methods: LoRAInc, LoRAInc + SAPO, O-LoRA, and O-LoRA + SAPO. The scripts are located in the scripts/ folder.

Below are the examples for training Qwen3-8B on the NI_SEQ_C1 task sequence.

1. LoRAInc

mkdir -p saves/LoraInc/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/LoraInc/qwen3-8b/ni_seq_c1.sh > saves/LoraInc/qwen3-8b/ni_seq_c1/logs/train.log 2>&1

2. LoRAInc + SAPO (Ours)

mkdir -p saves/LoraInc_SAPO/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/LoraInc_SAPO/qwen3-8b/ni_seq_c1.sh > saves/LoraInc_SAPO/qwen3-8b/ni_seq_c1/logs/train.log 2>&1

3. O-LoRA

mkdir -p saves/O-LoRA/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/O-LoRA/qwen3-8b/ni_seq_c1.sh > saves/O-LoRA/qwen3-8b/ni_seq_c1/logs/train.log 2>&1

4. O-LoRA + SAPO (Ours)

mkdir -p saves/O-LoRA_SAPO/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/O-LoRA_SAPO/qwen3-8b/ni_seq_c1.sh > saves/O-LoRA_SAPO/qwen3-8b/ni_seq_c1/logs/train.log 2>&1

📊 Evaluation

We provide the initial zero-shot performance of different models on SUPERNI and TRACE in the original_performance/ directory.

To evaluate the Continual Learning metrics (AP, BWT, FWT), run src/run_evaluation.py.

Example: Evaluate LoRAInc on Qwen3-8B (NI_seq_C1):

python3 src/run_evaluation.py \
    --outputs_dir saves/LoraInc/qwen3-8b/ni_seq_c1/outputs \
    --initial_metric_file original_performance/qwen3-8b.json

Arguments:

--outputs_dir: The directory containing the training outputs (e.g., subfolders like 1-taskXXX, 2-taskXXX).
--initial_metric_file: The JSON file containing the model's initial performance.

💐 Acknowledgments

This repository benefits from O-LoRA. We thank the authors for their wonderful work.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning

ICML 2026

State-Adaptive Prompt Optimization (SAPO)

💡Introduction

🛠 Setup

1. Environment

2. API Configuration for Prompt Generation

📂 Data Preparation

🚀 Training

1. LoRAInc

2. LoRAInc + SAPO (Ours)

3. O-LoRA

4. O-LoRA + SAPO (Ours)

📊 Evaluation

💐 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
CL_configs/order		CL_configs/order
SUPERNI		SUPERNI
TRACE		TRACE
configs		configs
data		data
scripts		scripts
src		src
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning ICML 2026

State-Adaptive Prompt Optimization (SAPO)

💡Introduction

🛠 Setup

1. Environment

2. API Configuration for Prompt Generation

📂 Data Preparation

🚀 Training

1. LoRAInc

2. LoRAInc + SAPO (Ours)

3. O-LoRA

4. O-LoRA + SAPO (Ours)

📊 Evaluation

💐 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning

ICML 2026

Packages