GitHub

PSRT: Accelerating LRM-based Guard Models via Prefilled Safe Reasoning Traces

Environment Setup

Install dependencies using:

pip install -r requirements.txt

Step 0: SFT Training

Perform SFT (Supervised Fine-Tuning).
You can use the openr1 SFT script.
Optionally, reinforcement learning steps can be added.

Step 1.0: PSRT Training

Edit the parameters in train_step_1.sh before starting training. The model should be the output from the previous step.

MODEL_NAME_OR_PATH=""          # Path to the base model
DATASET_PATH="./dataset/step_1_train_format.json"
OUTPUT_DIR=""                  # Path to save training outputs

Start training with:

bash train_step_1.sh

Step 1.1: PSRT Inference

Edit the parameters in inference_step_1_dataset.sh:

FILE_PATHS=( path/to/your/files )    # List of input files
model_names=("Ministral-8B-Instruct-2410")
prompt_lengths=(260)                 # Length used in Step 1.0 training

Run inference with:

bash inference_step_1_dataset.sh

Step 2: PBC

Edit the parameters in inference_step_2_dataset.sh:

FILE_PATHS=( path/to/your/files )

Run inference with:

bash inference_step_2_dataset.sh

Additional Datasets

Other training datasets can be found in the train.zip file.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
dataset_eval		dataset_eval
dataset_train		dataset_train
fig		fig
open-r1-main		open-r1-main
output_step_1		output_step_1
reasoner_guard		reasoner_guard
README.md		README.md
inference_step_1_dataset.py		inference_step_1_dataset.py
inference_step_1_dataset.sh		inference_step_1_dataset.sh
inference_step_2_dataset.py		inference_step_2_dataset.py
inference_step_2_dataset.sh		inference_step_2_dataset.sh
requirements.txt		requirements.txt
train.zip		train.zip
train_step_1.py		train_step_1.py
train_step_1.sh		train_step_1.sh
training_part.py		training_part.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PSRT: Accelerating LRM-based Guard Models via Prefilled Safe Reasoning Traces

Environment Setup

Step 0: SFT Training

Step 1.0: PSRT Training

Step 1.1: PSRT Inference

Step 2: PBC

Additional Datasets

About

Uh oh!

Releases

Packages

Languages

weiyezhimeng/PSRT

Folders and files

Latest commit

History

Repository files navigation

PSRT: Accelerating LRM-based Guard Models via Prefilled Safe Reasoning Traces

Environment Setup

Step 0: SFT Training

Step 1.0: PSRT Training

Step 1.1: PSRT Inference

Step 2: PBC

Additional Datasets

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages