DITS: Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search

DITS a novel framework that incorporates influence scores to guide both tree search and data selection. By leveraging influence scores, we effectively identify the most impactful data for system improvement, thereby enhancing model performance. Furthermore, we derive influence score estimation methods tailored for non-differentiable metrics, significantly reducing computational overhead by utilizing inference computations.

🛠 Installation

DITS requires two conda environments: one for vLLM deployment and another for training, both using Python 3.11. Follow these steps to set up your environments:

vLLM Environment

conda create -n DITS-vllm python=3.11
# Install NVCC for CUDA
conda install nvidia/label/cuda-12.1.0::cuda-nvcc
# Install PyTorch 2.3.1 (required by VLLM==0.6.1.post1)
conda install pytorch=2.3.1 torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
# Install VLLM 0.6.1.post1
pip install vllm==0.6.1.post1

Training Environment

conda create -n DITS-train python=3.11
# Install NVCC for CUDA
conda install nvidia/label/cuda-12.1.0::cuda-nvcc
# Install PyTorch 2.3.1 (required by VLLM==0.6.1.post1)
conda install pytorch=2.3.1 torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
# Install Alignment Handbook 
cd alignment-handbook
pip install -e .
cd ..
# Install Dependencies
pip install flash-attn --no-build-isolation
pip install -r requirements.txt

🏃‍♂️ How to Run

Prepare datasets

To ensure proper execution of the code, the following path adjustments are required:

File Path Replacement
- Replace all instances of YOUR_HOME_PATH_HERE and YOUR_FILE_PATH_HERE in the codebase with the corresponding local file paths.
Model Path Specification
- Substitute YOUR_MODEL_PATH_HERE with the actual path to the downloaded model.

These modifications are necessary for the code to function as intended.

Run the combined DITS-iSFT-DPO setting with hotpotqa dataset:

export INITIAL_MODEL_PATH="YOUR_MODEL_PAHT_HERE"
export DATASET='hotpot_qa'
export TOKENIZERS_PARALLELISM=false

MKL_THREADING_LAYER=GNU OUTLINES_CACHE_DIR='./outlines/hotpot_qa' python sft_dpo_script_DI.py \
    --train_config_path train/sft_dpo_recipes/hotpot_qa_DI.yaml \
    --vllm_env DITS-vllm \
    --alignment_env DITS-train

📊 Inference and Evaluation

Inference Script

To get inference results for all models on a specified test set:

OUTLINES_CACHE_DIR=./outlines/${dataset_type} 

python inference_script.py \
    --model_root_path YOUT_HOME_PATH_HERE/checkpoints/${dataset_type}_sft_dpo_DI/ \
    --tokenizer_path YOUR_MODEL_PATH_HERE \
    --device 0 \
    --port 8000 \
    --dataset_type ${dataset_type} \
    --num_thread 128 \
    --num_gpu 4 \
    --output_root_path ${output_file} \
    --vllm_env DITS-vllm \

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
agent		agent
alignment-handbook		alignment-handbook
analysis		analysis
answerParser		answerParser
dataloader		dataloader
local_influence		local_influence
message		message
model		model
reward		reward
scrips		scrips
train		train
utils		utils
LICENSE		LICENSE
README.md		README.md
dpo_script.py		dpo_script.py
dpo_script_datainfluence.py		dpo_script_datainfluence.py
dpo_script_disim.py		dpo_script_disim.py
dpo_script_generate_only.py		dpo_script_generate_only.py
inference_main.py		inference_main.py
inference_script.py		inference_script.py
inference_script_lora.py		inference_script_lora.py
inference_script_lora_evaluation.py		inference_script_lora_evaluation.py
inference_script_one.py		inference_script_one.py
inference_script_server_train.py		inference_script_server_train.py
inference_script_server_train_dynamic.py		inference_script_server_train_dynamic.py
inference_script_transformer.py		inference_script_transformer.py
llama-3-instruct.jinja		llama-3-instruct.jinja
multi_sc_analysis_main.py		multi_sc_analysis_main.py
multi_sc_data_generate_script.py		multi_sc_data_generate_script.py
ppl_deploy.py		ppl_deploy.py
requirements.txt		requirements.txt
reward_main.py		reward_main.py
sft_dpo_script.py		sft_dpo_script.py
sft_dpo_script_DI.py		sft_dpo_script_DI.py
sft_dpo_script_test.py		sft_dpo_script_test.py
sft_script.py		sft_script.py
sft_script_lora.py		sft_script_lora.py
stats_main.py		stats_main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DITS: Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search

🛠 Installation

vLLM Environment

Training Environment

🏃‍♂️ How to Run

Prepare datasets

📊 Inference and Evaluation

Inference Script

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DITS: Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search

🛠 Installation

vLLM Environment

Training Environment

🏃‍♂️ How to Run

Prepare datasets

📊 Inference and Evaluation

Inference Script

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages