Peacemaker or Troublemaker: How Sycophancy Shapes Multi-Agent Debate

This project implements a multi-agent debate system for understanding how sycophancy dynamics shape the system performance.

1. Environment Setup

Create a virtual enviroment and install dependencies.

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Download models from Huggingface to run experiments in parallel

bash ./model_download/download_all_models.sh [model_dir]

Before downloading the Llama model, you need to apply for the permission on Huggingface first if you haven't applied before, and then log in by huggingface-cli login.

2. Standard Debate

Test by calling APIs of OpenAI or Bedrock

For OpenAI, you need to set your API key first

export OPENAI_API_KEY="YOUR_KEY"

Single Agent Testing The usage case for mmlu pro is at scripts_api/run_single_agent.sh, and the use case for commonsenseqa is at scripts_api/run_multi_agent.sh

bash scripts_api/run_single_agent.sh

Multi Agent Testing by Decentralized Structure

# 2 agent
bash scripts_api/run_multi_agent.sh

#3 agent
bash scripts_api/run_multi_agent_3.sh

Multi Agent Testing by Centralized Structure

bash scripts_api/run_mad.sh

Test by Batch Inference on Local GPUs (8*40G A100s)

Single Agent Testing

bash scripts_local/run_batch_single_agent.sh

Multi Agent Testing by Decentralized Structure

## 2 agent
bash scripts_local/run_batch_multi_agent.sh

## 3 agent
bash scripts_local/run_batch_multi_agent_3.sh

Multi Agent Testing by Centralized Structure

bash scripts_local/run_batch_mad.sh

Submit job to the cluster by

bash scripts_local/run_cluster.sh

3. Control Agent Sycophancy by System Prompts

Test by Batch Inference on Local GPUs (8*40G A100s)

Multi Agent Testing by Decentralized Structure

## 2 agent
bash scripts_syco/run_batch_multi_agent_sycophancy.sh

## 3 agent
bash scripts_syco/run_batch_multi_agent_3_sycophancy.sh

Multi Agent Testing by Centralized Structure

bash scripts_local/run_batch_mad.sh

Submit job to the cluster to test different sycophancy combinations by

bash scripts_syco/run_cluster_multi_agent_sycophancy_homo.sh
bash scripts_syco/run_cluster_multi_agent_sycophancy_heter.sh

bash scripts_syco/run_cluster_multi_agent_3_sycophancy_homo.sh
bash scripts_syco/run_cluster_multi_agent_3_sycophancy_heter.sh

bash scripts_syco/run_cluster_batch_mad.sh

Gather results from different combinations

python gather_results.py --output-dir output_dir

4. Control Agent Sycophancy by Persona Vectors

Set up the persona_vectors at scripts scripts_syco/run_batch_steering_multi_agent.sh Run the multi-agent testing by

bash scripts_syco/run_batch_steering_multi_agent.sh

5. Evaluation and Analysis

Evaluation

bash scripts_eval/run_evaluate_debater.sh
bash scripts_eval/run_evaluate_judge.sh

Analysis

bash scripts_eval/run_analyze.sh

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
analysis		analysis
clients		clients
conf		conf
scripts_api		scripts_api
scripts_eval		scripts_eval
scripts_local		scripts_local
scripts_syco		scripts_syco
utils		utils
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONFIG_README.md		CONFIG_README.md
CONTRIBUTING.md		CONTRIBUTING.md
FILE_README.md		FILE_README.md
LICENSE.txt		LICENSE.txt
README.md		README.md
__init__.py		__init__.py
accelerate_batch_judger.py		accelerate_batch_judger.py
accelerate_batch_main.py		accelerate_batch_main.py
accelerate_batch_steering_main.py		accelerate_batch_steering_main.py
blind_agreement_evaluator.py		blind_agreement_evaluator.py
blind_agreement_evaluator_all.py		blind_agreement_evaluator_all.py
blind_agreement_evaluator_judge.py		blind_agreement_evaluator_judge.py
concat_results.py		concat_results.py
config_schema.py		config_schema.py
dataclass.py		dataclass.py
dataloader.py		dataloader.py
evaluate.py		evaluate.py
evaluate_debater_all.py		evaluate_debater_all.py
evaluate_debater_bar.py		evaluate_debater_bar.py
evaluate_judge_all.py		evaluate_judge_all.py
gather_results.py		gather_results.py
judger_evaluator.py		judger_evaluator.py
mad_judger.py		mad_judger.py
mad_main.py		mad_main.py
main.py		main.py
py.typed		py.typed
requirements.txt		requirements.txt
round_analyzer.py		round_analyzer.py
sentence_similarity_evaluator.py		sentence_similarity_evaluator.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Peacemaker or Troublemaker: How Sycophancy Shapes Multi-Agent Debate

1. Environment Setup

2. Standard Debate

Test by calling APIs of OpenAI or Bedrock

Test by Batch Inference on Local GPUs (8*40G A100s)

3. Control Agent Sycophancy by System Prompts

Test by Batch Inference on Local GPUs (8*40G A100s)

4. Control Agent Sycophancy by Persona Vectors

5. Evaluation and Analysis

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

amazon-science/Multi-Agent-Sycophancy

Folders and files

Latest commit

History

Repository files navigation

Peacemaker or Troublemaker: How Sycophancy Shapes Multi-Agent Debate

1. Environment Setup

2. Standard Debate

Test by calling APIs of OpenAI or Bedrock

Test by Batch Inference on Local GPUs (8*40G A100s)

3. Control Agent Sycophancy by System Prompts

Test by Batch Inference on Local GPUs (8*40G A100s)

4. Control Agent Sycophancy by Persona Vectors

5. Evaluation and Analysis

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages