Hypothetical-Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models

Overview

Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module that scaffolds the high-level planning process by generating, evaluating, and refining hypotheses about other agents’ strategies in natural language.

To learn more:

Installation

Install MeltingPot in editable mode from https://github.com/locross93/meltingpot, then install this repo

pip install -e .

Set up your API key with an environment variable:

export OPENAI_API_KEY=your_api_key_here

Running Hypothetical Minds and Baselines

To run an episode of Hypothetical Minds, use main.py as in the following example with "Running With Scissors Repeated":

python main.py --substrate rws --scenario_num 0 --agent_type hm --llm_type gpt4

To loop through every scenario in a substrate, use run_scenarios.py as in the following example running the Reflexion baseline on "Collaborative Cooking Asymmetric":

python run_scenarios.py --agent reflexion --substrate cc --num_seeds 5

Substrates

Alias	Description
`cc`	`collaborative_cooking__asymmetric` - In this environment, two players operate on opposite sides of a divided kitchen, where they must collaborate to efficiently prepare tomato soup, with each player specializing in tasks based on their proximity to resources.
`rws`	`running_with_scissors_in_the_matrix__repeated` - A zero-sum competitive environment where two players navigate a map collecting resources represented as yellow (rock), purple (paper), or blue (scissors). Players can "zap" each other to initiate a rock-paper-scissors style interaction based on their collected resources, resulting in one player receiving a positive reward and the other a corresponding negative reward.
`rws_arena`	`running_with_scissors_in_the_matrix__arena` - An eight player extension of RWS, where the focal agent controls one player against a background population of 7 strategies.
`pd`	`prisoners_dilemma_in_the_matrix__repeated` - Agents navigate a map similar to RWS, where they collect resources that correspond to cooperation or defection, reflecting the choices in the iterated prisoner’s dilemma game.

Using Open Source Models

To run open source models like LLaMA 3, you need to set up vllm first.

Install vllm:

pip install vllm

Start the vllm server:

CUDA_VISIBLE_DEVICES=0,1,2,3 python -m vllm.entrypoints.openai.api_server --model meta-llama/Meta-Llama-3-70B-Instruct --port 8000 --tensor-parallel-size 4 --seed 1234

Run the agent with LLaMA 3:

python main.py --substrate rws --scenario_num 0 --agent_type hm --llm_type llama3

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
environments		environments
llm_plan		llm_plan
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
all_games.gif		all_games.gif
create_videos.sh		create_videos.sh
environment.yml		environment.yml
main.py		main.py
requirements.txt		requirements.txt
run_scenarios.py		run_scenarios.py
setup.py		setup.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hypothetical-Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models

Overview

Installation

Running Hypothetical Minds and Baselines

Substrates

Using Open Source Models

About

Releases

Packages

Languages

License

locross93/Hypothetical-Minds

Folders and files

Latest commit

History

Repository files navigation

Hypothetical-Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models

Overview

Installation

Running Hypothetical Minds and Baselines

Substrates

Using Open Source Models

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages