LMAC: LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning

🌐 Project page: https://saaangjun.github.io/LMAC/

This repository extends the original EPyMARL framework with an LLM-guided communication pipeline. All algorithmic components (controllers, learners, runners) and the general project layout still follow EPyMARL conventions.

Environment Setup

# 1) Create and activate the Conda environment (Python 3.9)
conda create -n epymarl-llm python=3.9 -y
conda activate epymarl-llm

# 2) Install the core MARL dependencies
pip install -r requirements.txt

# 3) Install any environment-specific extras (e.g., SC2, SMACv2, PettingZoo)
pip install -r env_requirements.txt

# 4) (Optional) Enable development tooling or notebook support
pip install -r dev_requirements.txt

LLM API Keys

Open src/llm_final.py and replace the placeholders with your own credentials:

# src/llm_final.py
OPENAI_KEY = "your-openai-key"
# GEMINI_KEY = "your-gemini-key"
# CLAUDE_KEY = "your-claude-key"
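
If you would rather not commit credentials to the file, one option is to read them from environment variables and fall back to the placeholder. This is a minimal sketch, not the repository's code: the helper names load_key and key_is_placeholder and the variable name OPENAI_API_KEY are assumptions made for illustration, and src/llm_final.py may expect the key in a different form.

```python
import os


def load_key(env_var: str, placeholder: str) -> str:
    """Return the value of env_var, or the placeholder when it is unset."""
    return os.environ.get(env_var, placeholder)


def key_is_placeholder(key: str) -> bool:
    """True when the key is empty or still looks like the 'your-...' placeholder."""
    return not key or key.startswith("your-")


# Hypothetical variable name; adjust to whatever you export in your shell.
OPENAI_KEY = load_key("OPENAI_API_KEY", "your-openai-key")

if key_is_placeholder(OPENAI_KEY):
    print("OPENAI_KEY is still a placeholder; set it before using the LLM pipeline.")
```

Checking for the placeholder up front gives a clear message before any LLM request is attempted.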

Weights & Biases Token

If you plan to log runs to W&B, store the API token locally:

echo "<your-wandb-key>" > wandb_key.txt
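
Note that echo appends a trailing newline to the file. The sketch below shows one way to read the token back safely and expose it through WANDB_API_KEY, the environment variable the wandb client reads. The function names are hypothetical, and how this repository itself consumes wandb_key.txt is not stated here, so treat this as an illustration only.

```python
import os
from pathlib import Path


def load_wandb_token(path: str = "wandb_key.txt") -> str:
    """Read the token file and strip the trailing newline written by echo."""
    token = Path(path).read_text(encoding="utf-8").strip()
    if not token:
        raise ValueError(f"{path} is empty")
    return token


def export_wandb_token(path: str = "wandb_key.txt") -> None:
    """Make the token available to the wandb client via WANDB_API_KEY."""
    os.environ["WANDB_API_KEY"] = load_wandb_token(path)
```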

Data Preparation

The src/llm_final.py script requires pre-collected trajectory data to be placed in specific directories. The script does not collect data itself; it strictly requires existing .pkl files.

Create a data directory in the project root if it doesn't exist.
Create a subdirectory matching your map_name (for SC2/GRF) or env_key (for LBF).
Place training data (.pkl files) directly inside this subdirectory.
Create a test subdirectory inside the map directory.
Place test data (.pkl files) inside the test subdirectory.
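
The directory steps above can be sketched in a few lines of Python. The helper name make_data_dirs is hypothetical; it only creates the folders and leaves copying the .pkl files to you.

```python
from pathlib import Path
from typing import Tuple


def make_data_dirs(project_root: str, name: str) -> Tuple[Path, Path]:
    """Create data/<name>/ and data/<name>/test/ under project_root.

    `name` is the map_name (SC2/GRF) or env_key (LBF).
    Returns (training directory, test directory).
    """
    buffer_dir = Path(project_root) / "data" / name
    test_dir = buffer_dir / "test"
    # parents=True also creates data/ and data/<name>/ when they are missing.
    test_dir.mkdir(parents=True, exist_ok=True)
    return buffer_dir, test_dir
```

For example, make_data_dirs(".", "1o_10b_vs_1r") run from the project root creates data/1o_10b_vs_1r/ and data/1o_10b_vs_1r/test/.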

Directory Structure Example (for SC2 map 1o_10b_vs_1r):

LMAC/Code/
├── data/
│   └── 1o_10b_vs_1r/          <-- BUFFER_DIR
│       ├── training_traj_1.pkl
│       ├── training_traj_2.pkl
│       ├── ...
│       └── test/              <-- TEST_BUFFER_DIR
│           ├── test_traj_1.pkl
│           ├── test_traj_2.pkl
│           └── ...

Note: The script checks both directories for .pkl files. Make sure each one contains enough of them: the default batch size needs at least 32 files, even though check_buffer_availability defaults to 10.
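
Before launching a run, you can count the files yourself. The sketch below is independent of the repository's own check_buffer_availability; the function names are hypothetical, and the threshold of 32 is taken from the batch-size note above.

```python
from pathlib import Path
from typing import Dict


def count_pkl(directory) -> int:
    """Count .pkl files directly inside directory (subfolders are not searched)."""
    return sum(1 for p in Path(directory).glob("*.pkl") if p.is_file())


def short_splits(buffer_dir, min_files: int = 32) -> Dict[str, int]:
    """Return the splits that hold fewer than min_files .pkl files.

    Keys are "train" (buffer_dir) and "test" (buffer_dir/test);
    an empty dict means both splits have enough data.
    """
    buffer_dir = Path(buffer_dir)
    counts = {
        "train": count_pkl(buffer_dir),
        "test": count_pkl(buffer_dir / "test"),
    }
    return {split: n for split, n in counts.items() if n < min_files}
```

Calling short_splits("data/1o_10b_vs_1r") returns an empty dict when both directories are ready, or the file counts of whichever split still needs more trajectories.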

How to Run

CUDA_VISIBLE_DEVICES=0 python src/llm_final.py \
  --config=lmac \
  --env=sc2 \
  --map=1o_10b_vs_1r \
  --mse_thres=0.05 \
  --meta_lambda=0.1 \
  --recon_lambda=1 \
  --consistency_lambda=1

Key flags:

--config, --env, --map: select the algorithm config, the environment, and the SC2 map (replace with your target setup).
--mse_thres, --meta_lambda, --recon_lambda, --consistency_lambda: tune discriminator and training hyperparameters.
If you rely on the LLM pipeline, export your API keys or edit them in src/llm_final.py before running.
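
When launching the same command for several maps or hyperparameter values, it can help to build the argument list programmatically. This sketch only uses the flags shown above with their example values; build_command is a hypothetical helper, not part of the repository.

```python
from typing import Dict, List, Optional


def build_command(map_name: str, overrides: Optional[Dict[str, float]] = None) -> List[str]:
    """Build the argument list for src/llm_final.py with the example flags above."""
    flags = {
        "mse_thres": 0.05,
        "meta_lambda": 0.1,
        "recon_lambda": 1,
        "consistency_lambda": 1,
    }
    flags.update(overrides or {})
    cmd = ["python", "src/llm_final.py", "--config=lmac", "--env=sc2", f"--map={map_name}"]
    cmd += [f"--{name}={value}" for name, value in flags.items()]
    return cmd
```

The resulting list can be passed to subprocess.run(cmd, check=True), with CUDA_VISIBLE_DEVICES set in the environment as in the shell example.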

References

EPyMARL original paper and repository: https://github.com/uoe-agents/epymarl
SMAC / SMACv2 environments: https://github.com/oxwhirl/smac and https://github.com/oxwhirl/smacv2
PettingZoo cooperative benchmarks: https://pettingzoo.farama.org/
