Melissa implements a Multi-Agent Reinforcement Learning (MARL) environment for message dissemination. The environment is structured as a graph in which each node is an agent, and the number of "active" agents varies as the information is disseminated: at the beginning of an episode a source node emits a message, and an agent is considered "active" only once it has received the information. Once an agent has taken a pre-defined number of timesteps, its experience terminates. Agents make decisions based only on the features and behavior of their one-hop neighborhood.
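As a rough sketch of this episode flow, the loop below follows the standard PettingZoo agent-iteration pattern (the API the environment adopts, as noted below). The `make_env` helper and its argument are hypothetical placeholders for illustration only, not the repository's actual entry point, and the snippet assumes a recent PettingZoo release (five-value `last()` signature).

```python
# Hypothetical constructor; the real environment is defined under graph_env/env.
env = make_env(n_agents=20)
env.reset(seed=9)

# Only agents that have received the message are active, so the set of
# acting agents is expected to grow as the information spreads.
for agent in env.agent_iter():
    obs, reward, terminated, truncated, info = env.last()
    if terminated or truncated:
        action = None  # this agent has used up its pre-defined number of timesteps
    else:
        action = env.action_space(agent).sample()  # replace with a trained policy
    env.step(action)
env.close()
```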
While different policies can be defined, the repository currently provides three learning algorithms based on Graph Convolutional Reinforcement Learning: DGN-R, L-DGN, and HL-DGN.
The framework is written in Python and is based on PyTorch. It implements a customized extension of Tianshou and defines the MARL environment following the PettingZoo API. The GAT and global max pooling layers use the implementations provided by PyTorch Geometric. Training and testing graphs were generated using the NetworkX library.
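To make these building blocks concrete, the snippet below generates a small connected topology with NetworkX and runs a PyTorch Geometric GAT layer followed by global max pooling over it. The graph generator, feature sizes, and layer dimensions are illustrative choices, not the configuration used by the repository's networks.

```python
import networkx as nx
import torch
from torch_geometric.nn import GATConv, global_max_pool
from torch_geometric.utils import from_networkx

# Illustrative connected topology (the actual datasets are generated separately).
g = nx.connected_watts_strogatz_graph(n=20, k=4, p=0.3, seed=9)
data = from_networkx(g)
data.x = torch.randn(g.number_of_nodes(), 8)  # placeholder node features

# Graph attention over one-hop neighborhoods.
gat = GATConv(in_channels=8, out_channels=16, heads=2, concat=False)
h = gat(data.x, data.edge_index)

# Global max pooling collapses node embeddings into one graph-level vector.
batch = torch.zeros(g.number_of_nodes(), dtype=torch.long)
readout = global_max_pool(h, batch)  # shape: [1, 16]
```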
- The main components of our framework can be found in the `graph_env/env` folder.
- Environment dynamics are defined in:
  - `graph_env/env/graph.py`
  - `graph_env/env/utils/core.py`
- Networks and policy implementations can be found in:
  - `graph_env/env/utils/networks`
  - `graph_env/env/utils/policies`
- Build the Docker image: `docker build -t melissa .`
- For machines without a GPU, or Apple Mac devices (including those with Apple M-series SoCs), build from the CPU Dockerfile: `docker build -t melissa -f Dockerfile.cpu .`
- Run the container: `docker run --ipc=host --gpus all -v ${PWD}:/home/devuser/dev:Z -it --rm melissa`
- (Optional) Visualization: to visualize agents in action on testing graphs, use the following command: `docker run --ipc=host --gpus all -e DISPLAY=unix$DISPLAY -v /tmp/.X11-unix:/tmp/.X11-unix -v ${PWD}:/home/devuser/dev:Z -it --rm melissa`
Alternatively, you can install the requirements in a Python (≥ 3.8) virtual environment: `pip install -r requirements.txt`
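For example (the environment name is arbitrary, and `python3` is assumed to point to a 3.8+ interpreter):

```bash
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```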
Trained models are saved in `log/algorithm_name/weights` as `model_name_last.pth`. Before training, you'll be prompted to log training data with the Weights & Biases (wandb) logging tool.
- DGN-R: `python train_dgn_r.py --model-name DGN-R`
- L-DGN: `python train_l_dgn.py --model-name L-DGN`
- HL-DGN: `python train_hl_dgn.py --model-name HL-DGN`
By default, the seed is set to 9. You can change this value using `--seed X`, where X is your chosen seed.
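For example, to train DGN-R with a different seed (the value 42 is arbitrary): `python train_dgn_r.py --model-name DGN-R --seed 42`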
Trained models can be found in their respective subfolders of the `/log` folder. To reproduce results:

- DGN-R: `python train_dgn_r.py --watch --model-name DGN-R.pth`
- L-DGN: `python train_l_dgn.py --watch --model-name L-DGN.pth`
- HL-DGN: `python train_hl_dgn.py --watch --model-name HL-DGN.pth`
- MPR Heuristic: `python train_hl_dgn.py --watch --model-name HL-DGN.pth --mpr-policy`
We've generated two datasets containing connected graph topologies: the first has 20 nodes per graph, and the second has 50. The training and testing sets of each dataset contain 50K and 100 graphs, respectively. You can switch between the two numbers of agents (20/50) using the `--n-agents` argument (default is 20).
All topologies can be downloaded here. The compressed folder should be unzipped in the root directory of the project.
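For example, to train L-DGN on the 50-node topologies: `python train_l_dgn.py --model-name L-DGN --n-agents 50`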
To run an automated hyperparameter study with Optuna, run a training experiment specifying the `--optimize` flag. The results of the study will be saved in the `hyp_studies` folder. You can specify the maximum number of trials with the `--trials` flag and the maximum number of epochs per trial with the `--epoch` flag.
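For example, a study with at most 50 trials of at most 20 epochs each (both values are arbitrary): `python train_dgn_r.py --optimize --trials 50 --epoch 20`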